Stock Prediction by Polyglot Sentiment Analysis on Twitter

Authors

  • Yizhou Wang Marianopolis College

DOI:

https://doi.org/10.47611/jsr.v12i2.1910

Keywords:

Sentiment Analysis, Natural Language Processing, Twitter, Stock Prediction, Polyglot, Desjardins

Abstract

Research in the economics field has extensively documented the impact of media sentiments on the stock market. Sentiment analysis, as a tool to predict equity prices, has been popularized in the past years. Recently, Twitter has received a lot of attention due to the diversity of opinions on social media platforms. A common obstruction to sentiment analysis is the resource gap between English and other languages. This pilot study examines the effect of polyglotism in tweets concerning bilingual companies and develops a model that extracts sentiments from polyglot tweets to make price predictions. Results suggest that taking non-English tweets into consideration decreases errors in price predictions and that the random forest models have higher performance than linear regression models. The results of this pilot study need to be confirmed with larger sets of data.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

References or Bibliography

Tetlock, Paul C. (2005) Giving Content to Investor Sentiment: The Role of Media in the Stock Market. Journal of Finance. https://ssrn.com/abstract=685145

Jin, Z., Yang, Y., & Liu, Y. (2020). Stock closing price prediction based on sentiment analysis and LSTM. Neural Computing & Applications, 32(13), 9713–9729. https://doi-org.ezproxy.marianopolis.edu/10.1007/s00521-019-04504-2

Mendoza-Urdiales, R. A., Núñez-Mora, J. A., Santillán-Salgado, R. J., & Valencia-Herrera, H. (2022). Twitter Sentiment Analysis and Influence on Stock Performance Using Transfer Entropy and EGARCH Methods. Entropy, 24(7), N.PAG. https://doi-org.ezproxy.marianopolis.edu/10.3390/e24070874

Kim, J., Jung, H.-Y., Lee, Y., & Lee, J.-H. (2009). Conveying Subjectivity of a Lexicon of One Language into Another Using a Bilingual Dictionary and a Link Analysis Algorithm. International Journal of Computer Processing of Languages, 22(2/3), 205–218. https://doi-org.ezproxy.marianopolis.edu/10.1142/S1793840609002044

Tellez, E. S., Miranda-Jiménez, S., Graff, M., Moctezuma, D., Suárez, R. R., & Siordia, O. S. (2017). A simple approach to multilingual polarity classification in Twitter. Pattern Recognition Letters, 94, 68–74. https://doi-org.ezproxy.marianopolis.edu/10.1016/j.patrec.2017.05.024

Xiu, D. (2022, November 8). Expected Returns and Foundation Models of Language. Seminar presented at the meeting of NYU Courant Mathematics Department.

Statistics Canada. (2011) Visual Census. https://www12.statcan.gc.ca/census-recensement/2011/dp-pd/vc-rv/index.cfm?Lang=ENG&VIEW=D&GEOCODE=462&TOPIC_ID=4

Published

05-31-2023

How to Cite

Wang, Y. (2023). Stock Prediction by Polyglot Sentiment Analysis on Twitter. Journal of Student Research, 12(2). https://doi.org/10.47611/jsr.v12i2.1910

Issue

Section

Research Articles