Natural Language Processing used in sentiment analysis of poetry: a study of six common techniques

Authors

  • Tanya Nair Eastlake High School

DOI:

https://doi.org/10.47611/jsrhs.v12i2.4418

Keywords:

Natural Language Processing, Poetry Analysis, Sentiment Analysis, unigram, bigram, word embedding, Mood Recognition, topic modeling, dictionaries, distributed dictionaries

Abstract

This paper explores and compares the accuracies of six different NLP model techniques when applied to analysis of English poetry. It was found that bigram-based and word embedding-based models were the most accurate in deriving emotions from the bodies of text in the corpus, with respective accuracies of 61.67% and 61.68%. The least accurate models were the unigram-based model and distributed dictionary-based model, constructed using the traditional approach rather than the new methodology, and had accuracies of 11.173% and 48.17% respectively. Most models passed the benchmark accuracies of 49.42% and 47.86%, the higher accuracy one being a word count model and lower one being a sentiment model. The need for newer methodologies that allow for higher dimensional levels of semantic analysis to be performed is also discussed in this paper, along with the potential impacts of this research to the field of Natural Language Processing.

Downloads

Download data is not yet available.

References or Bibliography

Bird, S., Klein, E., & Loper, E. (2009). Natural language processing with Python: analyzing text with the natural language toolkit. " O'Reilly Media, Inc.".

Hirschberg, J. & Manning, C. D. (2015). Advances in natural language processing. Science, 349 ( 6245), 261-266. https://doi.org/10.1126/science.aaa8685

Pandya, Marmik. (2016). NLP based Poetry Analysis and Generation. http://dx.doi.org/10.13140/RG.2.2.35878.73285.

Ahmed, M.A., & Trausan-Matu, S. (2017). Using natural language processing for analyzing Arabic poetry rhythm. 2017 16th RoEduNet Conference: Networking in Education and Research (RoEduNet), 1-5. https://doi.org/10.1109/ROEDUNET.2017.8123759

Kao, J., & Jurafsky, D. (2012, June). A computational analysis of style, affect, and imagery in contemporary poetry. In Proceedings of the NAACL-HLT 2012 workshop on computational linguistics for literature (pp. 8-17).

P S, SREEJA & G S, Mahalakshmi. (2017). Emotion Models: A Review. International Journal of Control Theory and Applications. 10. 651-657.

Rice, D., & Zorn, C. (2021). Corpus-based dictionaries for sentiment analysis of specialized vocabularies. Political Science Research and Methods, 9(1), 20-35. doi:10.1017/psrm.2019.10

Published

05-31-2023

How to Cite

Nair, T. (2023). Natural Language Processing used in sentiment analysis of poetry: a study of six common techniques. Journal of Student Research, 12(2). https://doi.org/10.47611/jsrhs.v12i2.4418

Issue

Section

HS Research Projects