doi: 10.17586/2226-1494-2022-22-6-1159-1165


Application of the text wave model to the sentiment analysis problem

A. S. Gruzdeva, R. N. Iurev, I. A. Bessmertny


Read the full article  ';
Article in Russian

For citation:

Gruzdeva A.S., Iurev R.N., Bessmertny I.A. Application of the text wave model to the sentiment analysis problem. Scientific and Technical Journal of Information Technologies, Mechanics and Optics, 2022, vol. 22, no. 6, pp. 1159–1165 (in Russian). doi: 10.17586/2226-1494-2022-22-6-1159-1165



Abstract
Authors researched the wave model of text representation which is one of the implementations of distributive semantics. This model takes into account not only the frequency of words occurrence in the text, but also their mutual location. The purpose of the study: to increase the accuracy of the analysis of the tonality of short texts based on the wave model. The method of determining the relationship between the text and the term is based on the calculation of the probability amplitude of the text and term proximity using a wave model. The term with the highest probability amplitude is considered to correspond most closely to the meaning of the text. The wave model allowed taking into account the fact that well-known methods define antonyms as semantically close lexical units. For the experimental study of this technique, a solution to the problem of sentiment analysis was chosen, exactly, finding the correspondence of user reviews about the product to the classes “positive” and “negative”. As a result, the accuracy of the text tonality defining was obtained up to 76.4 %, which exceeds the accuracy of the classical approach as well as the well-known methods of sentiment analysis for the Russian language. In addition, authors detected significant influence on classification accuracy of such model parameters as the choice of a basic distributive semantic model, the choice of a control point for calculating wave numbers, taking into account the influence of antonyms. The presented model has shown high accuracy in identifying the relationships of the text with concepts that are not explicitly present in it and can be successfully used as a mathematical basis for solving problems of sentiment analysis. In addition, the results obtained indicate the potential use of the wave model in other areas that require the classification of texts by indirect signs, for example, to determine the elements of author psychological portrait.

Keywords: sentiment analysis, classification, natural language processing, wave model, quantum-like model

Acknowledgements. The work was carried out within the framework of the project no. 620164 (Artificial intelligence methods for cyberphysical systems).

References
  1. Garg N., Sharma K. Text pre-processing of multilingual for sentiment analysis based on social network data. International Journal of Electrical and Computer Engineering, 2022, vol. 12, no. 1, pp. 776–784. https://doi.org/10.11591/ijece.v12i1.pp776-784
  2. Korney A.O., Kryuchkova E.N. Performance aspects of the russian texts sentiment analysis in social networks. High-perfomance computing systems and technologies, 2018, vol. 2, no. 2, pp. 87–92. (in Russian)
  3. Pazelskaia A.G., Solovev A.N. Method for determining emotions in Russian texts. Proc. of the International conference on computational linguistics and intellectual technologies “Dialogue 2011”. Moscow, RSUH, 2011, pp. 510–522. (in Russian)
  4. KlekovkinaM., KotelnikovE. The automatics entimentt extclassificat ion method based on emotional vocabulary. Digital Libraries: Advanced Methods and Technologies. Proceedings of the RCDL 2012, 2012, pp. 118–123. (inRussian)
  5. BessmertnyiI.A., DzhaliashviliZ.O., MaksimovV.V., MarkinD.A. Linguistic evaluation of the text. Abstracts of the X International Conference "Application of New Technologies in Education", Troitsk, BYTIC, 1999. (in Russian)
  6. Harris Z. Mathematical Structures of Language. Wiley, 1968, 230 p.
  7. BogdanovA.L., DulyaI.S. Sentimentanalysis of shor tRussian texts inso cial media. Tomsk State University Journal of Economics, 2019, no. 47, pp. 220–241. (in Russian). https://doi.org/10.17223/19988648/47/17
  8. Riabykin A.S., Sukhov E.A.Neural network methods in the Sentiment Analysis problem. DSPA, 2022, vol. 2, no. 2, pp. 41–57. (in Russian)
  9. VorobyovA.A., RybakA.M., SeredkinR.A., AndrosovA.Yu, SolovyovB.I. Method sof collecting and processing sociological information from theI nternet. Izvestiya Tula State University. Technical Sciences, 2022, no. 2, pp. 208–213. (in Russian). https://doi.org/10.24412/2071-6168-2022-2-208-214
  10. Fomina I.K., Tataurova A.S. Increasing the efficiency of the classification model for determining the text key. Aktual’nye problemy ekonomiki i upravleniya, 2022, no. 1, pp. 55–58. (in Russian)
  11. Zuccon G., Azzopardi L., van Rijsbergen K. The quantum probability ranking principle for information retrieval. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics),2009,vol. 5766, pp. 232–240.https://doi.org/10.1007/978-3-642-04417-5_21
  12. Melucci M. Introduction to Information Retrieval and Quantum Mechanics. Springer-Verlag Berlin Heidelberg, 2015, 247 р. https://doi.org/10.1007/978-3-662-48313-8
  13. KhrennikovA. Introduction to QuantumI nformationTheory. Moscow, FizmatlitPubl., 2008, 283 p. (in Russian)
  14. Blacoe W., Kashefi E., Lapata M. A quantum-theoretic approach to distributional semantics. Proc. of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies(NAACL HLT), 2013, pp. 847–857.
  15. Jaiswal A.K., Holdack G., Frommholz I., Liu H. Quantum-like Generalization of Complex Word Embedding: a lightweight approach for textual classification. CEUR Workshop Proceedings,2018, vol. 2191, pp. 159–168.
  16. SurovI.A., Semenenko E., Platonov A.V., Bessmertny I.A., Galofaro F., Toffano Z., Khrennikov A.Yu., Alodjants A.P. Quantum semantics oft extperception. Scientific Reports, 2021,vol. 11, no. 1, pp. 4193. https://doi.org/10.1038/s41598-021-83490-9
  17. GruzdevaA.S., BessmertnyI.A Classification of short texts using awave model. Scientific and Technical Journal of Information Technologies, Mechanics and Optics, 2022, vol. 22, no. 2, pp. 287–293. (inRussian). https://doi.org/10.17586/2226-1494-2022-22-2-287-293
  18. Frenkel Ia.I.Wave Mechanics. Part I. Elementary Theory. Quantum physics. Moscow, 2019, 386 p. (in Russian)
  19. KutuzovA., KuzmenkoE. Webvectors: At oolkitf orbuilding web interfac esforve ctorse mantic models. Communications in Computer and Information S cience,2017,vol. 661,pp. 155–161. https://doi.org/10.1007/978-3-319-52920-2_15
  20. Panchenko A., Ustalov D., Arefyev N., Paperno D., Konstantinova N., Loukachevitch N.,Biemann C. Human and machine judgements about russian semantic relatedness.Communications in Computer and Information Science, 2016, vol. 661, pp. 221–235. https://doi.org/10.1007/978-3-319-52920-2_21
  21. Pang B., Lee L. A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. Proc. of the 42nd Annual Meeting Association for Computational Linguistics (ACL),2004, pp. 271–278.https://doi.org/10.3115/1218955.1218990
  22. Menshikov I. Sentiment analysis of the text in Russian using graph models. Reports of the All-Russian Scientific Conference "Analysis of Images, Networks and Texts, AINT (AIST)", 2013, pp. 151-155. (in Russian)
  23. González F.A., Caicedo J.C. Quantum latent semantic analysis.Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics),2011, vol. 6931, pp. 52–63.https://doi.org/10.1007/978-3-642-23318-0_7
  24. Dalla Chiara M.L., Giuntini R., Luciani A.R., Negri E.A quantum-like semantic analysis of ambiguity in music.Soft Computing,2017, vol. 21,no. 6, pp. 1473–1481.https://doi.org/10.1007/s00500-015-1895-y


Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License
Copyright 2001-2024 ©
Scientific and Technical Journal
of Information Technologies, Mechanics and Optics.
All rights reserved.

Яндекс.Метрика