doi: 10.17586/2226-1494-2020-20-1-118-124


MACHINE LEARNING METHODS FOR FORECASTING OF SOCIAL NETWORK USERS’ REACTION

E. P. Popova, V. N. Leonenko


Read the full article  ';
Article in Russian

For citation:
Popova E.P., Leonenko V.N. Machine learning methods for forecasting of social network users’ reaction. Scientific and Technical Journal of Information Technologies, Mechanics and Optics, 2020, vol. 20, no. 1, pp. 118–124 (in Russian). doi: 10.17586/2226-1494-2020-20-1-118-124


Abstract
Subject of Research. The paper presents comparative analysis of forecasting methods for users’ response to a message emerged on social media and application of machine learning. The method that provides the highest accuracy is preferred. Method. The following machine learning methods were applied: support vector method, gradient boosting methodrandom forest and multilayer perceptron. The statistical method of regression analysis — linear regression — was used as a reference forecasting method. Vectorization of records for obtaining quantitative characteristics of their content was carried out using such methods as: “Bag of Words”, TF-IDF and Word2Vec. The forecast quality was evaluated by R2 determination coefficient. Main Results. A numerical experiment was performed using a data set collected on the VKontakte social network. The set contained information about community subscribers, publications, “I like” and “tell friends” marks and comments on publications. The number of marks and comments under the posted publication was projected, depending on its content. The most accurate results were obtained when predicting the number of comments. The quality of forecasts for the number of “I like” and “tell friends” marks turned out to be lower. Practical Relevance. The results of the work can be used in analyzing the effect of various news, including fake news, on users of social networks. The development of forecasting methods provides the planning of measures for acceleration or containment of the messaging distribution.

Keywords: social networks, reaction forecasting, natural language processing, machine learning, regression analysis

Acknowledgements. The study was supported by the Russian Science Foundation (agreement No. 19-11-00326).

References
1. Sukhodolov A.P., Bychkova A.M. Fake News as a Modern Media Phenomenon: Definition, Types, Role of Fake News and Ways of counteracting it. Theoretical and practical issues of journalism, 2017, vol. 6, no 2, pp. 143–169. (in Russian). doi: 10.17150/2308-6203.2017.6(2).143-169
2. Li M., Wang X., Gao K., Zhang S. A survey on information diffusion in online social networks: Models and methods. Information (Switzerland), 2017, vol. 8, no. 4, pp. 118. doi: 10.3390/info8040118
3. Vega-Oliveros D.A., Berton L., Vazquez F., Rodrigues F.A. The impact of social curiosity on information spreading on networks. Proc. 9th IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2017, pp. 459–466. doi: 10.1145/3110025.3110039
4. Cyberemotions: Collective Emotions in Cyberspace / Eb. by J.A. Holyst. Berlin, Springer International Publishing, 2017, 318 p. doi: 10.1007/978-3-319-43639-5
5. Rodrigues H.S. Application of SIR epidemiological model: new trends. International Journal of Applied Mathematics and Informatics, 2016, vol. 10, pp. 92–97.
6. Mann R.P., Faria J., Sumpter D., Krause J. The dynamics of audience applause. Journal of the Royal Society Interface, 2013, vol. 10, no. 85, pp. 20130466. doi: 10.1098/rsif.2013.0466
7. Kostylenko O., Rodrigues H.S. Torres D.F.M. Banking risk as an epidemiological model: an optimal control approach. Springer Proceedings in Mathematics and Statistics, 2018, vol. 223, pp. 165– 176. doi: 10.1007/978-3-319-71583-4_12
8. Horkovenko D.K. Overview of the models of information distribution in social networks. Young Scientist, 2017, no. 8, pp. 23–28. (in Russian)
9. Lande D., Dodonov V. Nonlinear properties of agent-based news distribution model. Information Technology and Security, 2016, vol. 4, no. 2, pp. 137–146. (in Russian)
10. Lande D.V., Hraivoronska A.M., Berezin B.O. Model of information spread in social networks. European Journal of Natural History, 2016, no. 5, pp. 41–44.
11. Shotylo D.M., Kraynova V.E., Skurydin A.V. Trends in the development of artificial neural networks in digital economy. EKONOMINFO, 2018, vol. 15, no. 4, pp. 65–69. (in Russian)
12. Seber G.A.F., Lee A.J. Linear Regression Analysis. 2nd ed. New Jersey, John Wiley & Sons, 2003, 582 p., Wiley Series in Probability and Statistics.
13. Liu X., Zhu T. Deep learning for constructing microblog behavior representation to identify social media user’s personality. PeerJ Computer Science, 2016, vol. 2, pp. e81. doi: 10.7717/peerj-cs.81
14. Viugin V.V. Mathematical foundations of machine learning and forecasting theory. Moscow, MCCME Publ., 2013, 390 p. (in Russian)
15. Jamali S., Rangwala H. Digging digg: comment mining, popularity prediction, and social network analysis. Proc. of the 2009 International Conference on Web Information Systems and Mining (WISM), 2009, P. 32–38. doi: 10.1109/WISM.2009.15
16. Zhou Z.-H. Ensemble Methods: Foundations and Algorithms.
Chapman&Hall/CRS, 2012, 236 p.
17. Hastie T., Tibshirani R., Friedman J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. 2nd ed. NY, Springer-Verlag, 2009, 746 p. doi: 10.1007/978-0-387-84858-7
18. Sheng Y., Subhash K. A Survey of Prediction Using Social Media. Department of Computer Science, Oklahoma State University. Stillwater, Oklahoma, U.S.A. Available at: https://arxiv.org/ abs/1203.1647 (accessed: 19.08.2019).
19. LeCun Y., Boser B., Denker J.S., Henderson D., Howard R.E., Hubbard W., Jackel L.D. Backpropagation Applied to Handwritten Zip Code Recognition. Neural Computation, 1989, vol. 1, no. 4, pp. 541–551. doi: 10.1162/neco.1989.1.4.541
20. Cai G., Xia B. Convolutional neural networks for multimedia sentiment analysis. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2015, vol. 9362, pp. 159–167. doi: 10.1007/978- 3-319-25207-0_14
21. Rosenblatt F. Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms. Washington, Spartan Books, 1961, 616 p.
22. Kesarev S., Severiukhina O., Bochenina K. Parallel simulation of community-wide information spreading in online social networks.
Communications in Computer and Information Science, 2019, vol. 965, pp. 136–148. doi: 10.1007/978-3-030-05807-4_12
23. Gali K., Venkatapathy S. Sentence realisation from bag of words with dependency constraints. Proc. of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, USA, 2009, pp. 19–24.
24. Shahzad Q., Ramsha A. Text mining: Use of TF-IDF to examine the relevance of words to documents. International Journal of Computer Applications, 2018, vol. 181, no. 1, pp. 25–29. doi: 10.5120/ ijca2018917395
25. Jang B., Kim I., Kim J.W. Word2vec convolutional neural networks for classification of news articles and tweets. PLoS ONE, 2019, vol. 14, no. 8, pp. e0220976 doi: 10.1371/journal.pone.0220976
 


Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License
Copyright 2001-2024 ©
Scientific and Technical Journal
of Information Technologies, Mechanics and Optics.
All rights reserved.

Яндекс.Метрика