Information system for analyzing public sentiment in web platforms based on machine learning

Authors

  • Dmytro I. Uhryn Yuriy Fedkovych Chernivtsi National University, 2, Kotsyubynsky Str. Chernivtsi, 58002, Ukraine
  • Artem O. Karachevtsev Yuriy Fedkovych Chernivtsi National University, 2, Kotsyubynsky Str. Chernivtsi, 58002, Ukraine
  • Yurii Ya. Tomka Yuriy Fedkovych Chernivtsi National University, 2, Kotsyubynsky Str. Chernivtsi, 58002, Ukraine
  • Mykyta M. Zakharov Yuriy Fedkovych Chernivtsi National University, 2, Kotsyubynsky Str. Chernivtsi, 58002, Ukraine
  • Yuliia L. Troianovska Odessa Polytechnic National University. 1, Shevchenko Ave. Odessa, Ukraine

DOI:

https://doi.org/10.15276/hait.07.2024.14

Keywords:

Web platform, information system, public mood, propaganda, disinformation, fake, message, text, data mining, artificial intelligence, machine learning

Abstract

The systems for studying public sentiment in web platforms are analyzed. Various tools and methods for effectively determining the mood in textual data from web platforms are described, including the formalization of the social graph and the content graph. The process of classifying comments, which includes the systematization and categorization of statements, is investigated. Based on the studied dataset, information on customer reviews and hotel ratings in Europe from the booking.com web platform is selected. Taking into account the requirements of the information system and the results of the analysis, it is determined that in order to obtain better results in determining the emotional connotation of the texts of reviews and messages from users, the most appropriate is the use of machine learning methods, taking into account natural language methods for processing text data. When choosing a text vectorization method for machine learning, the Term Frequency Inverse Document Frequency Vectorizer was chosen as the most effective among the studied methods. The architectural structure of the studied system is proposed, which is aimed at effective interaction between components and modules. The LogisticRegression model is chosen to determine the public mood. An information system has been developed that analyzes public sentiment about objects, uses advanced machine learning technologies to assess the emotional connotation of text comments, and provides users with insights and analysis of the results.

Downloads

Download data is not yet available.

Author Biographies

Dmytro I. Uhryn, Yuriy Fedkovych Chernivtsi National University, 2, Kotsyubynsky Str. Chernivtsi, 58002, Ukraine

Doctor of Engineering Sciences, Associate professor, Computer Science Department

Scopus Author ID: 57163746300

Artem O. Karachevtsev, Yuriy Fedkovych Chernivtsi National University, 2, Kotsyubynsky Str. Chernivtsi, 58002, Ukraine

PhD (Physical and Mathematical Sciences), Assistant Professor, Computer Science Department

Scopus Author ID: 36925155800

Yurii Ya. Tomka, Yuriy Fedkovych Chernivtsi National University, 2, Kotsyubynsky Str. Chernivtsi, 58002, Ukraine

PhD (Physical and Mathematical Sciences), Associate Professor, Computer Science Department

Scopus Author ID: 9279702200

Mykyta M. Zakharov , Yuriy Fedkovych Chernivtsi National University, 2, Kotsyubynsky Str. Chernivtsi, 58002, Ukraine

PhD student, Computer Science Department

Yuliia L. Troianovska, Odessa Polytechnic National University. 1, Shevchenko Ave. Odessa, Ukraine

Senior Lecturer, Information System Department

Scopus Author ID: 57211747293

Downloads

Published

2024-05-15

How to Cite

Uhryn, D. I., Karachevtsev, A. O., Tomka, Y. Y. ., Zakharov , M. M., & Troianovska, Y. L. . (2024). Information system for analyzing public sentiment in web platforms based on machine learning. Herald of Advanced Information Technology, 7(2), 199–212. https://doi.org/10.15276/hait.07.2024.14