Classification and Prediction of Opinion Mining in Social Networks Data

Mohamed, Shaimaa Mahmoud and Hussien, Mahmoud and Keshk, Arabi (2020) Classification and Prediction of Opinion Mining in Social Networks Data. IJCI. International Journal of Computers and Information, 7 (1). pp. 32-41. ISSN 1687-7853

Abstract

Opinion mining in social network data is considered one of the most significant and challenging tasks today because of the huge volume of information distributed each day. We can profit from these opinions through two important procedures: classification and prediction. Although many researchers have worked on this problem, it still needs improvement. Therefore, in this paper, we present a method to improve the accuracy of both processes. The improvement comes from cleaning the data set: converting all words to lower case; removing usernames, mentions, links, repeated characters, numbers, runs of more than two spaces between words, empty tweets, punctuation, and stop words; and expanding contractions such as “isn't” to “is not”. We use both unigrams and bigrams as features. Our data set contains users' feelings about distributed products: tweets labeled positive or negative, and a rating from one to five for each product. We implemented this work using several supervised machine learning algorithms: Naïve Bayes, Support Vector Machine, and MaxEntropy for the classification process, and Random Forest Regression, Logistic Regression, and Support Vector Regression for the prediction process. We obtain better accuracy in both processes than existing works. In classification, we achieved an accuracy of 90%. In prediction, the Support Vector Regression model predicts future product ratings with a Mean Squared Error (MSE) of 0.4122, the Logistic Regression model with an MSE of 0.4986, and the Random Forest Regression model with an MSE of 0.4770.
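
The cleaning steps and n-gram features listed in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names (clean_tweet, ngram_features, preprocess), the tiny stop-word list, the contraction map, and the regular expressions are all assumptions made for illustration, and the order of the steps is one plausible choice among several.

```python
import re

# Hypothetical stop-word list and contraction map, for illustration only;
# the paper does not say which lists were used.
STOP_WORDS = {"the", "a", "an", "this", "to", "and", "of", "it"}
CONTRACTIONS = {"isn't": "is not", "don't": "do not", "wasn't": "was not"}


def clean_tweet(text):
    """Apply the cleaning steps named in the abstract to one tweet."""
    text = text.lower()                                  # lower-case everything
    text = text.replace("\u2019", "'")                   # normalise curly apostrophes
    text = re.sub(r"https?://\S+|www\.\S+", " ", text)   # remove links
    text = re.sub(r"@\w+", " ", text)                    # remove usernames / mentions
    for short, full in CONTRACTIONS.items():             # "isn't" -> "is not"
        text = text.replace(short, full)
    text = re.sub(r"\d+", " ", text)                     # remove numbers
    text = re.sub(r"[^\w\s]", " ", text)                 # remove punctuation
    text = re.sub(r"(.)\1{2,}", r"\1\1", text)           # squeeze runs of 3+ repeated characters to 2
    tokens = [t for t in text.split() if t not in STOP_WORDS]
    return " ".join(tokens)                              # split/join also collapses extra spaces


def ngram_features(cleaned):
    """Return the unigrams followed by the bigrams of a cleaned tweet."""
    tokens = cleaned.split()
    bigrams = [" ".join(pair) for pair in zip(tokens, tokens[1:])]
    return tokens + bigrams


def preprocess(tweets):
    """Clean every tweet and drop those that end up empty."""
    cleaned = (clean_tweet(t) for t in tweets)
    return [c for c in cleaned if c]
```

Under these assumptions, clean_tweet("@bob This phone isn't goooood!!! http://t.co/x 123") returns "phone is not good", and ngram_features on that string yields the four unigrams followed by the bigrams "phone is", "is not", and "not good". Such token lists could then be fed to any of the classifiers named in the abstract.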

Item Type: Article
Subjects: Library Keep > Computer Science
Depositing User: Unnamed user with email support@librarykeep.com
Date Deposited: 17 Jul 2023 06:08
Last Modified: 13 Oct 2023 04:29
URI: http://archive.jibiology.com/id/eprint/1392
