Sentiment Scoring and Performance Metrics Examination of Various Supervised Classifiers
Sherin Mariam John1, K. Kartheeban2

1Sherin Mariam John, Department of Computer Science Engineering, Kalasalingam Academy of Research and Education, Krishnankoil (Tamil Nadu), India.

2K. Kartheeban, Department of Computer Science Engineering, Kalasalingam Academy of Research and Education, Krishnankoil (Tamil Nadu), India.

Manuscript received on 13 December 2019 | Revised Manuscript received on 22 December 2019 | Manuscript Published on 30 December 2019 | PP: 1120-1126 | Volume-9 Issue-2S2 December 2019 | Retrieval Number: B11111292S219/2019©BEIESP | DOI: 10.35940/ijitee.B1111.1292S219

Open Access | Editorial and Publishing Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open-access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: Sentiment Analysis probes public opinion on user generated content on Web like blogs, social media or e-commerce websites. The results of Sentiment Analysis are getting much attention with marketers that they are able to evaluate the success of an advertising campaign or the attitude of people on a new product launch. Business owners and advertising companies are using Sentiment Analysis to start new business strategies and to identify opportunities for new product development. In this paper, with R programming, the tweets from Twitter about Samsung Galaxy mobile phone and Apple Iphone were retrieved from three countries namely USA, UK and India for creating the dataset. The collected tweets were classified into positive, negative and neutral sentiments. The machine learning classifier algorithms like Naïve Bayes, Support Vector Machine, Random Forest, Decision Tree, Artificial Neural Network, XGBoost with K Fold cross validation were applied on the dataset and the results were tabulated for comparing and estimating which classifier algorithm yields the best accuracy. Other performance metric values like F Score, Precision, Recall were also calculated for comparison of various classifier performances on Sentiment Analysis. It was found that XGBoost method combined with K Fold cross validation has produced the best accuracy in prediction. We have also applied SentiStrength algorithm to find out the intensity or the strength of positive and negative comments from each sentence. With the help of the results in hand, we were able to predict the brand of mobile phone that was preferred in each country.

Keywords: Sentiment Analysis, Machine Learning, Text Mining and Analytics, Web Data Mining, Predictive Analytics.
Scope of the Article: Performance Evaluation of Networks