Materials Today: Proceedings
Materials Today: Proceedings
a r t i c l e i n f o a b s t r a c t
Article history: Twitter, as is well known, is one of the most active social media platforms, with millions of tweets posted
Available online 28 April 2022 every day, in which different people express their opinions on topics such as travel, economic concerns,
political decisions, and so on. As a result, it is a useful source of knowledge. We offer Sentiment Analysis
Keywords: using Twitter Data for the research. Initially, our technology retrieves currently accessible tweets and
Public sentiments hashtags about various types of covid vaccinations posted on Twitter through using Twitter’s API.
Tweets Following that, the imported Tweets are automatically configured to generate a collection of untrained
Hashtags on covid vaccines
rules and random variables. To create our model, we’re utilizing, Tweepy, which is a wrapper for
Sentiment analysis
Machine learning algorithms
Twitter’s API. Following that, as part of the sentiment analysis of new Messages, the software produces
donut graphs.
Copyright Ó 2022 Elsevier Ltd. All rights reserved.
Selection and peer-review under responsibility of the scientific committee of the International Confer-
ence on Advanced Materials for Innovation and Sustainability.
https://doi.org/10.1016/j.matpr.2022.04.809
2214-7853/Copyright Ó 2022 Elsevier Ltd. All rights reserved.
Selection and peer-review under responsibility of the scientific committee of the International Conference on Advanced Materials for Innovation and Sustainability.
P. Chinnasamy, V. Suresh, K. Ramprathap et al. Materials Today: Proceedings 64 (2022) 448–451
2. Related works rithms, the MNB had the lowest accuracy of 88 percent. Marcec
and Likic [8] introduced vaccination drive analysis using the Twit-
Sentiment analysis is a technique for analyzing user’s thoughts, ter based sentiment analysis. Here they have used social network-
feelings, and judgments. According to some viewpoints, sentiment ing sites like Twitter might very well prove to be a useful medium
analysis is used to determine what users believe based on informa- of communication on SARS-CoV-2 vaccinations feelings and beliefs
tion such as written opinions. The analysis was carried out on the which can be observed nearly instantly.
written opinion, according to the two opinions. People frequently Alam et al [9], they have used valence aware dictionary for sen-
express and post their ideas on social media as a result of the dig- timent reasoned, a natural language processing (NLP) tool, was
ital era’s development, which makes us unable to avoid it. When it used to assess people’s feelings towards certain vaccines (VADER).
comes to internet, Twitter is one of the most popular places for They were able to illustrate the entire scenario by grouping the
people to express themselves. This can be utilized as a data source received attitudes into three categories (positive, negative, and
for analysis. Because of its prominence, Twitter was chosen as a neutral). We found that 33.96 percent of the respondents were
source of opinion mining. A concise explanation of beliefs and per- positive, 17.55 percent were negative, and 48.49 percent were neu-
ceptions up to 150 characters long on Twitter. This can be utilized tral. Nezhad et al [10,11], investigated Iranian people’s views on
as information in sentiment analysis to generate information and COVID-19 vaccinations by analyzing Iranian twitter posts. Discuss
determine people’s true influence. Text mining, which is a way of the various Iranian attitudes toward indigenous and imported
extracting important information from a material, was used in this COVID-19 vaccinations. They have used CNN-LSTM architecture,
sentiment analysis study. Text mining is that entails document we determined emotions of recovered tweets. Furthermore, we
preparation. Preprocessing transforms text from unstructured doc- looked into Iranian attitudes on the COVID-19 vaccine.
uments into structured information. The structured information is Samira et al [4], developed an analysis model based on the con-
subsequently classified using a data mining classification approach tent of Twitter. They were able to determine popular attitude and
[1]. thoughts about the COVID-19 vaccinations. They have discovered
The second study [2] used Naive Bayes, Decision Trees, and Ran- vaccination feelings and views in retweets after separation and
dom Forest to analyze sentiment in social media. Positive, negative, analyzed their growth over time, geographical location, principal
and neutral sentiments are categorized based on Twitter data. As a themes, hashtags, post analytics tools, as well as accounts features.
result, Indonesian Twitter users are more neutral in their views. Such tweets are collected from Twitter using a Twitter API authen-
The three methods with the highest accuracy were compared using tication token in this investigation. NLP is used to receive and ana-
data from fast miner tools. With an accuracy of 86.43 percent, Nave lyze the raw twitter posts. A supervised KNN clustering algorithm
Bayes is more accurate than the other method. The Decision Tree is then used to classify the produced data. The data is divided into
and Random Forest were both 82.91percent accurate. three categories by the algorithm: positive, negative, and neutral.
The third study [3] looked at classification with the Naive Bayes These categories pertain to the prevailing attitude of the persons
Classifier and Decision Tree Algorithms. The Naive Bayes classifica- whom Retweets are being analysed by Javed et al [12]. Ansari
tion and decision tree method are suggested for this study as a et al [13], they have analysed COVID-19 immunization retweets
more accurate and effective prediction in the assumption sort of to offer an assessment of the government’s sentiments to ongoing
brain tumor. The goal of this research is to show that the Decision immunization drives. Investigations on sentiment classification
Tree algorithm is more straightforward and straightforward than were also conducted in order to reveal new knowledge regarding
the Nave Bayes algorithm. The type of tumor has been identified the effects of location and gender including [3,14,15].
using these two algorithms, and it allows for the examination of As a result of the best approach that can be taken is that the
historical data from data sets, allowing neurologists to make edu- Naive Bayes algorithm is used for selection for training web pages.
cated guesses about the type of tumor. The Decision Tree algorithm
is, without a doubt, faster and more accurate than the Naive Bayes 3. Proposed method
algorithm. Furthermore, the fourth study uses Decision Tree, K-
Nearest Neighbor, and Nave Bayes to analyze sentiment from Panda’s library was used to turn the data-set into a data frame
tweets. A tweet from e-commerce is used as a case study in this for the purposes of this paper, making data manipulation and anal-
study. Text mining, text preprocessing, categorization, and other ysis easier. The Natural Language Toolkit (NLTK) is a computational
approaches are used in this study. These strategies are used to linguistics package that provides easy-to-use interfaces to over 50
develop sentiment analysis classification and analysis. Rapid miner lexical resources. This library was used for text pre-processing
is also utilized to aid in the comparison of analysis attitudes by uti- (stemming, lemmatization, and stop word removal), tokenization,
lizing three different categories inside the dataset. According to the sentiment analysis (NLTK Sentiment Intensity Analyzer), and Emo-
findings of this study [4,5], the Nave Bayes algorithm has the max- tional Analysis (Lexicon based) in this article. Matplotlib is a data
imum accuracy of 77 percent. The fifth study compares Naive visualization package that may be used to create static, animated,
Bayes, Decision Trees, and Neural Networks in classification. This and interactive data visualizations. This library was used to plot
study categorizes web pages through a of the three algorithms. time series graphs and donut chart for the purposes of this paper.
According to the findings of this study [6], they improved Nave. Tokenization is the process of dividing a large chunk of text into
The Bayes classifier not only out performs the traditional Nave smaller tokens. In this case, tokens can be words, characters, or sub
classifier. Bayes, on the other hand, performs similarly well. words. I utilized NLTK’s treebank tokenizer to perform word level
Khakharia et al [7], they have used Machine Learning to scan tokenization on tweets in this project. Text Pre-processing cleans
public tweets from Twitter connected to COVID-19 immunizations the text data and removes superfluous elements such as stop
in order to determine the user’s perspective on the vaccine (ML). words, hashtags, and emoticons, among other things, so that the
The analysis of the Twitter data into positive or negative messages text is ready to be given to the model.
was done using three distinct machine learning algorithms such as These methods were used in this study. Because of the strong
Multinomial Nave Bayes (MNB), Support Vector Machine (SVM), assumptions and great accuracy, the Naive Bayes method is used.
and Logistic Regression (LR). The LR algorithm produced the best The Decision Tree method was chosen because it is more straight-
results in the analysis, with a 97.3 percent accuracy, while SVM forward and straightforward to utilize. The following are the expla-
had a 96.26 percent accuracy. In comparison to the other ML algo- nations for the two methods:
449
P. Chinnasamy, V. Suresh, K. Ramprathap et al. Materials Today: Proceedings 64 (2022) 448–451
4. Performance evaluation
Fig. 3. The neutral value. Fig. 5. (a) (b) the unigram and Bigram of each sentiment.
450
P. Chinnasamy, V. Suresh, K. Ramprathap et al. Materials Today: Proceedings 64 (2022) 448–451
demic, they might use sites such as twitter to spread good mes-
sages and reduce negative ones.
P.Chinnasamy, Writing the papers and responsible for
Implementation.
V.Suresh, Collection of DataSet.
K.Ramprathap - Proof Reading this article.
B Jency A Jebamani - Support for Plagiarism removal.
K. Srinivas Rao – Article writing.
Shiva Kranthi – Results and discussion section writing.
References
451