Professional Documents
Culture Documents
Abstract— For many people, deciding on a movie night We can see various areas where opinion mining is
begins with a search of online review sites. Consumer employed such as Analysis of Movies, Products which
information has been redefined by social media platforms.
Regarding movies, one such platform is Rotten Tomatoes, are commercially used, varying kinds of services, news
which is focused on in this project. The goal is to test and so on.
various machine learning methods for predicting the
sentiment of unseen reviews using an upgraded corpus with Subtasks include:
more information about the opinions for various sub- a. Subjective analysis: This is one of the
categories. The Amazon Mechanical Turk-annotated categories of the subtasks of Sentiment Analysis. Here
corpus of review of movies mentioned on Rotten Tomatoes after examining the text we classify them as objective or
is used, which is considerably optimised and annotated with subjective. Objective means that the meaning hidden
a high emotion attribute. This work tests if tighter inside the text is neutral. Whereas in the subjective class
sentiment annotations for each root and span for a set of
characters that is parsed into a training set can aid in
of the text, it is inclined towards the positive or negative
determining general emotion or sentiment or opinion of side up to a certain degree.
unnoticed phrases. b. Polarity Analysis: This method is used in the
text which is classified as subjective. Here we have two
Keywords- Naive Bayes, Sentiment analysis, Support Vector sub-categories. The text can be positive or negative
Machine, Rotten Tomatoes, Deep Learning, Movie Reviews, which should be decided after considering the polarity of
Neural Network, Natural Language Processing, Machine the meaning concealed in the text.
Learning c. Degree of Polarity: Sentiment analysis as the
name suggests is the investigation and tracking of the
mood of the customers towards a product or service or
I. INT RODUCT ION topic by collecting their feedback. Now, the polarity
degree expresses the text’s degree of inclination towards
Sometimes it is necessary to comprehend the point of
the negative or positive side. Sentiment analysis is used
view of any creators or authors over the subject instead
of considering the subject itself. Sentiment analysis is a here in a way to identify the attention of the customers
gained by the movie and the impact it had on audience.
part of Data Mining technology. It is becoming a highly
researched topic among the researchers. It is used in the
field of collecting the users’ feedback over a particular With the developing prevalence of online entertainment
stages which incorporate Facebook and survey sites, for
subject. It is also very necessary to understand the
example, Zomato or Rotten Tomatoes, it's basic to have
feedback by getting insights of it. This can be achieved
using NLP. the option to naturally translate gigantic measures of
emotionally one-sided information. Sentiment analysis,
which groups abstract human assessments or opinions
Today in this ever growing competitive world where it is
highly required to maintain the pace with other utilizing regular language handling and AI procedures, is
rapidly acquiring prevalence as an approach to breaking
competitors, businesses and organizations are diving
down huge corpora for an assortment of utilizations.
deep into the application of NLP so that they can use
various features of it which may prove beneficial for
In this review, we gather emotional human opinions
them as well as the business. Their main motto is to
from subsets of concentrates, down to the feelings of
understand the customer feedback, views and opinions
over services and products and use it in an effective individual words. Fusing the feeling of individual words
into learning calculations would provide us with a
manner so that their objectives can be achieved. Users
superior comprehension of how the opinion of the whole
post various comments and reviews on social media for
any type of service or product they use. Those views can example is created from its constituents. We trust that
be classified into positive, negative or neutral comment. better grained opinion investigation will further develop
exactness on clever models.
Sometimes it is necessary to examine the traits of
movies. At that time features are extracted and the
meaning or sentiment hidden inside them are
investigated.
Authorized licensed use limited to: b-on: ISCTE. Downloaded on March 25,2023 at 15:08:39 UTC from IEEE Xplore. Restrictions apply.
Proceedings of the Sixth International Conference on Intelligent Computing and Control Systems (ICICCS 2022)
IEEE Xplore Part Number: CFP22K74-ART; ISBN: 978-1-6654-1035-9
Authorized licensed use limited to: b-on: ISCTE. Downloaded on March 25,2023 at 15:08:39 UTC from IEEE Xplore. Restrictions apply.
Proceedings of the Sixth International Conference on Intelligent Computing and Control Systems (ICICCS 2022)
IEEE Xplore Part Number: CFP22K74-ART; ISBN: 978-1-6654-1035-9
Authorized licensed use limited to: b-on: ISCTE. Downloaded on March 25,2023 at 15:08:39 UTC from IEEE Xplore. Restrictions apply.
Proceedings of the Sixth International Conference on Intelligent Computing and Control Systems (ICICCS 2022)
IEEE Xplore Part Number: CFP22K74-ART; ISBN: 978-1-6654-1035-9
Authorized licensed use limited to: b-on: ISCTE. Downloaded on March 25,2023 at 15:08:39 UTC from IEEE Xplore. Restrictions apply.
Proceedings of the Sixth International Conference on Intelligent Computing and Control Systems (ICICCS 2022)
IEEE Xplore Part Number: CFP22K74-ART; ISBN: 978-1-6654-1035-9
Authorized licensed use limited to: b-on: ISCTE. Downloaded on March 25,2023 at 15:08:39 UTC from IEEE Xplore. Restrictions apply.
Proceedings of the Sixth International Conference on Intelligent Computing and Control Systems (ICICCS 2022)
IEEE Xplore Part Number: CFP22K74-ART; ISBN: 978-1-6654-1035-9
B. Support Vector Machine (SVM) After performing the 10-cross validation, the results
obtained are as follows:
"Support Vector Machine" is a machine learning model
that classifies some given data into pre-defined
categories, and thus it is a supervised learning method. T ABLE III. RESULT OF 10-FOLD CROSS VALIDAT ION
The classification predictions made via SVM can be
used for various purposes and artificial intelligent # SVM - uni MNB - uni
products. Notwithstanding, for the majority of times, its 1 73.1 75.9
2 72.6 76.1
usage is in the issues of order categorisation. While 3 73.2 75.4
implementing SVM and performing calculation, all the 4 73.8 75.2
data and information provided is considered as a point in 5 74.1 76.3
the space with n-layers, n being the number of attributes 6 73.6 75.9
or elements within the given dataset. The points, 7 73.2 74.5
representing data, are plotted there and each of them has 8 72.9 73.8
9 73.8 74.2
a specific direction. As it is a classification algorithm, 10 74.3 74.9
we observe the hyper-planes which are separating the Average 73.4 75.2
whole dataset into two or more classes in the overall
picture.
In the above results, we observed that this model was
slightly bent towards predicting number of positives more
C. Deep Learning than the number of negatives. This was observed due to
Machine learning has a branch called deep learning. the rounding off the ratings in the initial arrangement
Unlike typical machine learning algorithms, which have from 0 to 1, causing an unequal ratio or balance in the
a fixed ability to learn regardless of the amount of data polarity of the mentioned dataset, basically converting the
they collect, deep learning systems may increase their neutral data too into positive.
performance with additional data: the computer
equivalent of more experience. Machines can be put to In the next step, we have further made some
work for specialized tasks such as text categorization improvisations and optimized the Support Vector
when they have accumulated enough knowledge through Machine model for the sentiment analysis of Rotten
deep learning. Tomatoes reviews. This was done by varying the
regularization parameter for the model. For the best
Deep learning is a method that executes and takes model that would give better accuracy and correct results,
decisions and implement them for multiple applications the regularization parameter was found to be C = 0.08
using the L2-loss and L2-regularization. The percentages
and reasonings, that can even produce and develop
numerous computerized services/products/applications. found for TP, TN, FP, FN were as follows:
Most importantly, it does not require any human
interference or command to decide or perform any T ABLE IV. RESULT SHOWING T P, T N, FP AND FN
service that requires insights and thinking.
True False
The ratings determined during the training process of this Positive 39.6% 13.7%
dataset are impacting whole scale, and the model is Negative 32.4% 11.3%
computed around the same approximately. Therefore, Accuracy 77.5%
another technique is to scale using computed probability Precision 74.7%
and take these intervals into use for each rating from the Recall 79.2%
training set. F-Ratio 76.8%
Authorized licensed use limited to: b-on: ISCTE. Downloaded on March 25,2023 at 15:08:39 UTC from IEEE Xplore. Restrictions apply.
Proceedings of the Sixth International Conference on Intelligent Computing and Control Systems (ICICCS 2022)
IEEE Xplore Part Number: CFP22K74-ART; ISBN: 978-1-6654-1035-9
REFERENCES
[1] Cambria, E., Schuller, B., Xia, Y. and C. Havasi, 2013, New
Avenues in Opinion Mining and Sentiment Analysis. IEEE Intelligent
Systems, 28, 2.
[2] Cvijickj, I.P. and F. Michahelles, 2011, Understanding Social
Media Marketing: A Case Study on T opics, Categories and Sentiment
on a Facebook Brand Page. MindT rek '11 Proceedings of the 15th
International Academic MindT rek Conference: Envisioning Future
Media Environments, Finland. 175182
[3] Younis, E.M.G., 2015, Sentiment Analysis and T ext Mining for
Social Media Microblogs Using Open Source T ools: An Empirical
Study. International Journal of Computer Applications, 112, 5.
Authorized licensed use limited to: b-on: ISCTE. Downloaded on March 25,2023 at 15:08:39 UTC from IEEE Xplore. Restrictions apply.