To understand how opinion mining can help, an example is given. Suppose a television
manufacturing company discusses a business problem: analyzing the sentiment expressed
in reviews. The company wanted to understand how content users were with product
pricing and features. A very thorough analysis of the reviews was conducted, and the
most liked features, the most disliked features, views on pricing, and segments of
users with particular priorities (for example, certain users were more concerned about
the pricing than the quality, and some just need bare minimal features in a TV) were
identified. After this study, the manufacturer was able to understand how to pitch
which type of television to which type of users. Also, the most commonly liked
features were upgraded permanently and the most commonly disliked features were
removed.
• Aspect-based sentiment analysis: A text analysis technique that divides the text
data and defines its sentiment based on its aspects. It analyzes consumer feedback
data by correlating emotions to different aspects of a product or service.
• Fine-grained sentiment analysis: This sentiment analysis model contributes to the
advancement of polarity accuracy. Sentiment analysis can be done in the different
categories discussed in the above section based on polarity.
• Emotion detection: Emotion analysis is the method of defining and evaluating the
emotions conveyed in textual data. Emotion detection and classification are
straightforward tasks that can be completed based on the emotions conveyed in the
text, such as fear, rage, happiness, sorrow, affection, motivation, or neutral.
• Intent analysis: The method of analyzing text data to determine the author's
intent is known as intent detection. Many human behaviors and actions are motivated
by motives, and knowing intentions will help you perceive these behaviors.
• sentences taken out of context,
Listing 5. Example of aspect term extraction and aspect term polarity detection
Aspect-based sentiment analysis identifies the aspects of a given target entity and
the sentiment expressed toward each aspect. Aspect categories (e.g., food, price)
identify coarser features than aspect terms, and they do not necessarily occur as
terms in a given sentence. In our example, the terms fajitas and pico de gallo refer
to the same aspect category food, while the term service refers to the aspect
category service.
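The relationship between aspect terms and aspect categories described above can be sketched as a simple lookup. This is a minimal illustration only: the table name ASPECT_CATEGORY and the function categories_for are hypothetical, the matching is naive substring search, and only the three terms and two categories shown are taken from the example.

```python
# Hypothetical term-to-category table; only these three entries come from the text.
ASPECT_CATEGORY = {
    "fajitas": "food",
    "pico de gallo": "food",
    "service": "service",
}

def categories_for(sentence):
    """Return the sorted aspect categories whose known terms occur in the sentence."""
    text = sentence.lower()
    return sorted({cat for term, cat in ASPECT_CATEGORY.items() if term in text})

print(categories_for("The fajitas and pico de gallo were great, but the service was slow."))
# ['food', 'service']
```

Because aspect categories do not necessarily occur as terms in a sentence, a lookup like this can only recover categories that are signalled by an explicit term; implicit categories would need a trained classifier.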
In Listing 6 we annotate aspect categories and introduce the use of the property
marl:describesFeature for this purpose.
Listing 6. Annotation of aspect categories and their polarity
Bag of words: The most common statistical approach for sentiment analysis is to
treat the text content in an input document as a bag of n-grams without taking into
account the grammar or the order of the n-grams. The frequency of each term (n-gram)
occurring in the input is usually considered as a feature vector for the machine
learning model. Sentiment, emoticon, and emoji dictionary-based features can be
effectively combined with n-gram features and a hybrid machine learning model
can be built to learn underlying sentiment patterns. Since 2003, the majority of the
research work in sentiment analysis using statistical approaches mainly employs the
bag of words approach with notable work being done by Pang et al. [50].
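The bag-of-n-grams representation can be sketched in a few lines. The function names, the whitespace tokenizer, and the tiny vocabulary below are assumptions chosen for illustration; a real system would use a proper tokenizer and a vocabulary learned from the training corpus.

```python
from collections import Counter

def ngrams(tokens, n):
    """Return the list of n-grams (as tuples) in a token sequence."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def bag_of_ngrams(text, max_n=2):
    """Count every 1..max_n-gram; only counts are kept, not positions."""
    tokens = text.lower().split()  # naive whitespace tokenizer (assumption)
    bag = Counter()
    for n in range(1, max_n + 1):
        bag.update(ngrams(tokens, n))
    return bag

def to_vector(bag, vocabulary):
    """Project a bag onto a fixed vocabulary to get a frequency feature vector."""
    return [bag[term] for term in vocabulary]

bag = bag_of_ngrams("good food good price")
vocabulary = [("good",), ("food",), ("price",), ("good", "food"), ("bad",)]
print(to_vector(bag, vocabulary))  # [2, 1, 1, 1, 0]
```

Dictionary-based features (for example, a count of words found in a sentiment lexicon) could be appended to this vector before it is passed to a classifier.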
Aspect-based sentiment analysis: In online rating systems, the overall rating of an
entity does not necessarily explain the quality of the different dimensions of an
entity. Consider, for example, the restaurant dining domain. The objective of a
sentiment analysis system in this domain is to analyze the opinions of people on
different aspects of a restaurant (e.g., ambience, food quality, service and price
range). A typical restaurant review looks like the following (Fig. 7):
A compositional function takes the vector representation of child nodes at each level
and computes the vector representation of a current node in a bottom-up fashion. The
function uses the vector representation of a node as a feature of a sentiment
classifier to compute the sentiment value at the node level. Socher et al. [53]
compute sentiment values with the softmax classifier [58,59], as shown in Fig. 10.
Fig. 10. Sentiment value computation at each node [53].
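The bottom-up computation can be sketched as follows. The text does not state the exact form of the compositional function, so this sketch assumes one common choice, a single tanh layer over the concatenated child vectors, followed by a softmax classifier at every node. All names, dimensions, and parameter values are illustrative and untrained.

```python
import math

def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]

def compose(left, right, W, b):
    """Assumed composition: parent = tanh(W [left; right] + b)."""
    stacked = left + right  # list concatenation of the two child vectors
    return [math.tanh(z + bias) for z, bias in zip(matvec(W, stacked), b)]

def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    shift = max(scores)
    exps = [math.exp(s - shift) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def evaluate(tree, embeddings, W, b, Ws):
    """Return (node vector, [(subtree, class distribution), ...]) bottom-up."""
    if isinstance(tree, str):  # leaf: look up the word vector
        vec = embeddings[tree]
        labelled = []
    else:  # internal node: evaluate both children first, then compose
        left_vec, left_out = evaluate(tree[0], embeddings, W, b, Ws)
        right_vec, right_out = evaluate(tree[1], embeddings, W, b, Ws)
        vec = compose(left_vec, right_vec, W, b)
        labelled = left_out + right_out
    return vec, labelled + [(tree, softmax(matvec(Ws, vec)))]

# Toy, hand-picked parameters (not trained): 2-d vectors, 3 sentiment classes.
embeddings = {"not": [-0.5, 0.1], "good": [0.8, 0.3], "food": [0.0, 0.2]}
W = [[0.5, -0.2, 0.4, 0.1], [0.1, 0.3, -0.3, 0.6]]
b = [0.0, 0.0]
Ws = [[-1.0, 0.0], [0.0, 1.0], [1.0, 0.0]]  # rows: negative, neutral, positive

root_vec, outputs = evaluate((("not", "good"), "food"), embeddings, W, b, Ws)
for subtree, distribution in outputs:
    print(subtree, [round(p, 3) for p in distribution])
```

Each word and each phrase in the binary parse tree receives its own class distribution, which is what allows sentiment to be read off at every node rather than only at the root.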
by identifying the most similar neighbors within the cluster of a specific user. To
implement this method, they carried out two major operations. First, they extracted
the user's preferences for the given items based on multicriteria ratings. Second,
the cluster centers were defined on the basis of the user's preferences.
Akhtar et al. [9] presented a technique for analyzing hotel reviews and extracting
valuable information and/or knowledge from them to assist the service providers as
well as to help customers identify the loopholes and strengths in the service sector
to improve their business performance.
Appendix: List of Abbreviations
ABSA: Aspect-Based Sentiment Analysis
ACD: Aspect Category Detection
CNN: Convolutional Neural Networks
CRF: Conditional Random Field
DL: Deep Learning
DNN: Deep Neural Networks
GRU: Gated Recurrent Unit
LSTM: Long Short-Term Memory
ME: Maximum Entropy
MemNet: Memory Networks
NER: Named Entity Recognition
NLP: Natural Language Processing
OTE: Opinion Target Extraction
POS: Part Of Speech
RAE: Recursive Auto-encoders
RecNN: Recursive Neural Networks
RNN: Recurrent Neural Networks
SemEval: International Workshop on Semantic Evaluation
SP: Sentiment Polarity
SVM: Support Vector Machine
WE: Word Embeddings
Comprehensive analysis of aspect term extraction methods using various text embeddings
Łukasz Augustyniak, ... Przemysław Kazienko, in Computer Speech & Language, 2021
Abstract
Recently, a variety of model designs and methods have blossomed in the context of the
sentiment analysis domain. However, there is still a lack of comprehensive studies of
Aspect-based Sentiment Analysis. We want to fill this gap and propose a comparison
with ablation analysis of Aspect Term Extraction using various text embeddings
methods. We particularly focused on simple architectures based on long short-term
memory (LSTM) with optional conditional random field (CRF) enhancement using different
pre-trained word embeddings. Moreover, we analyzed the influence on the performance of
extending the word vectorization step with character-based word embeddings. The
experimental results on SemEval datasets revealed that bi-directional long short-term
memory (BiLSTM) could be used as a very good predictor, even comparing to very
sophisticated and complex models using huge word embeddings or language models. We
presented a comprehensive analysis of various customizations of LSTM-based
architecture and word/character embeddings that could be used as a guideline to choose
the best model version for particular user needs.
1 | Opinion mining and sentiment analysis [27] | 276 | 2008 | 2487
2 | Mining and summarizing customer reviews [48] | 108 | 2004 | 1400
3 | Lexicon-based methods for sentiment analysis [53] | 79 | 2011 | 471
4 | Recognizing contextual polarity in phrase-level sentiment analysis [51] | 55 | 2005 | 657
5 | Techniques and applications for sentiment analysis: The main applications and challenges of one of the hottest research areas in computer science [54] | 51 | 2013 | 203
6 | New avenues in opinion mining and sentiment analysis [55] | 51 | 2013 | 202
7 | Measuring praise and criticism: Inference of semantic orientation from association [43] | 50 | 2003 | 694
8 | Sentiment strength detection for the social web [56] | 46 | 2012 | 228
9 | Mining the peanut gallery: Opinion extraction and semantic classification of product reviews [3] | 44 | 2003 | 617
10 | Extracting product features and opinions from reviews [57] | 43 | 2005 | 517
11
lexical resource for opinion mining [63]
9 | SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining [64] | 186 | 2010 | 1304
10 | From tweets to polls: Linking text sentiment to public opinion time series [65] | 179 | 2010 | 1256
11 | Recognizing contextual polarity in phrase-level sentiment analysis [51] | 178 | 2005 | 2141
11 | Extracting product features and opinions from reviews [57] | 178 | 2007 | 1775
One of the top cited literature reviews is a book by Bing Liu [60]. The 167 pages
contain a wide array of topics, with chapters about document, sentence and
aspect-based sentiment analysis. Overall the topic is approached first by introducing
the research problems of sentiment analysis and then answering them with the latest
knowledge available during the writing of the book.
orientations of the phrases it consists of, using the PMI score. The approach is
evaluated with reviews from different domains, i.e. automobile, bank, movie and
travel reviews. Movie reviews prove to be particularly challenging for the approach,
as a review of a recommendable movie can contain negative adjectives describing
incidents in the movie, e.g. violence and horror.
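A PMI-based phrase orientation of the kind mentioned above can be sketched from co-occurrence counts. The sketch assumes the usual formulation in which a phrase's orientation is its PMI with a positive reference word minus its PMI with a negative reference word, and that a review is scored by averaging over its phrases; the function names and all counts are hypothetical, and zero counts would need smoothing that is omitted here.

```python
import math

def pmi(joint, count_x, count_y, total):
    """Pointwise mutual information log2(p(x,y) / (p(x) p(y))) from raw counts."""
    return math.log2((joint / total) / ((count_x / total) * (count_y / total)))

def semantic_orientation(near_pos, near_neg, count_phrase, count_pos, count_neg, total):
    """PMI(phrase, positive word) minus PMI(phrase, negative word)."""
    return (pmi(near_pos, count_phrase, count_pos, total)
            - pmi(near_neg, count_phrase, count_neg, total))

def review_orientation(phrase_scores):
    """Assumed review score: the mean orientation of the review's phrases."""
    return sum(phrase_scores) / len(phrase_scores)

# Made-up counts: the phrase co-occurs 4 times with the positive reference
# word and once with the negative one, in a corpus of 1000 windows.
score = semantic_orientation(near_pos=4, near_neg=1, count_phrase=10,
                             count_pos=100, count_neg=100, total=1000)
print(round(score, 3))
print("recommended" if review_orientation([score, -0.5]) > 0 else "not recommended")
```

The movie-review difficulty noted above shows up directly in such a score: phrases describing violent or frightening plot events pull the average down even when the reviewer recommends the film.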
Tumasjan et al. [62] examine around 100,000 tweets from Twitter as a predictor for
election results. They find that the proportion of mentions of parties and prominent
politicians mirrors the election results quite closely, although with less accuracy
than opinion polls. Another interesting finding was that 40% of the messages were
posted by 4% of the users.
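Comparing mention proportions with election results amounts to a small piece of arithmetic, sketched below. The party names, mention counts, and vote shares are invented placeholders, not figures from the study, and the use of mean absolute error as the comparison measure is an assumption.

```python
def shares(counts):
    """Convert raw mention counts into proportions that sum to 1."""
    total = sum(counts.values())
    return {name: count / total for name, count in counts.items()}

def mean_absolute_error(predicted, actual):
    """Average absolute gap between predicted and actual shares."""
    return sum(abs(predicted[name] - actual[name]) for name in actual) / len(actual)

# Invented placeholder data, for illustration only.
mentions = {"Party A": 500, "Party B": 300, "Party C": 200}
vote_share = {"Party A": 0.45, "Party B": 0.35, "Party C": 0.20}

mention_share = shares(mentions)
error = mean_absolute_error(mention_share, vote_share)
print(mention_share)
print(round(error, 4))
```

The same error measure could be computed for opinion polls, which would make the comparison of accuracy between the two predictors explicit.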
SentiWordNet is an automatically generated lexicon developed by Baccianella, Esuli and
Sebastiani [63,64] to be specifically used in sentiment analysis. The lexicon was
generated using a semi-supervised learning algorithm and evaluated by comparing the
generated semantic values of words with the manually annotated sets of synonyms.