You are on page 1of 8

Taking the Pulse of Political Emotions in Latin America

Based on Social Web Streams

Ernesto Diaz-Aviles Claudia Orellana-Rodriguez Wolfgang Nejdl

L3S Research Center / University of Hannover. Hannover, Germany


{diaz, orellana, nejdl}@L3S.de

Abstract—Social media services have become increasingly example, automatically detecting emotions such as joy, sad-
popular and their penetration is worldwide. Micro-blogging ness, fear, anger, and surprise in the social web has several
services, such as Twitter, allow users to express themselves, practical applications, for instance, tracking the popularity of
share their emotions and discuss their daily life affairs in
real-time, covering a variety of different points of view and political figures or public response to new released products.
opinions, including political and event-related topics such as This is the field of sentiment analysis, which involves
immigration, economic issues, tax policy or election campaigns. determining the opinions and private states (beliefs, feelings,
On the other hand, traditional methods tracking public opinion and speculations) of the speaker towards a target entity [3].
still heavily rely upon opinion polls, which are usually limited
to small sample sizes and can incur in significant costs in terms Our goal in this work is to explore the sentiments and
of time and money. In this paper, we leverage state-of-the-art emotions towards political figures in Latin America. To
techniques of sentiment analysis for real-time political emotion this end, we analyze mentions on Twitter and blogs of
tracking. In particular, we analyze mentions of personal names eighteen Latin American presidents, between October 1,
of 18 presidents in Latin America, and measure each political
figure’s effect in the emotions reflected on the social web.
2011 and April 1, 2012. The names of the presidents and
their respective country are listed in Table I. By making use
Keywords- Computational Social Science; Sentiment Analy- of an emotion lexicon, we study the emotions evoked by
sis; Social Media Analytics; Twitter.
each president. While this approach is standard in many
I. I NTRODUCTION applications (e.g., [4], [5], [6]), we felt that a study on
political emotion detection via the social web, covering Latin
The Social Web has been successfully established and is America in particular, was necessary.
growing. Real-time microblogging services, such as Twitter
In summary the contributions of this paper are:
(twitter.com), have experienced an explosion in global user
adoption over the past years. It is estimated that the Twitter • We present an extensive sentiment analysis of the
users surpassed 300 million, and they generate more than political landscape of Latin America. To the best of our
200 million of 140-character Twitter messages – tweets – knowledge, this is the first study of this nature with a
every day [1]. Latin America is not the exception, Brazil, coverage of eighteen countries in the region.
Mexico, Venezuela, Colombia, Argentina, and Chile figure • For polarity and emotion detection, many studies have
among the top-20 countries in terms of Twitter accounts, been made in sentiment analysis. This paper presents,
as reported by a recent study by Semiocast, a provider of not only the extracted emotions and polarity, but also
consumer insight and brand management solutions [2]. goes a step forward and quantifies which combination
The high rate at which users share their opinions on blogs, of emotions explains better the public’s opinion.
forums, and social networking sites, such as Facebook or • The paper provides evidence of the potential uses of
Twitter, makes this kind of media even more attractive to twitter in emerging regions. It provides an example of
measure specific sentiments towards current affairs. Teal- integration of sentiment analysis in Spanish and the
time access to the large amount of user generated content real-time nature of Twitter using Latin America as test
available can provide the tools to social researchers, and bed.
citizens in general, to monitor the pulse of the society
towards specific topics of interest, a task traditionally ac- The rest of the paper is organized as follows. In Section II,
complished only through opinion polls, which are costly and we discuss our methodology and instruments: we describe
time consuming to conduct, and therefore frequently limited the preprocessing and analysis of the documents collected
to small sample sizes. for each president and also present our sentiment analysis
Real-time analysis of social media streams allows for framework and model. Section III discusses the results of our
discovery of latent patterns in public opinion, which can study. We present related work in Section IV. We conclude
be exploited to improve decision making processes. For the paper in Section V.
ID Country President ID Country President
P1 Argentina Cristina Fernández P10 Guatemala Otto Pérez Molina
P2 Bolivia Evo Morales P11 Honduras Porfirio Lobo
P3 Brazil Dilma Rouseff P12 Mexico Felipe Calderón
P4 Chile Sebastián Piñera P13 Nicaragua Daniel Ortega
P5 Colombia Juan Manuel Santos P14 Panama Ricardo Martinelli
P6 Costa Rica Laura Chinchilla P15 Paraguay Fernando Lugo
P7 Dominican Republic Leonel Fernández P16 Peru Ollanta Humala
P8 Ecuador Rafael Correa P17 Uruguay José Mujica
P9 El Salvador Mauricio Funes P18 Venezuela Hugo Chávez

Table I: Presidents of Latin America considered in the analysis (listed alphabetically by country name).

II. TAKING THE P ULSE OF P OLITICAL E MOTIONS • Blog posts were fetched using Google News RSS
Our approach consists of the following steps: Feeds4 . Similarly as in the case of tweets, we used
1) Data collection process as query term the name of the president and forced an
2) Emotion and Polarity Analysis exact match. We restricted the sources of information
3) Pattern recognition from the sentiment analysis to be exclusively blogs in the Spanish language. Again,
the time range was specified to the period under anal-
In this section, we first present how we collected the set
ysis. In this case, we consider as document the post’s
of documents used in the study and the dataset statistics.
title and the short snippet of text (∼300 characters)
Second, we explain the preprocessing techniques on the
contained in the item’s description tag of the RSS result
dataset, and finally, we explain the approach for emotion
as returned by Google.
analysis employed in this work. In Section III, we present
the results and discuss the patterns discovered from the
Stream of
sentiment analysis that can help explain the popularity Tweets and Blog snippets
observed in opinion polls.
A. Data Collection Process
We perform our study on a collection of 165,484 docu-
Social Web
ments, from them, 155,280 are 140-character Twitter mes- Social Analytics

sages or tweets, and 10,204 are snippets of weblog posts. The Data Collector

total of documents was produced by 55,013 distinct users Sentiment Miner

during the six-month period between 1st of October, 2011


and 1st of April, 2012. We chose this period of time because Results and
Visualizations
it allowed us to discuss and contrast our findings against an
independent opinion poll published in April 2012 [7]. The
Figure 1: Social Analytics Process. The process monitors in
same procedure and analytic techniques discussed in this real-time the interactions of people in the Social Web, e.g., the
work can be directly applied on real-time data streams, as content they generate and exchange. The data is analyzed and
illustrated in Figure 1. Both, tweets and blog posts are in global-patterns are discovered. Finally, the loop is closed, and
Spanish1 and they were collected as follows: the results are given back to the web of people.
2
• Twitter messages were retrieved using Topsy , a Twit-
ter search engine that indexes and archives messages
posted on Twitter3 . For each president name listed in B. Model of Emotion and Polarity Analysis for Political
Table I, we issued a query against Topsy using its API. Figures
We forced an exact match on the name by enclosing it Our objective is to identify the emotions reflected in
in double quotes. We also included in the parameters the Social Web towards a political figure, in our case, a
the corresponding start and end date of interest. particular Latin American president. To this end, we analyze
1 For the president of Brazil, we analyzed a total of 18,933 documents (tweets and tweets and blog post snippets of maximum 140-character
blog posts) in Spanish. Future work includes also the analysis of documents in the and 300-character long, respectively. Given the short text of
Portuguese language.
2 Topsy: topsy.com
the documents, we assume that words close to the president’s
3 Please note that Twitter’s search API only allows to retrieve recent
tweets (between 6-9 days old) (https://dev.twitter.com/docs/using-search) 4 Google News: news.google.com
For instance, the word friend has the emotion joy and a
positive polarity associated to it, whereas the word violent
has associated the emotions of anger, disgust, fear, surprise,
and trust; and a negative polarity.

Sentiment Analysis and Multilingualism


Note that the terms in the lexicon are in English, however,
the profile of the presidents contains text in Spanish. One
approach to address this issue is to use machine translation
to translate the text of each document into English, and
conduct the analysis in this language, for example in [11]
tweets are translated from German to English to extract
the emotions. However, our objective is to process the
social stream in real-time, and translating each and every
microblog post would be costly. Instead, we propose to
machine translate the terms in the lexicon from English
to Spanish, in this way, the process is performed once,
offline, and the resulting terms are used to perform the
analysis in the same language of the posts. To this end,
Figure 2: Plutchik’s Wheel of Emotions. (The image is taken we translated the terms in EmoLex using three different
from Wikimedia Commons). services: Google Translate6 , Bing Translator7 , and Yahoo!
Babel Fish8 . The resulting terms in Spanish were associated
to the corresponding English term’s emotions and polarity.
name convey the emotion to be captured. In particular, we
focus our analysis on nouns and adjectives. President’s Emotional Vector
Our emotion detection approach comprises the following We define the emotional vector, ep , for president p as
procedure: follows: Let Tp be the set of terms extracted from the
1) Create a profile for each president president’s profile p, and Tm the set of all terms in EmoLex
2) Extract the terms from the profile annotated with emotion m, where m ∈ M ; M := {joy, sad-
3) Associate to each term an emotion and polarity based ness, anger, fear, trust, disgust, anticipation, surprise}, i.e.,
on an emotion lexicon Plutchik’s eight basic emotions. Then, the mth dimension
4) Compute the emotion vector and polarity for each of emotional vector ep ∈ R|M | is given by:
president X
First, we build a profile for each of the 18 presidents. ep [m] := Im (t)
t∈Tp
The profile consists of all tweets and blog post snippets
collected for the corresponding president. After building where Im (t) is an indicator function that outputs 1 if the
the profiles, we use TreeTagger to perform part-of-speech term t ∈ Tp is associated to emotion m, and 0 otherwise.
tagging on each of them [8]. Then, based on the output of Finally, we normalize vector ep to produce a probability
TreeTagger, we extract the nouns and adjectives. Finally, we vector
use a term-based matching technique to associate each term
with emotion and polarity values. ep
eˆp =
We used in our study the NRC Emotion Lexicon NM
(EmoLex), a large set of human-provided word emotion where NM is a normalization constant that corresponds to
association ratings. EmoLex was created by crowdsourcing the total number of terms t ∈ Tp associated to an emotion.
to Amazon’s Mechanical Turk, and it is described in [9]. For example, the emotional vector for the president of El
The lexicon contains 3292 distinct words5 annotated ac- Salvador over dimensions [joy, sadness, anger, fear, trust,
cording Plutchik’s psychoevolutionary theory of eight basic disgust, anticipation, surprise], corresponds to : [0.08, 0.18,
emotions, which form four opposing pairs, joy–sadness, 0.14, 0.21, 0.15, 0.17, 0.03, 0.04] (see Figure 5i). Note that
anger–fear, trust–disgust, and anticipation–surprise [10]. the components of the probability vector add up to 1, and
This emotion contrast is shown in Figure 2 by the spatial each of them is a positive number between 0 and 1.
opposition of these pairs. In addition, EmoLex also includes
positive and negative sentiments associated to the words. 6 Google Translate: http://translate.google.com/
7 BingTranslator: www.microsofttranslator.com/
5 We used EmoLex version 0.5 for academic institutions. 8 Yahoo! Babel Fish: http://babelfish.yahoo.com/
President’s Polarity positive negative

Cristina Fernández (Argentina)


Similarly as in the case of emotions, we compute the
Evo Morales (Bolivia)
polarity tuple (positive, negative)p of president p as follows:
Dilma Rouseff (Brazil)
Sebastián Piñera (Chile)
P P
t∈Tp I+ (t), t∈Tp I− (t))
( Juan Manuel Santos (Colombia)
(positive, negative)p = Laura Chinchilla (Costa Rica)
Npolarity
Leonel Fernández (Dominican Republic)
Rafael Correa (Ecuador)
where the indicator functions are defined analogously as
Mauricio Funes (El Salvador)
in the case of the emotions, and the normalization constant
Otto Pérez Molina (Guatemala)
Npolarity is the sum of terms with a polarity value assigned.
Porfirio Lobo (Honduras)
Felipe Calderón (Mexico)
III. R ESULTS Daniel Ortega (Nicaragua)
Ricardo Martinelli (Panama)
In this Section, we present the results of our investigation. Fernando Lugo (Paraguay)
First, we explore the polarity of the terms in each president’s Ollanta Humala (Peru)
profile. Second, we will analyze the emotions associated to José Mujica (Uruguay)
each president, and the extracted patterns that could explain Hugo Chávez (Venezuela)
the degree of acceptance of each political figure. 0%# 10%# 20%# 30%# 40%# 50%# 60%# 70%# 80%# 90%# 100%#

As reference we use a survey report published in April


2012 by the Mexican analytics group Consulta Mitofsky [7]. Figure 3: Polarity Analysis
The report is a compilation of the results of individual
opinion polls of Latin American presidents. Note that the
president of Guatemala was not included in this survey
report, hence, for this particular case we used the results
of another poll published in December 2011 [12].
Emotional Pattern Analysis and Opinion Poll

Polarity Detection Can the polarity or emotions, alone or combined, explain


the (un-)popularity of a particular political figure? and what
The polarity detected for each president is shown in are the most influential emotions that could help predict the
Figure 3. Polarity detection provides a quick overview of outcome of a traditional opinion poll? In this section, we
the sentiment conveyed by the terms co-occurring with the seek to shed some light on these questions.
presidents’ name, but it is too coarse grained, and as we In Table II, we contrast the opinion poll results, polar-
will discuss later in this section, it does not fully explain ity (positive–negative), and Plutchik’s eight basic emotions
the popularity as measured by the opinion poll. opposing pairs: joy–sadness, anger–fear, trust–disgust, and
anticipation–surprise. The opinion poll reflects the percent-
Emotion Detection age of people’s approval with respect to the corresponding
president’s job performance [7], [12]. if the percentage of
In order to illustrate the terms behind each of the emotions approval is strictly over 50% we depict a black-filled triangle
extracted, we present in Figure 4 a tag cloud per each pointing up (N), otherwise an empty-filled triangle pointing
emotion considered. Each emotion tag-cloud includes the down is presented (O). In the case of polarity and emotion
top-25 most frequent terms aggregated over all presidents9 . pairs, for each pair A–B, if the emotional dimension A is
The president emotional vectors are visualized as pie grater than emotional dimension B, i.e., A > B, then a
charts in Figure 5. ‘N’ is shown, if A == B, then a symbol equal (‘=’) is
We can observe that the emotion analysis provides more presented, and finally, a ‘O’ is depicted if A < B.
insights on the perception of the presidents in the social We can observe that neither the polarity extracted, nor the
stream, than the polarity value alone. For example, in the single emotion pairs alone can fully explain the results of
case of the Mexican president F. Calderón, he has a negative the opinion poll.
polarity value of 54%, which can be better qualified by the In order to analyze this outcome more in depth, we cast
predominant emotions extracted from his profile, namely: the problem as a binary classification problem [13], where
sadness, anger, fear, and disgust. the objective is to predict whereas the public would approve
(N) or disapprove (O) the president’s job, we associate the
9 The tag clouds were created using Wordle: http://www.wordle.net numerical label values of +1 and −1, correspondingly.
(a) Joy (b) Trust (c) Fear (d) Surprise

(e) Sadness (f) Disgust (g) Anger (h) Anticipation


Figure 4: Tag clouds for each emotion. The terms presented in each tag cloud correspond to the top-25 most frequent terms
aggregated over all presidents. For (a) joy, the salient terms are happy, friend, special, truly, and powerful (feliz, amigo, especial,
verdadero, poderoso); for (b) trust, the terms important, friend, truly, and main (importante, amigo, verdadero, principal) are the
most mentioned; the most popular terms in the emotion (c) fear are bad, military, urgent, terrorist, and dictator (malo, militar,
urgente, terrorista, dictador); for (d) surprise, the main terms are urgent, crazy, terrorist, violent, and different (urgente, loco,
terrorista, violento, diferente); for (e) sadness the salient terms are bad, and dead (malo, muerto); for (f) disgust, the terms bad,
strong, prisoner, false, and powerful (malo, fuerte, preso, falso y poderoso) are the most mentioned; the most popular terms in
the emotion (g) anger are bad, false, powerful, terrorist, and expensive (malo, falso, poderoso, terrorista, caro); for (h) anticipation,
the main terms are urgent, early, powerful, superior, and successful (urgente, pronto, poderoso, superior, exitoso).

ID Country opinion poll positive–negative joy–sadness anger–fear trust–disgust anticipation–surprise


P1 Argentina N N N O N N
P2 Bolivia O N O O O O
P3 Brazil N N N O N N
P4 Chile O N O O O =
P5 Colombia N O O O O N
P6 Costa Rica O O O O N N
P7 Dominican Republic N N N O N N
P8 Ecuador N O O O = N
P9 El Salvador N N O O O O
P10 Guatemala O N N O N N
P11 Honduras O O O = N O
P12 Mexico O O O O O =
P13 Nicaragua N N N O = =
P14 Panama O O O O O O
P15 Paraguay O N O O O N
P16 Peru N N N O O N
P17 Uruguay O N N O N O
P18 Venezuela N N O O O =

Table II: Contrast of opinion poll results, polarity (positive–negative), and Plutchik’s eight basic emotions opposing pairs: joy–
sadness, anger–fear, trust–disgust, and anticipation–surprise. (Presidents are listed alphabetically by country name).
As training vectors we will use president’s emotional vec- score pair. Note that the negative weight for the pair trust–
tors based on the emotion opposing pairs, plus the polarity disgust suggests a negative correlation with the opinion poll
dimensions, i.e., the training instance, Xp , for president p, results.
is defined as follows:

Xp := [Fpositive–negative , Fjoy–sadness , Fanger–fear ,


IV. R ELATED W ORK
Ftrust–disgust , Fanticipation–surprise ] ,

where FA–B , corresponds to the binary feature for polarity Social media provides a rich and diverse source of users’
or emotion pair, A–B. FA–B is equal to +1, if A > B; equal opinions that has shown great potential for political analysis.
to 0, if A == B; and equal to -1, if A < B. For example, Demartini et al. [6] apply sentiment and
For example, the corresponding training instance for the time series analysis techniques on blog data to estimate
Argentinian president, is as follows: XP 1 = [1, 1, −1, 1, 1], the temporal development of opinions for two candidates,
and the corresponding label is +1, and for the president Obama and McCain, during the US presidential elections
of Mexico, the training instance corresponds XP 12 = in 2008. In contrast, our work applies sentiment analysis
[−1, −1, −1, −1, 0], with a label −1 (see Table II). on tweets and small blog post snippets referring to Latin
The predicted value of the poll is given by: American presidents to estimate the effect that each of them
has in the emotions reflected on user generated content.
poll(p) = w ~p
~ ·X Another contrasting aspect is that we do not limit the
analysis to candidates within the same country, but we study
where w ~ is the weight vector to be learned. Our objective is the presidents of eighteen Latin American countries.
to fit a linear model, e.g., using a Support Vector Machine
Andranik Tumasjan et al. [11], examine whether Twitter
(SVM), that is able to predict the binary value of the opinion
is a vehicle for online political deliberation by looking
poll. Please note that given the small amount of training
at how people use microblogging to exchange information
data, we do not expect that this model will generalize well
about political issues. The authors evaluate whether Twitter
for unseen data, our objective is to discover what are the
messages reflect the current offline political sentiment in a
emotional features that are more influential. To this end, we
meaningful way and analyze whether the activity on Twitter
will analyze the learned weights w. ~
can be used to predict the popularity of parties or coalitions
As metric, we use the Area Under ROC Curve (AUC) [14] in the real world. Similarly, we analyze tweets and blog
to measure how well the linear model fits the data. The snippets as a source of political sentiment, however, our
AUC value will always be between 0.0 and 1.0, being the aim is to show how from these data, it is possible to build
best classifiers the ones with a higher AUC value. A random a profile of political figures, but in particular, current Latin
guess has an AUC equal to 0.5 . American presidents. Furthermore, our analysis is performed
We train the SVM model using Stochastic Gradient De- outside elections period, which gives us interesting insights
scent [15], with a learning rate equal to 0.001 and 5 epochs of how users feel with respect to actual presidents on a daily
of 10000 iterations each, the regularization parameter is set basis. The geographical area of our study and the language
to 0.0, since as we mentioned before, we are interested in of the analyzed tweets and snippets are contrasting aspects
fitting the model to the observed data. The averaged model as well, since we consider the area of Latin America and
weights obtained after repeating the procedure 100 times, we analyze content written in Spanish.
are as follows:
Johan Bollen et al. [16] explore how public mood patterns
~ = [0.249, 1.748, 0.265, −1.192, 1.694] .
w relate to fluctuations in macroscopic social and economic
indicators in the same time period. The authors perform a
This particular model achieves an AUC = 0.81. If we use sentiment analysis of Twitter data using an extended version
only polarity scores as predictors, the AUC drops signifi- of a psychometric instrument, the Profile of Mood States
cantly to 0.61. This indicates that polarity analysis is limited (POMS). Our work, on the contrary, focuses on how to
for popularity prediction, and a combination of emotions is exploit Social Media content (tweets and snippets) to build
a better approach for short-term popularity forecasts. a profile of each one of the Latin American presidents
Roughly speaking, a high positive (resp. negative) weight based on what people write about them and the sentiment
indicates that presidents with these emotional features should expressed on the used vocabulary. Yet another contrast is
be approved (resp. disapproved) by the people. The fea- the fact that we do not limit our analysis to the emotion or
tures corresponding to the emotional pairs joy–sadness and polarity extraction, but also exploit Plutchik’s four opposing
anticipation–surprise are the dominant terms of the expres- emotions pairs to train a linear model using SVM that pro-
sion, with weights 1.748 and 1.694, respectively. The pair vides insights on which combination of emotions explains
of emotions sadness–fear has a weight still over the polarity the outcome of people’s opinion.
8% 1% 2% 20% 7% 7% 8% 1%
anger trust anger anger 7% 10% trust anticipation 2% 20% anti
anger joy
14% 15% trust 15% anger3%
18% 15% 18% 10% trust anger
21%
anger trust
anger 10% 12%
14% 16% 14%
16%

disgust
17% disgust disgust
disgust disgust
16% 16% 16%
fear disgust fear 17%
18% 16% fear fear
fear 18%
disgust disgust 20% 17%
21%
fear 18% fear 34%
19% 21%
sadness sadness sadness sadness
surprise sadness
18% surprise
23% 16% surprise 11% sadness surprise surprise 23% surprise
surprise
sadness
4% 2% 2% 12% sadness 2%
5% 4% 2% 18%
12%

Cristina Fernández (Argentina) José Mujica (Uruguay)


Evo Morales (Bolivia) Dilma Rouseff
Hugo(Brazil)
Chávez (Venezuela) Evo Morales (Bolivia)
anticipation anticipation
anticipation joy anticipation Sebastián Piñera (Chile)
anticipation joy Cristi
6%joy anticipation joy trust
11% joy 4% trust 10% 9% joy anticipation 4% joy 6%
anger 6% 6% 11% 15% anticipation
19% 15% 19% 6% 11%
anger 9% anger 11%
anger anger
trust
anger 16% anger
trust 16% trust
13% 14%
14%
9% 10%22% 14% anger
disgust 9%
16%
disgust disgust trust
trust 13% disgust 16% disgust fear
13% disgust fear disgust disgust
22% 20% 17% 17% 18% 20% 13%
18%
sadness fear
fear 17%
8% sadness
fear 20%
sadness
fear 9%
20% fear sadness
10% sadness surprise sadness surprise sadness
surprise 11% 16% surprise surprise sadness 20% 16%surprise 10%
9% surprise 15% 5%
5% 5% 4% 13% 6% sur
6%

(a) (b) (c) (d)


Laura Chinchilla (CostaLugo
Fernando Rica)(Paraguay) Leonel Fernández
Daniel Ortega (Nicaragua) Rafael Correa (Ecuador)
Juan Manuel Santos (Colombia)
anticipation joy
anticipation (Dominican Republic)
anticipation anticipation
Otto Pé
anticipation joy
4% joy 3%
joy Ollanta Humala (Peru)
7%
anticipation
7% 8% trust anger3% 7% 10% trust anger joy anticipation
joy
10% ang
anticipat
anger 10% 4% 11% joy
15% trust 12% anger 15% 7%
23% 5%18
15% anger 13% trust anger
21% 13% anger
anger 14% 16% 10%
16% disgust 10%
13% trust disgust
disgust trust 16% 10%
disgust 13% 19% disgust
fear trust
16%
disgust 17%
disgust
disgust sadness 16%
fear 16%
20%
13% 5%
disgust 20% 34%
sadness
18% fear fear
sadness 16% surprise
19% 21%
surprise 13% fear fear sadness 3%
sadness fear
fear
sadness surprise surprise sadness 2% surprise surprise21% 18%sadness
11% surprise
16% 22%
12% 4% 2% 12% 2% 3% 9% surprise 5% 22%
3%
(e) (f) Fernando Lugo (g) (Paraguay) (h)
Mauricio Funes(Venezuela)
Hugo Chávez (El Salvador) José Laura
Mujica Chinchilla
Sebastián
(Uruguay) Piñera(Costa
(Chile)Rica) anticipation Dilma Rouseff (Brazil)
anticipation Otto
Otto Pérez
Pérez Molina
Molina (Guatemala) joy
(Guatemala)
anticipation
Porfirio Leonel Fernández
Lobo (Honduras)
joy (DominicanFelipe Juan Manuel
Calderón Santos (Colombia)
(Mexico)
anticipation joy anticipation 3%
Leonel Fernández 10% an
3% joy
8%
anticipation
anticipation
anticipation 6% 4% joy 7%
joy anticipation joy Republic)
trust trust anticipation
anticipation anticipation joy joy
6%
anger 11% trust 5%6%5% 10% 11% anger (Dominican
1% Republic)
2% 20% 12% 9% 3% 7% joy 8%
anger anger joy joy anger anticipation 8% trust trust
anger
anger anger 10%
trust joy 19%
anger14% 15%
trust 15% 21% 21% anticipation
trust18% 4% anger 13% 15% ange
10% 9%
10% 14% 21% anger joy 15% 16%
13% 14% 4%
14% anger
16%
trust anger 13% 15% anger
10%
disgust
disgust 22% 13% 16%
10% 10%
disgust
disgust
16% disgust disgust trust
17%
sadness disgust disgust
disgust trust fear disgust
19%
trust
disgust sadness 16% disgust 13% 17% 16%
5% 5% 17% 16%
13% 19%fear 13%
disgust fear disgust
17% 34%fear 19% fear
fear trust trust 18% 17% 18%
21%
fear 24% 24% 17% disgust 20%
surprise
sadness
surprise fear 18%
20%
3% 3%
8% 19% sadness
sadness fearfear sadness
sadness fear sadness surprise
sadness surprise sadness sadness sadnesssurprise2% 9% fear
surprise
18% surprise
surprise 22%surprise 15%
16% 20% surprise
13%
13% fear sadness sadness surprise
13% 22% 6%
2% 23% fear
12%surprise
2% surprise
21% 20%
3%
4% 21% 12%
6% 9% surprise 21% 4% 4%
2% 2%
(i) (j) (k) (l)
Laura ChinchillaJosé (Costa Rica) Juan Manuel Santos (Colombia)
Daniel Ortega (Nicaragua) Fernando Lugo (Paraguay)
DanielFernández
Cristina Ortega (Nicaragua)
(Argentina) Ricardo Ollanta
Martinelli (Panama) Humala
Mujica (Peru)
(Uruguay) Fernando Lugo (Paraguay)
Evo Morales (Bolivia) Ollanta Humala (Peru) (Venezuela)
Hugoanticipation
Chávez Otto
Ricardo
anticipationMartinelli anticipation joy
anticipation(Panama)
anticipation joy anticipation joy antic
anticipation anticipation
anticipation 3% anticipation
4% joy
joy 7%trust joy anticipation
7% joy
3% 8%
joy
trust trust anger3% joy
anticipation joy 10% trust 5
anger joyjoy anticipation
5% 7%
6% joy 10% 13% 3%
4% anger 6%10% trust15% 7% 6% 10%
11% anger 5% anger 6% 6% 13% trust 15% joy 13% 11% anger
11% 19% anger trust 11% anger 12%
23% anger 9% 13% anger
anger 23% 10%
15%
anger anger 12% anger trust
14% 10% 21% 10%
16% 10%
anger 14% trust
16% 13% 14%
disgust 22%trust disgust trust disgust
9%
13% disgust 16% 13% 16% 10%
16%
disgust disgust disgust disgust fearsadness
13% trust disgust
16% 20% disgust fear
fear 20% trust disgust disgust 17% 5%
disgust fear fear
13%
22% 19% disgust
18% 17%
20% 13% 17% 34%
19% 23% fear 34%disgust 20%
sadness sadness 23% 18% sadness fear surprise
16% 8% fear 16% 20% 3%
sadness fear
19% fear
10% fear
fear sadness
surprise 20%fear sadness
surprise
surprise fear sadness sadness
surprise
sadness
sadness 22% sadness
16% surprise 22% sadness 2%
surprise
surprise 18%
11% 9% surprise
surprise sadnesssurprise
5%2% 18% 9% surprise 13% surprise 2
sadness
14%
16% surprise
9% 12% 12%
3%
5% 14% 6%surprise
3% 2% 12% 3% 4% 3% 6%
6%
(m) (n) (o) (p)
Hugo Chávez (Venezuela) Sebastián
(Mexico) Piñera (Chile)
Leonel Fernández (Dominican Felipe Calderón (Mexico)
José Mujica (Uruguay) Leonel Fernández (Dominican Felipe Calderón
anticipation
anticipation anticipation joy
Republic) anticipation anticipation
joy joy 6%
joy
Republic) 3% 6% joy
11%
6% 3% 10% 8% 11% 8% trust
anticipation trust anticipation anger anger
4% joy anger anger anger joy 16% 13% trust
13% 4% trust 14%
anger 15% 9% 16% 13% anger 15% 14%
trust 14%
13% 13%
22%
disgust
disgust trust 16% trust
disgust disgust
13% 19% disgust disgust 19% fear
disgust fear 13% 17%
17% 19% 17%
19% 17% fear
fear 17%
sadness 20%
sadness 8% fear sadness sadness
sadness surprise surprise
13% fear sadness 20% surprise sadness 13% surprise fear 21%
15% 3%
surprise 21% surprise 13% 6%
21% 3% surprise
6% 21%
2% 9%
2%

(q) (r)
Figure 5: Emotions Detected for each President.
V. C ONCLUSION AND F UTURE W ORK ACKNOWLEDGMENT
This work is partially supported by EU FP7 Project
We show in this work how the real-time nature of social CUBRIK (contract no. 287704).
media streams, in particular, Twitter, can be leveraged to take
the pulse of political emotions in emerging regions of the R EFERENCES
world, namely: Latin America. We performed a sentiment [1] @twittereng, “200 million tweets per day,” Twitter Blog. http:
analysis of tweets and brief blog posts over a period of //goo.gl/eybp0, June 2011.
six months. We applied a term-based method to detect the [2] Semiocast, “Countries on Twitter,” http://goo.gl/RfxZw, 2012.
[3] J. M. Wiebe, “Tracking point of view in narrative,” Comput.
polarity and emotions associated to eighteen Latin American Linguist., vol. 20, no. 2, pp. 233–287, Jun. 1994.
presidents. [4] P. S. Dodds, K. D. Harris, I. M. Kloumann, C. A. Bliss,
We conclude that the extracted polarity and isolated emo- and C. M. Danforth, “Temporal patterns of happiness and
tions alone, are not good predictors for the opinion captured information in a global social network: Hedonometrics and
twitter,” PLoS ONE, vol. 6, no. 12, p. e26752, 12 2011.
in an independent opinion poll. But the linear combination [5] S. Mohammad, “From once upon a time to happily ever after:
of basic emotions opposing pairs and polarity, achieved a Tracking emotions in novels and fairy tales,” in Proceedings
prediction performance of 81% in terms of AUC, whereas of the 5th ACL-HLT Workshop on Language Technology for
using polarity alone the AUC dropped to 61%. We also Cultural Heritage, Social Sciences, and Humanities. Port-
noticed that the pair of emotions joy–sadness dominated the land, OR, USA: Association for Computational Linguistics,
June 2011, pp. 105–114.
model. [6] G. Demartini, S. Siersdorfer, S. Chelaru, and W. Nejdl, “An-
Future work includes an online evaluation of our ap- alyzing political trends in the blogosphere,” in International
proach, where the global patterns discovered in the stream AAAI Conference on Weblogs and Social Media, 2011.
are immediately fed-back to the social web. We are inter- [7] Consulta Mitofsky – www.consulta.mx, “Aprobación de Man-
datarios América y El Mundo,” http://goo.gl/8fFNU, April
ested in measuring how this information and awareness can 2012.
affect individual and collective behavior. [8] H. Schmid, “Probabilistic part-of-speech tagging using deci-
All the 165,484 documents analyzed were in Spanish, but sion trees,” in Proceedings of the International Conference
the emotion lexicon was in English. Previous approaches on New Methods in Language Processing, Manchester, UK,
1994.
dealing with multilingualism in sentiment analysis use ma- [9] S. M. Mohammad and P. D. Turney, “Crowdsourcing a word-
chine assisted translation to translate all documents to the emotion association lexicon,” Computational Intelligence,
target language, usually English. However, our aim is to 2011.
process the social media stream in real-time, therefore the [10] R. Plutchik, A General Psychoevolutionary Theory of Emo-
translation of documents arriving at high speed becomes tion. New York: Academic press, 1980, pp. 3–33.
[11] A. Tumasjan, T. Sprenger, P. Sandner, and I. Welpe, “Pre-
problematic. Instead, we propose to translate the emotion dicting elections with twitter: What 140 characters reveal
lexicon itself to the original language of the documents, from about political sentiment,” in International AAAI Conference
English to Spanish. The translation of the lexicon is done on Weblogs and Social Media, 2010.
offline and only once. This methodology allowed a more [12] CID-Gallup – www.cidgallup.com, “Encuesta de Opinión
flexible architecture and the possibility of processing the Pública Centro América y República Dominicana,”
http://goo.gl/lc85v, December 2011.
documents more efficiently in real-time and in their original [13] C. M. Bishop, Pattern Recognition and Machine Learning
language. (Information Science and Statistics), 1st ed. Springer, 2007.
Our study has limitations that have to be considered for [14] T. Fawcett, “An introduction to roc analysis,” Pattern Recog-
future extensions, for example, the data collection process nition Letters, vol. 27, no. 8, pp. 861 – 874, 2006.
[15] L. Bottou, “Large-scale machine learning with stochastic
can be refined to consider more sophisticated named entity gradient descent,” in Proceedings of the 19th International
recognition, and not just the exact match of the names, Conference on Computational Statistics (COMPSTAT’2010),
moreover, the term-based sentiment analysis techniques that Y. Lechevallier and G. Saporta, Eds. Paris, France: Springer,
we applied are fast, but they do not consider complex context August 2010, pp. 177–187.
or senses, e.g., irony and sarcasm, that would be interesting [16] J. Bollen, H. Mao, and A. Pepe, “Modeling public mood and
emotion: Twitter sentiment and socio-economic phenomena,”
to explore. Additionally, a lexicon specialized in political in International AAAI Conference on Weblogs and Social
sentiment, in the target language studied, could bring bene- Media, 2011.
fits in the precision of the sentiment extracted, such a lexicon
could be built using crowdsourcing techniques, as in the case
of EmoLex, the one we used in this study.
Finally, we hope that this paper provides some insight
into the future of short-term forecasting of political figures
popularity, both during electoral campaigns and during pres-
idential mandates.

You might also like