Abstract: Sentiment analysis is a study that determines opinions by classifying text. The documents used in this research are tweets expressing public opinion about community satisfaction with the performance of the incumbent. The method used is Appraisal Theory. The data comprise 1587 Jokowi-related tweets, 1774 Ministry-related tweets, and 1337 government-related tweets. The analysis shows that people hold positive sentiments toward the incumbent. Innovation is the term with the highest positive sentiment, whereas imaging is the term with the lowest value for negative sentiment.

Keywords: sentiment analysis; appraisal theory; twitter

I. INTRODUCTION
The upcoming general election in 2019 is expected to be a concern of the whole community. This includes the incumbent, who has the opportunity to hold the wheels of government for two periods. The incumbent's performance is now a benchmark of electability for the 2019 election. The level of electability can be reflected in public opinion on the level of satisfaction with the current government's performance [1].

The rapid development of the internet spans news media to social media. The most popular social media among mobile internet users in Indonesia include Facebook, Instagram, Twitter, Google+, Path, Snapchat, Tumblr, and so on [2]. Social media today is not limited to friendship; it also functions as a medium for marketing, advertising, and the delivery of opinions and aspirations [3].

Public opinion about the current government can be seen through social media [4]. One such platform is Twitter. Twitter has the fifth largest number of users in the world, with 4.1 billion tweets in 2016 alone [5]. This rapid development has made Twitter one of the most fertile places for people to share their opinions on government performance. Public opinion on Twitter is an opportunity for the incumbent to gauge community satisfaction.

Sentiment analysis is done to view opinions about a problem. One theory used in sentiment analysis is appraisal theory, which explains the language used in communicating feelings and opinions [6]. In this research, sentiment analysis is used to observe the public's opinion of the incumbent based on Twitter data from 2017. The measured opinion can serve as a parameter of public satisfaction with the incumbent's performance, with the aim that sentiment analysis can be used to measure the incumbent's electability in the 2019 election.

II. LITERATURE STUDY

A. Sentiment Analysis
Sentiment analysis, also known as opinion mining, is a study to determine public opinion about services, products, films, and so on. The need for sentiment analysis has become very important [7]. When people make decisions, they refer to the opinions of others. Almost all companies conduct market surveys and research to capture public opinion about their products; likewise, government performance can be assessed by studying public opinion [7]. The need for sentiment analysis is growing substantially, especially with the development of social media as a place for society to express its opinions. Sentiment analysis has been widely used to examine social media content. The authors of [8] developed a system that analyzes social media content and provides sentiment predictions in real time using AsterixDB. In addition, [9] proposes a semantic approach to determining user attitudes and business insights based on sentiment analysis results from Arabic social media. Sentiment analysis was also used by [10], which introduced SmartSA, a lexicon-based sentiment classification system for social media genres. The approach in [10] hybridizes a general-purpose lexicon and is shown to improve sentiment classification results.

B. Appraisal Theory
Appraisal theory describes how language is used to communicate with others [11]. Appraisal theory is a system of interpersonal meaning. Appraisal is used to negotiate social relations between people,
978-1-5386-8402-3/18/$31.00 ©2018 IEEE
Proceeding of EECSI 2018, Malang - Indonesia, 16-18 Oct 2018
by telling the reader what is felt about things and people. Three aspects are explored in the discussion of the appraisal system, namely attitudes, how the attitude is expressed, and the source of the engagement [12], as explained below:
1. Attitudes relate to the evaluation of objects, character, and feelings. They are divided into three types of evaluation: affect (feelings of people), judgment (character of people), and appreciation (value of an item).
2. Appreciation concerns opinions about a person's inner and outer qualities, such as shy, ugly, or beautiful, and about a person's behavior in a social context, such as heroic or weak.
3. Engagement relates to the source of the attitude and is divided into two types: (1) heterogloss (the attitude comes from sources other than the author) and (2) monogloss (the attitude derives only from the author). In other words, a heterogloss attitude is not derived only from the author.

C. Data Mining
Data mining is the process of finding new, meaningful patterns and descriptive, easy-to-understand, predictive models in large-scale data [13]. Data mining can also be viewed as an interdisciplinary field drawing on statistics, machine learning, information retrieval, pattern recognition, and bioinformatics. It is widely used in domains such as retail, telecommunications, finance, the social sector, and more [14]. Data mining is part of a process called Knowledge Discovery in Databases (KDD) [13]. Generally, the processes in KDD are:
1. Data Cleansing (eliminating inconsistent data);
2. Data Integration (merging data from various sources);
3. Data Selection (relevant data is taken from the database for analysis);
4. Data Transformation (data is transformed and consolidated into an appropriate form for aggregation);
5. Data Mining (an important process where intelligent methods are applied to extract data patterns);
6. Pattern Evaluation (identifying interesting patterns that represent knowledge based on certain measurements);
7. Knowledge Presentation (visualization techniques and representations of knowledge used to present that knowledge to users).

D. Text Mining
Text mining is the part of data mining that focuses on textual data. The text mining process is carried out in phases to obtain words from text data that have potential as variables of interest [15]. The following process is used in this research to obtain such a set of words:

1. Cleansing. Cleansing removes unnecessary items to reduce noise. The omitted items are URLs, hashtags (#), usernames (@username), and email addresses. Punctuation marks such as periods (.), commas (,), and other punctuation are also omitted.
2. Case Folding. Case folding converts all words to the same case, either all lower case or all upper case.
3. Stopword Removal. Filtering removes frequently occurring common words that have little relevance to the text. The words to be discarded are defined in the stopword list.
4. Stemming. Stemming reduces an affixed word to its root form. For example, the word "omitted" will become "lost".
5. Tokenizing. Tokenizing splits the text into a sequence of words delimited by spaces or special characters.

E. Citizen Satisfaction
Satisfaction reflects people's judgment of the extent to which a product's performance meets customer expectations [16]. If performance falls short of expectations, customers are disappointed. If it matches expectations, customers are satisfied. If it exceeds expectations, customers are very happy.

Public satisfaction is the result of the public's opinion and assessment of the performance of services provided by the apparatus of public service providers. The elements of service are the factors or aspects of service provision to the community that serve as variables in preparing a community satisfaction survey to determine the performance of service units [17].

III. RESEARCH METHOD

A. Data Collection
Data collection utilizes data from the social media platform Twitter. With the Twitter API available, the public can crawl tweets based on a required query. In this study, the query used for the data collection phase is "jokowi" OR "ministry" OR "government". The three words are considered to be connected with the object of research and are frequently tweeted by the public when expressing opinions on Twitter. In addition, the tweets used in this study are in Indonesian. Table 1 details the amount of data collected based on successfully crawled tweets.
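As a rough illustration of the keyword query described above, the following sketch builds the OR query string and counts how many tweets mention each keyword. It makes no Twitter API calls; the helper names `build_query` and `count_by_term` and the sample tweets are hypothetical and are not taken from the study.

```python
import re

# Query terms as stated in the text.
QUERY_TERMS = ["jokowi", "ministry", "government"]


def build_query(terms):
    """Join the terms into a boolean OR query string."""
    return " OR ".join(f'"{term}"' for term in terms)


def count_by_term(tweets, terms):
    """Count how many tweets mention each term (case-insensitive, whole word)."""
    counts = {term: 0 for term in terms}
    for text in tweets:
        words = set(re.findall(r"\w+", text.lower()))
        for term in terms:
            if term in words:
                counts[term] += 1
    return counts


if __name__ == "__main__":
    # Invented placeholder tweets, for illustration only.
    sample = [
        "Jokowi visits Papua",
        "The ministry and the government agree",
        "nothing relevant here",
    ]
    print(build_query(QUERY_TERMS))
    print(count_by_term(sample, QUERY_TERMS))
```

A tweet that mentions more than one keyword is counted once per keyword in this sketch; whether the study counted such tweets this way is not stated.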
TABLE 1
Query         Number of tweets
jokowi        1587
ministry      1774
government    1337

B. Data Preprocessing
Data preprocessing cleanses and transforms the data collected in the previous stage. Several steps are taken to obtain the final data used for word weighting and sentiment analysis to assess community satisfaction with the President of the Republic of Indonesia. Figure 1 shows how the preprocessing stages are performed.

Cleansing Results
The president appears from the car sunroof, Nabire Papua residents welcome him up to Nabire Regency, fantastic

Case Folding Results
the president appears from the car sunroof, nabire papua residents welcome him up to nabire regency, fantastic

Tokenizing Results
president    papua
appear       welcome
from         him
sunroof      up
car          to
residents    regency
nabire       nabire
             fantastic

Stemming Results
president    papua
appear       welcome
from         him
sunroof      up
car          to
residents    regency
nabire       nabire
             fantastic

The output of data preprocessing is a collection of words used by the researchers to search for terms as analysis material in the scoring stage. Table 3 lists the terms obtained after a rational and emotional evaluation.

TABLE 3 Sentiment Terms

C. Scoring
Based on previous research [6], Figure 2 is a proposed model for the scoring library range that will be used in the study. Based on the scoring library range, every positive or negative term is scored using data that has been preprocessed up to the case folding stage.
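Because scoring operates on text that has already passed through the preprocessing stages, a minimal sketch of cleansing, case folding, tokenizing, and stopword removal is given here. It is an illustration under stated assumptions, not the authors' implementation: the regular expressions, the function names, and the tiny `STOPWORDS` set are hypothetical, and stemming is left out because it would require an Indonesian stemming library.

```python
import re

# Hypothetical stopword list for illustration; the study's actual list is not given.
STOPWORDS = {"the", "from", "to", "up"}

URL_RE = re.compile(r"https?://\S+|www\.\S+")
EMAIL_RE = re.compile(r"\S+@\S+\.\S+")
MENTION_HASHTAG_RE = re.compile(r"[@#]\w+")
PUNCT_RE = re.compile(r"[^\w\s]")


def cleanse(text):
    """Remove URLs, email addresses, @usernames, #hashtags, and punctuation."""
    text = URL_RE.sub(" ", text)
    text = EMAIL_RE.sub(" ", text)  # run before mentions so addresses go as a whole
    text = MENTION_HASHTAG_RE.sub(" ", text)
    text = PUNCT_RE.sub(" ", text)
    return re.sub(r"\s+", " ", text).strip()


def case_fold(text):
    """Convert the text to lower case."""
    return text.lower()


def tokenize(text):
    """Split the text on whitespace."""
    return text.split()


def remove_stopwords(tokens, stopwords=STOPWORDS):
    """Drop tokens that appear in the stopword list."""
    return [token for token in tokens if token not in stopwords]


def preprocess(text):
    """Run cleansing, case folding, tokenizing, and stopword removal in order."""
    return remove_stopwords(tokenize(case_fold(cleanse(text))))


if __name__ == "__main__":
    print(preprocess("The president appears from the car sunroof, fantastic"))
```

Keeping each stage as its own function makes it easy to stop after case folding, which is the point at which the text says scoring is applied.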
Here is an example of using the scoring library range:

Jokowi has a remarkable innovation for Indonesia.
T1 A

Abandoned development becomes a government problem
A T2

Ministry of Finance has not improved the problem of corruption
T3 A

Description:
• A: Appraisal
• T1: Target 1 (Jokowi)
• T2: Target 2 (Government)
• T3: Target 3 (Ministry)

Calculation:
• Targetscore T1 = 3
• Targetscore T2 = 2
• Targetscore T3 = 2
• Total Targetscore = 8 (Positive)

Figure 4 is a graph of total sentiment based on the T1 target score. The figure shows that innovation is the term with the highest value for positive sentiment, while imaging is the term with the lowest value for negative sentiment. Figure 5 is a graph of total sentiment based on the T2 target score. The figure shows that appreciation is the term with the highest value for positive sentiment, while hate and lie are the terms with the lowest value for negative sentiment. Figure 6 is a graph of total sentiment based on the T3 target score. The figure shows that openness is the term with the highest value for positive sentiment, while abandoned is the term with the lowest value for negative sentiment.

The third result is the term frequency of the specified terms. This result shows the terms that arise most often in the public's emotional opinions of the incumbent. The term frequency can be a basis for the incumbent to decide on steps to improve electability in the 2019 election. Table 4 gives the Term Frequency values for the predetermined terms.

D. Term Weighting
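The term frequency result described above can be illustrated with a short sketch that counts how often each predetermined term occurs across tokenized tweets. The function name `term_frequency` and the sample tokens are hypothetical; the term names are taken from the text, and the counts are not the values reported in Table 4.

```python
from collections import Counter


def term_frequency(token_lists, terms):
    """Count total occurrences of each predetermined term across tokenized tweets."""
    wanted = set(terms)
    counts = Counter()
    for tokens in token_lists:
        counts.update(token for token in tokens if token in wanted)
    # Report every requested term, including those that never occur.
    return {term: counts[term] for term in terms}


if __name__ == "__main__":
    # Invented tokenized tweets, for illustration only.
    tweets = [
        ["jokowi", "innovation"],
        ["innovation", "imaging"],
        ["government"],
    ]
    print(term_frequency(tweets, ["innovation", "imaging", "openness"]))
```

This counts raw occurrences only; any normalization or weighting applied in the study is not specified in the text.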
ACKNOWLEDGMENT
This research was supported by the PITTA Grant of Universitas Indonesia, contract number 1862/UN2.R3.1/HKP.05.00/2018.