You are on page 1of 5

A New Hybrid Model for Sentiment analysis on Turkish news

Introduction
In the modern era of technology world has become a small village. Social media has played a vital role in
shaping the social norms and values in an innovative way. It has become most popular mode of
communication, people uses various social sites like YouTube, Twitter, LinkedIn and Facebook for
information sharing. They use these sites to get connected with world. Social networks has emerged a
new source of information sharing around the globe. This network is so smart that it is available to
everyone round the clock on their PCs, laptops and on mobile phones so people can access everything any
time when they want. Every day when you get up you see news and stories of friends and of the people
you follow on social media. News agencies and broadcasters actively update their profiles, blogs and
news feeds on social media so that people get instant updates about what is happening around them.
People can discuss social and political issues while sitting apart by using this facility. It has great
influence on politics, political workers realize the importance of social media and they effectively use it.
[1]
Social media also provide great opportunity to traders and to businessmen. They have enormous
opportunities to sell their goods and products online. It has provided speedy growth to business, as people
can share and sell products worldwide. It has provided considerable large prospects to customers sitting
worldwide. Due to its fastest communication service people more relay on social media for information
sharing. [2]
Sentiment analysis identifies emotions, feelings and opinion of a person or group of people, by analyzing
written statement of that person or group. After identifying it categorizes these sates of emotions of
subjected population. It is most powerful tool to classify the attitude of writer and polarity of text. It finds
the hidden emotional state of a person who has written the text.
Sentiment analysis basically finds the polarity of a document or paragraph which is written by someone.
After finding polarity it identifies the positivity or negativity of that particular scenario in which that
statement was written and which has to be analyzed. This analysis can be used in various fields of life like
politics, judging the reliability of product, and in analyzing the users on social media like twitter. Many
companies use micro blogging to asses about the public opinion about their products. As sentiment
analysis is one of the most accurate and widely used technique around the world. So it helps companies to
produce more reliable products to their customers.
Related work
Scientists have used different linguistic structures, to discover public opinion on multiple social sites and
in daily life. In this research, author used multiple features to carry out experiments. This research has
used n-grams, lexicon, parts of speech and micro blogs. These features identifies the unigrams, bigrams
and polarity to count respective parts of speech. [3] In experimentation two folds were created, in first
fold data was trained while in second, validation of features was checked. Then they used Sentiment
analysis to discover the confident ideology of a specific group of people. [4]
Many researchers used movie reviews to classify the emotions that is either it is positive or negative. [5]
This research has discovered sentiment analysis which is widely used practice in social media, which
recognize the hidden emotions in a written paragraph. Sentiment analysis identifies the hidden opinion
and secret emotional background of a paragraph written on social site like twitter. This technique
identifies the majoritys opinion. This analysis is widely used while making governmental and political
policies. It is used to test the opinion of a certain population about a specific policy. [6]
Turkish language is among the most morphological languages. A very limited work has been done on this
language regarding morphology. So to apply sentiment analysis on Turkish language is a challenging task.

[7] Twitter is one of the most advanced and widely used social media platform. Status or massage posted
by a twitter user gets spread to his/her followers, its also been used for politics. To perform sentiment
analysis on Turkish social media a sample of tweets was collected. These tweets were than mapped on
news which were circulating on social media, than their sentiment analysis was carried out. This was the
basic methodology which was adopted. Sentiment cataloguing shows that it is very complex and
problematic than customary classification which was adopted in the past. [8]
Turkish websites promote their brands in a very innovative way, they used sentiment analysis to promote
their brands. This polarity detection research has mainly focused on brands on different websites in
Turkey. Different filters were used to carry out the analysis. In this research punctuation and exclamation
sign was removed as they dont add value. To improve vocabulary they used lexicon and open domain
solution as it was open source library. They also removed the Turkish as these words were causing
duplication. Comments from different local movies were used to develop a systematic approach to detect
the polarity. Tweets were collected and opinion finder was used to classify these tweets, then, type of
tweet was tabulated, that it was objective tweets or it was subjective tweet. In order to carry out sentiment
analysis they used Support Vector Machine (SVM). The use of SVM resulted an advantage that it was the
best model among all others. [9]
Sentiment analysis has been studied in National Football League and in English Premier League. The
researcher predicted the outcome by making categories with some specified conditions. Than the
researcher used statistical and sentimental analysis. A system of control support was designed, then by
using this system tweets were collected and classified. After examining a conditional format was used to
make decision. Best $2704 and odds only $1887 conditions were used but this was less accurate as payout
was $3011. Then it was found that positive sentiment and cultural expectance dominated as difference in
goals was 0.42 for positive sentiment while it was 0.90 goals for negative sentiment. By the analysis of
researcher it was found that positive tweets created a supremacy which shows that twitter was accurate.
[10]
Twitter lexicon was the main focus of this research work was. [11] This was developed and was further
used in sentiment analysis. To carryout different requirements n-gram was used to in order to reduce the
features. This research of brand sentiment analysis of twitter proved that with less number of lexicon
and features, increased the efficiency of sentiment. The SVM was used to carry out the comparison. This
comparison exposed that new lexicon give good results than traditional old version. Then again a
comparison was carried out between DAN2 and new lexicon. This analysis provided that DAN2 is far
better than SVM with same new lexicon. [12]
Nave Bayes methodology to distinguish between different tweets was the main focus of author in this
research. Two specific word were selected for a team of 27 trainers. Then they were asked to classify the
tweets, than results were drawn by using Naive Bayes methodology. For Nave Bayes analysis total 50
tweets were selected, among these 50 tweets two groups were made. One group was for training and one
for classification. After classifying by trainer and Nave Bayes, sentimental validation was run. A very
good result was originated that was 90% 14% accurate. [13]
Sentiment analysis of FIFA World Cup was carried out during football matches in 2014. In this
sentimental analysis twitter API has been used to cry out the research. Research on different matches
was carried out to examine the feelings of football fans. After carrying out analysis we found that US
fans emotions were changing with time and with the change in situation of the matches. [14] To
investigate the emotions of US fans, all the tweets posted within US were analyzed. Among all the tweets,
1007 tweets of first match, 1295 of the second and 2135 of third match were from US. After collection
and tabulation of distinct emotions was made, then Natural Language Toolkit (NLTK) processed the data.
[15] During post analysis NRC lexicon and lexicon word frequency was used to evaluate the basic
emotions of spectators during matches. At the end it was evaluated that US fans were positively emotional
when their team was scoring the goals and were aggressive when opponent team was scoring goals. This

analysis also showed that reaction of US supporters was full of joy when other teams were playing and
was scoring goals. At the end of this research it was concluded that spectators expressed a valid predictive
response. [16]
Summery:
After related work we concluded that Sentiment analysis has been used most of the areas where we have
to analyze the emotions, feelings and opinion. It has been used in social media, sports, marketing,
business, news, in blogging and other fields of life. After doing related work we found that we should
model a hybrid system which could perform sentimental analysis on Turkish news. In this model
integration of different features and categorization of different sentiment will be done. As Turkish
language is one of the most morphological languages so we have to develop a new model to analyze
Turkish news.

References
[1] &. C. M. C. Dimple R. Thadani, "Online Social Network Dependency:," 44th
Hawaii International Conference on System Sciences, 2011.
[2] M. S. Neti, "SOCIAL MEDIA AND ITS ROLE IN MARKETING," al Journal of
Enterprise Computing and B International Journal of Enterprise Computing and
Business Systems, 2011.
[3] T. W. J. M. Efthymios Kouloumpis, "Twitter Sentiment Analysis: The Good the
Bad and the OMG!," Fifth International AAAI Conference on Weblogs and Social
Media, vol. 5, p. 4, 2011.
[4] A. Westerski, "Sentiment Analysis: Introduction and the State of the Art
overview".
[5] B. P. a. L. Lee2, "Opinion mining and sentiment analysis," Foundations and
Trends in Information Retrieval, vol. ormation Retrieval, pp. 1-135, 2008.
[6] B. Liu, "Sentiment Analysis and Opinion Mining," Bing Liu. Sentiment Analysis
and Opinion Mining, Morgan &, Bing Liu.
[7] M. J. a. D. K. A.B. Pawar, "Fundamentals of Sentiment Analysis:Concepts and
Methodology," Springer International Publishing Switzerland 2016, p. 25, 2016.
[8] 2. G. F. I. H. T. Mesut KAYA1, "Sentiment Analysis of Turkish Political News,"
IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent
Agent Technology, 2012.
[9] H. R. Z. E. A. Gozde Gul Isguder-Sahin*, "Polarity Detection of Turkish
Comments on Technology Companies," 2014 International Conference on Asian
Language Processing (IALP).
[10 A. T. J. C. S. L. J. Robert P. Schumaker a, "Predicting wins and spread in the
]
Premier League using a sentiment," Science Direct, 2016.
[11 B. X. I. V. O. R. R. P. Apoorv Agarwal, "Sentiment Analysis of Twitter Data," New
]
York, NY 10027 USA, 2014.
[12 . J. S. b. D. Z. M. Ghiassi a, "Twitter brand sentiment analysis: A hybrid system
]
using n-gram," Science Direct, 2013.
[13 M. Z. M. Y. Mohd Naim Mohd Ibrahim, "Twitter Sentiment Classification Using
]
Nave Bayes," IEEE Conference on e-Learning, e-Management and e-Services,
2015.

[14 S. Z. Z. A. O. Z. Z. N. AZMINA M. ZAMANI1, "Sentiment Analysis: Determining


]
People's Emotions in Facebook," Applied Computational Science, 2014.
[15 P. D. T. Saif M. Mohammad, "CROWDSOURCING A WORDEMOTION
]
ASSOCIATION LEXICON," Computational Inteleigance , Vols. Volume 29, , no.
Issue 3, 2012.
[16 X. W. Yang Yu, "World Cup 2014 in the Twitter World: A big data analysis of
]
sentiments," Science Direct, 2015.