You are on page 1of 20

YARMOUK UNIVERSITY

FACULTY OF INFORMATION TECHNOLOGY & COMPUTER


SCIENCES

DEPARTMENT OF INFORMATION SYSTEMS

AI Chabot for sentiment analysis on fake messages

May 31, 2021

Abstract

1
Chabot's are new information and communication channels enable businesses to reach their
target audience through messenger apps like Facebook and WhatsApp, it is a software which
is leading through conversations [1].
Chabot's are used frequently in business to facilitate various processes, particularly those are
related to customers' services and personalization, A Chabot is a trending application created
by Artificial Intelligence, They are used in many activities such as personal assistants and car
assistance to facilitate human work. .

Within this research study, the researchers discussed the main methods that are used for
recognizing news, analyzing emotions, and finally discussed the results and conclusions. A
comparison between two research papers have been used in the methodology to support this
research accomplishment.

Introduction
A Chabot is defined as a computer problem designed to simulate conversation with
human users. Especially over the internet, the Chabot is also considered one of the main
applications of artificial intelligent conversation entities. It is classified as intelligent bots,
interactive agents, and digital assistants; with the development of life and the emergence of a
new term, since 1950 by turning, artificial intelligence is first mentioned in computer science
( Adamopoulou & Moussiades 2020b). Which, also known later as artificial intelligence, has
been a promise to society for a long time; there was little to no advance as it was slow but
steady. At the beginning of 2010, artificial intelligence comes back in a way the ensured its
spread firmly due to the use of neural network algorithms and deep learning, helped by the
availability of massive data, Training in artificial intelligence had been started; accordingly,
software developers could access the models through interfaces, and users are also provided
by companies such as Google, Amazon, and Microsoft ( Vergeer 2020). A Chabot, also
Called Smart food or chatterbot, is a computer program that performs interaction among
humans and machines using audio or messaging methods (Sandeep et al. 2020).
General overview
Chabot's are generally described as dialog systems with various aims and services, .such
as customer services, information acquisition, automated information retrieval; as end-users
ask their queries. Chabot systems answer to the user; Chabot makes life easier for end-users
and is available 24/7 at any time. These days, Chabot systems facilitate communication like a
human being; the Chabot system spread widely in most sectors and reduced the effort. This
led to its increasing spread and frequent use; Chabot uses machine algorithms to learn things.
Chabot composes two types: " using retrieval-based models, "these botched are trained for
many inquiries and their possible answers for each question; the catboats can locate the
essential answers from every possible solution; there is no issue with the language and
sentence structure; the appropriate responses are pre-decided and can't turn out badly in
sentence structure (Sandeep et al. 2020). Fake news spreads through social media has become
a severe problem; AI Chabot, as the name suggests it is an AI software program that imitates
human conversation through text or voice interactions; the reason why catboats are becoming

2
more prominent because they can work 24/7, save time and money connect business to
customers and employees home automation. Fake news is now viewed as one of the greatest
threatens to democracy, journalists, and freedom of expression. The reach of fake news was
best highlighted during the critical months of the 2016 U.S presidential election camping;
during that period, the top twenty frequently discussed phony election stories generated
8,711,000 shares reactions and Facebook. At the same time, fake news is not a new
phenomenon. The leading cause is that fake news can be created and published online faster
and cheaper when compared to traditional news media such as newspapers and
television(ZHOU & ZAFARANl 2020). Limiting the scope, concentrate on sentiment
analysis instead of a conversational bot. To address the problem to detect if the text message
is fake or fact rather than intent classification; a Chabot is a computerized tool for
communication and receiving text or audio information based on neural networks and
machine learning technologies for particular purposes; Chabot's are used frequently in
business to facilitate various processes, particularly those related to customer service and
personalization. Sentiment analysis is a type of text research, aka mining. It applies a mix of
statistics, natural language processing NLP, machine learning to identify and extract
subjective information from a text file, for instance, reviewers feeling thoughts, judgments, or
assessments about a particular topic, event, or a company and its activities (Yarovii et al.
2020). This analysis type classification and extraction and the goal of sentiment analysis are
the same to know a user or audience opinion on a target object by analyzing a vast amount of
text from various sources (ZHOU & ZAFARANl 2020).
Background

The researcher links fake news to terms and concepts such as deceptive news, false news,
satire news, disinformation, miss information, cherry-picking, and rumor based on how these
terms and concepts are known as (ZHOU & ZAFARANl 2020).

1) authenticity containing any nonfactual statement or not

2) intention (aiming to mislead or entertain the public).

3) if the information is news.

Figure 1. Fake news life cycle and connection

3
Manual fact-checking

Manual fact-checking can be divided into expert-based and crowdsourced fact-checking

The fact-checkers expert relies on field experts as fact-checkers for verification and
features(ZHOU & ZAFARANl 2020).

A-easy to manage

B-lead to highly accurate result

2) Expert-based fact-checking website

Several websites have appeared to allow exiting expert public we list and provide details on
popular(ZHOU & ZAFARANl 2020).

Automatic fact-checking :

" Manual fact-checking does not scale with the volume of newly created information,
especially on social media," to decide scalability, automatic fact-checking techniques have
been developed heavily related to information retrieval "IR," natural language processing
(NLP) and machine learning "Ml" techniques(ZHOU & ZAFARANl 2020).

Chabot architecture:

Type of Chabot:

Chabot can be categorized by using various criteria :

1) domain of knowledge

2) service provided

3) objectives input processing

4) method of input processing and response generation humanitarian aid and construct
strategy, knowledge-based classification is the knowledge that an automated chat program
can access or the amount of data being trained, “open Chabot domain can talk about general
topics 738 and respond appropriately ,while closed domain catboats focus on a particular
knowledge domain and may fail to respond to other questions .” , Chabot design and
development includes variety of technologies ,understanding what a Chabot will offer and
which it falls into helps developers choose the algorithms or platform and tools needed to
create it also helps the end-users to understand what to except ,Chabot design requirement to
accurate representation of knowledge , a strategy for generating answers and a set of
predefined neutral solution for the user to respond when the users speech is not
(Adamopoulou & Moussiades 2020).

4
Figure 2. General Chabot architecture

Problem statement
Chabot is becoming part of daily digital life, as they increase messaging platforms and are
launched as a digital assistant by the largest technology companies, In 2016 Facebook
messenger announced it had over 30.0000 catboats available to help you check the wither
order food, organize travel and even help play Pokémon go, They are detecting fake tweets
using sentiment analysis , The Social platform is one of the most commonly used sites today,
and the population from different places exchange information, express opinions, and think.
The Twitter platform is one of the largest microblogs ,where twitter data are frequently used
for research to analyze data ,The problem statement absents the full ability to know and
identify the fake messages and rumors (Sandeep et al. 2020) .
Research question
How to detect rumors and fake messages on social media by IT application??

literature review

5
One of the first catboats, Eliza, was created in 1964-1966 at the MIT Artificial
intelligence laboratory and was the first to simulate human conversation (Przegalinska et al.
2019). Measure performance banking and fintech sectors Chabot are used mainly to reduce
the number of calls from customers to serve there and provide time in response to requests
such as sending receiving (Przegalinska et al. 2019). One of the critical measurements of
chatbots' performance is providing personal communication with users and supporting
required services (Przegalinska et al., 2019). Chabot is considered the perfect example of
implementing state-of-the-art consumer-oriented artificial intelligence and simulates human
behavior based on formal models but adapts to it(Przegalinska et al., 2019).
High demand for education and learning leads to increased competition in educational
organizations and centers (Adamopoulou & Moussiades 2020a). One of the reasons for the
lack of interest acceptable of education is the teacher's malfunction so reduce the concern and
assistant to learners(Adamopoulou & Moussiades 2020a). A chatbot can provide the ability to
provide high-quality content so that the role of Chabot in the educational process is
significant and play high level to improve productivity by support student in learning by
provide recording for old lessons and repeating it when the student miss understand or absent
of it(Adamopoulou & Moussiades 2020a). The Chabot easily facilitates the educational
process by preserving the lessons and answering the lesson material's questions
(Adamopoulou & Moussiades 2020a). The Chabot can also help students enroll in courses,
schedule exams, grades, and anything related to students(Adamopoulou & Moussiades
2020a). One of the last studies researches the number of students participating in universities
that use Chabot in tasks growing because Chabot helps register(Adamopoulou & Moussiades
2020a). Chabot is designed to provide patients with customized health and therapy
information, patient-related products and services, and diagnosis and suggest treatment based
on patient symptoms (Adamopoulou & Moussiades 2020a). Many Chabot was developed to
offer input during the covid-19 pandemic to support medical decision-making and improve
physical exercise (Adamopoulou & Moussiades 2020a). Due to deployed Chabot in the
health sector and used it in health care of patient more flexible and reliable to create comfort
between patient and Chabot more than physicist human the benefit of that the patient share
more knowledge and disclose more symptom( Adamopoulou & Moussiades 2020).
Nowadays, a microblogging stage like Twitter has become famous for the high range of
spread news and opinions that collect attention(Çetinkaya et al. 2020). The reactions such as
like, share, and subscribe used in this platform between users are considered the primary
determinates of this platform news feedback(Çetinkaya et al. 2020). These interactions attract
people to ongoing debates and help inform and shape their opinions(Çetinkaya et al. 2020).
Twitter has become very popular since it was founded in 2006 due to the spread of
information because of the advantage of rapid information dissemination (Çetinkaya et al.
2020). Twitter users agree with other users and disagree; this platform helps users discuss
information and support or oppose it( Çetinkaya et al. 2020).
1) fake news detection using news cascades :
define " is a tree, like structure that directly captures the propagation of a particular news
article on social media," the root node of a news cascade represents the user who first shared

6
the news article, other nodes in flood represent to the users that have subsequently spread the
article by forwarding it after it was posted by their parent nodes, which they are connected to
via edges, a news cascade can be represented in terms of numbers of steps, transmitted by the
news, for example, a news thread based on hops(ZHOU & ZAFARANl 2020).

Figure 3 illustrations of news cascades

2) Compute fake news websites attributes:


After collecting the creditable click baits database, then the researcher use the weka
classifier to
compute the attributes and produce the data files for weka; therefore, the researcher
crawled the web to collect URLs for click baits, focused on social media websites (Monther
Aldwairi) .that are likely to have more news or clickbait ads or articles such as Facebook,
Forex and Reddit, after gathering URLs in a file, a python scripts computed the attribute
from the title. The content of the web pages, last step (Monther Aldwairi) .researchers
extracted the features from web pages, the features keywords in Arabic and English, titles
that start with numbers, all cap words, contain questions and exclamation marks, the
researcher used weka machine learning to validate the solution, after reading the websites
attributes files into weka, we rank the attribute based on several algorithms, to chose the
most relevant to increase the accuracy and decrease the training time (Monther Aldwairi).

Figure 4 pseudo code

7
figure 5 Classification Results

Several Chabot platforms have been developed during the past few years to facilitate
Chabot's creation; there are two types of forums(Abhishek Anil Chintkuntlawar et al. 2020).
The first type, including that fuel and many chat supports, do it yourself Chabot making the
second type, includes google dialog flow(Abhishek Anil Chintkuntlawar et al. 2020).
Messaging apps and media are among the recent trends in the market, generally in
telecommunication chatting applications mainly concerned with artificial
inelegance(Abhishek Anil Chintkuntlawar et al. 2020). Chatting application takes high spread
worldwide and plays an essential role in sharing moments and experimenting with people
effectively, and facilitating(Abhishek Anil Chintkuntlawar et al. 2020). This app's
significance is to make people send relevant detail and secure data and eradicate the spread of
fake messages; the impact of this Chabot to observe the effective way to try to find an
effective way to use messaging apps to support the security of data and information(Abhishek
Anil Chintkuntlawar et al. 2020).
During last year's synchronization with a covid-19 pandemic, worry about fake news
due to high spread misleading information put the government across the world in various
places founded steps to steam their flow(Rodrigues & Xu 2020). Most governments must be
differentiated and balance freedom of expression and people's right to safe from the adverse
impact of inaccurate information(Rodrigues & Xu 2020). The organization conducted several
conferences and advertising measures and penalties for those who spread false
rumors(Rodrigues & Xu 2020). Government is more control in your country with a higher
level of information and knowledge of media must be maintained to preserve freedom of
people rights and prevent or any inaccurate speech or news (Rodrigues & Xu 2020). One of
the works for some government by working with companies that specialists in social media to
determine political advertisement, political news, and anything relates to political term by
removing it from social communication, Because the danger posed by information to society
and to spread the culture of verification, verify of the news and make It depletes the feeling of
the recipient whether. (Rodrigues & Xu 2020) positively or negatively(Rodrigues & Xu
2020).
One of Chabot's benefits is popular, easy, and speed telecommunication with customers
by particular interface specialists dedicated to specific works to reduce real-time customers
(Adam et al. 2020). Services in many e-commerce sectors, nowadays human chat services are

8
frequently replaced with conversational software and Chabot's Application designed to do
informal human works with users to answer and meet the requirement in natural language
based on artificial intelligence(Adam et al. 2020). Through cost and time-saving
opportunities lead to widespread implementation of AI-based Chabot. Still, they fail to meet
the users' needs, affecting the users by not including them(Adam et al. 2020). The result
shows that they need Chabot's to complete Chabot's request for services feedback(Adam et al.
2020). Communicating with customers by live chat interfaces has become popular to reduce
time, cost, and speed customer services(Adam et al. 2020). In the e-commerce sector, using
conversational Chabot to get more information about products or specific services or submit
solving technical problems facilitates raising the level of trust and satisfaction (Adam et al.
2020). AI-based increasingly popular in various settings and can offer a lot of time and cost-
saving(Adam et al. 2020). Many users gained more unsatisfied experiences with Chabot, so
much research was conducted to test its effectiveness(Adam et al. 2020).
A Chabot is a software tool that interacts with users for a sure thing or in a concern
specific term in natural by conversational way text or voice for many main aims, Chabot's are
often used, such as educational or marketing, online technical support; Chabot can serve
customers, users provide information or answer questions or discuss some topics or perform,
Some tasks can be implemented to help users, like booking a hotel (Smutny & Schreiberova
2020). Chabot application spread for a long time. For example, Eliza, Alice, the first Chabot,
was developed in 1956 by (joseph Weizenbaum ) to simulate psychotherapists and had
colossal knowledge(Smutny & Schreiberova 2020).

Twitter has recently become a place for exchanging news, mainly due to the rapid
spread of unreliable by the internet, Twitter considered (Sivasangari 2018). One of the
analytical data engines from the web and has become the prime source for spread fake news;
twitter becomes the main spread and active platform (Sivasangari 2018). Due to facilitate
post news, attractive, and share it with another person, Twitter used to search daily for the
daily event. These everyday circumstances are used to spread false or inaccurate information
to defraud individuals because of their vulnerability to events (Sivasangari 2018). At this
time, users of the internet become addicted because spending a long time using the internet,
such as platform social media such as Facebook, Instagram, Twitter, Snapchat, and
WhatsApp(Sivasangari 2018). The social media become the brain of your life for all
populations globally; people shared all feeling and communication with another by this
platform (Sivasangari 2018). The platform becomes more way to expose individuals to false
news and receive inappropriate or unreliable and harmful messages for the sake of
entertainment blackmail and profit for your slender(Sivasangari 2018). There is an urgent
need for society to find a radical solution to prevent this behavior (Sivasangari 2018). Twitter
one of the famous platform help to microblogging is the number of users who use Twitter
monthly 330 million active users (Sivasangari 2018). When a significant natural disaster or
important news occurs, there is an increase in interaction between users and express their
opinions(Sivasangari 2018).

9
Exchange fake news and share it has become a massive problem in the online social
media world (Anu Shrestha 2020). Fake news is more harmful than real news; the main
problem is spread and individual and trust citizens (Anu Shrestha 2020). Chabot's are
responsible for spreading real and fake news, but fake news on Twitter relates to
human activity, which plays a vital role (Anu Shrestha 2020). The first response of the big
problem is that the human, specifically Twitter users, share the news (Anu Shrestha 2020).
But not read the main content that described or expressed the base of information or details;
just read the title and share it no confirm or know the ground or source of news(Anu Shrestha
2020). Because it is frequently shared by many users who can think it is real news(Anu
Shrestha 2020). Experimental results show that our approach can detect fake news spreaders
with an accuracy of 0.73 on the English dataset and 0.77 on the spinach dataset(Anu Shrestha
2020). Must be searched and understand malicious users' characteristics to determine the
active users who share fake news and analytic to understand the rezones (Anu Shrestha
2020). On average, they found that users who share fake news tend to register for a shorter
time than those who share accurate information (Anu Shrestha 2020). Discovered the bots
likely share apiece of fake news, most probably the older people and females are more likely
to spared fake news (Anu Shrestha 2020). For the Facebook platform, fake news is age,
political orientation, and social use, but the older people, due to little knowledge, have to
share fake news(Anu Shrestha 2020). Social network users need to evaluate online news, real
news, accurate news on social media(Anu Shrestha 2020). The reason to share fake news may
be because familiar with the platform and know what they share (Anu Shrestha 2020). They
probably share fake news socializing, status-seeking, and information-seeking; they are most
likely to share fake news(Anu Shrestha 2020).

The messages are shared and exchanged between users online (Oluwaseun Ajao 2019).
This research proposed understanding and analyzing fake news characteristics, especially
sentiment, to discover fake news and rumors based on empirical observation (Oluwaseun
Ajao1 2019). It can be said that there is some link between feelings, false reports, and texts
published on the internet with the use latest method to detection of fake news(Oluwaseun
Ajao1 2019). When they started to dictate phony information on social media, it is beneficial
to know all features and identify and misuse them (Oluwaseun Ajao 2019). , twitter message
has been shown to have a lifespan of as little as less than one day and up to 70 days
depending on the type of content and URL being shared (Oluwaseun Ajao1 2019). Sentiment
analysis, also known as opinion mining, seeks to understand sentences and phrases' practical
meaning (Oluwaseun Ajao1 2019). The algorithm used world relevance and usage within the
corpus; considered the terms and words and concepts within a text corpus(Oluwaseun Ajao1
2019 ).
Fake news detection is defined as predicting a particular news article (news report,
editorial, expose. News verification aims to employ technology to identify intentionally
deceptive news content online and is an essential issue within the specific library and
information science [nadia K.conroy]. Authentication of identity on social media is
paramount to the notion of trust; the profilation of news in the form of current events through

10
mass technologies like microblog invites ways of ascertaining the difference between fake
and genuine content
[nadia K.conroy] .

Name of Method Main note Author


paper
Chatbot Natural presents the History, Technology, and Eleni Adamopoulou
history, Language Applications
technology, Processing
and (NLP),deep
application learning

Detecting Machine Detecting fake news in two dataset and ( Anu Shrestha)
Fake News learning ,wek compare the result
Spreaders in a classifiers
Social
Networks
via
Linguistic
&
Personality
Features
Artificial Cryptography AI. The technologies and the framework Abhishek Anil
inelegance Algorithm, used for the development of the app Chintkuntlawar
Chabot Classification plays a significant role in the recent
messenger Algorithmand trend
Word
Detection
Algorithm
Isolating Sentiment several steps to minimize the impact of Usha M Rodrigues
Rumors analysis fake news during COVID,
Using
Sentiment
Analysis
Isolating VADER Isolating rumors by sentiment analysis (V. Sivasangari1)
Rumors sentiment
Using method
Sentiment
Table 1

11
Methodology

The importance of detect fake news and hoaxes has been there since before the advent of
the internet, and fake news is: fictitious articles deliberately fabricated to device readers; the
goal is profiting through click baits, when detecting fake news from a knowledge-based
perspective, one often uses a process known as fact-checking aims to assess news authenticity
by comparing with the knowledge extracted to be verified news content with known fact
('Detecting Fake News in Social Media Networks' 2018).

The Paper title Methods Methods Dataset Data Reference


Methods Names steps type

Method 1 Detecting Random 1) Analysis tweets, Textual ( Anu


Fake News Forest features: data, Shrestha)
training
Spreaders in (RF), Data
1- style data set
Social Logistic file
=300
Networks Regression 2- N-gram XML
users
via "LR," format
3-Tweet with 100
Support
Linguistic embedding tweet
Vector
&
machine 4-sentiment , test
Personality "SVM" and analysis data set
Features. Extra trees include
"ET," 200
based on 2) applied users
features machine
learning
classifiers to
get accuracy.

Method 2 Isolating VADER 1)Crawl PHEME Textual (V.


Rumors Sentiment Tweets with dataset data Sivasangari1)
Using Analysis Metadata
Jason
Sentiment
2) VADER format
Analysis

12
Sentiment
Analysis
3)isolating
rumor

Table 2
Description of steps for method 1:
Style :
this first set of features captures the writing style of the collection of tweets authored by the
same user, specifically, computed the average number of certain words, items, and characters
per user tweet, which includes the average number of 1) words 2) characters 3) lowercase
words 4) uppercase words 5) lowercase characters 6)uppercase characters (Anu Shrestha).
The feature considered for detecting fake news spreaders, as the dataset files are in raw XML
format, parsed and formatted by using the XML .etree . after extraction, preprocessed the
content (user tweets) as pre-requirement for each feature, some features require cleaned texts
and some other, that need the text as it is for incorporating underlying details of the others
writing, four different types of features were considered to address this task, as explained in
more information below (Anu Shrestha).
Style: this first set of features captures the writing style of the collection of tweets authored
by the same user, specifically, computed the average number of certain words, items, and
characters per user tweet, which includes the average number of 1) words 2) characters 3)
lowercase words 4) uppercase words 5)lowercase characters 6)uppercase characters (Anu
Shrestha).
N-gram:
The second group of features includes TF-IDF based n-grams for both words and characters;
the researchers collect tweets by each user to form a single document per user, for each
record, they have removed words like "RT," "Via," and "&amp" and re-placed emoji's and
smileys with the corresponding English word, next each document was converted to lower
case, stop words and punctuation symbols were removed. The remaining terms were
lemmatized(Anu Shrestha).
Tweet embedding:
the third set of features includes the embedding of tweets computed by using modern BERT
model in NLP model specifically used the pre-trained multilingual model provided by
SBERT; text extracted inclusion was reduced to 10 features via principal component analysis,
then averaged all tweets of one user to create single embed to create representation for each
user that captures the semantics of all of the user tweets, before accounting for this inclusion,
each tweet was preprocessed to remove frequently used characters on Twitter such as “RT,”
via” and “ &amp” and replace emoji's and smiles with corresponding English words (Anu
Shrestha).

13
Sentiment analysis:
Since people express their feelings, rating and feeling towards any news or article by
choosing words in their tweets, took advantage of sentiment analysis as another feature, also,
in this case, the researchers pre-processed each tweet at removing “RT,” via” and “ &amp,
however, the researcher did not replace the emojis and smiles with the English text
corresponding to this feature since these emojis add to the emotional values of the text. For
each user, the researcher averaged sentiment across all of their tweets; the researcher used an
aware valence dictionary and thinking site (Vader ), a library specifically designed to capture
emotions expressed in social media texts for English (computed value) (Anu Shrestha).

Compute fake news websites attributes :


After collecting the creditable click baits database, then the researcher use the weka classifier
to
compute the attributes and produce the data files for weka; therefore, the researcher crawled
the web to collect URLs for click baits, focused on social media websites (Monther
Aldwairi).that are likely to have more news or clickbait ads or articles such as Facebook,
Forex and Reddit, after gathering URLs in a file, a python scripts computed the attribute from
the title. The content of the web pages, last step (Monther Aldwairi).researchers extracted the
features from web pages, the features keywords in Arabic and English, titles that start with
numbers, all cap words, contain questions and exclamation marks, the researcher used weka
machine learning to validate the solution, after reading the websites attributes files into weka,
we rank the attribute based on several algorithms, to chose the most relevant to increase the
accuracy and decrease the training time (Monther Aldwairi).

Figure 6

14
Figure 7 Classification Results

Description of method 2 :
After extracted crawl text (tweets ) are isolated into rumor or non- rumor using the VADER
sentiment analysis technique, the extracted data are categorized as positive or negative for
each trowel twitter text (tweets). If the negative score > than the threshold (0.5 ), it is well
marked as a rumor (V. Sivasangari1).

figure 8 VADER sentiment analysis

Result and Dissection:

This is the result for method 1, "Machine learning (Weka classifier ) "

15
Figure 9

This is the result for method 2 VADER sentiment analysis:

Figure 10
DISSCUTION RESULTS:
Description of steps for method 1:
Explaining the steps of the method in details as follows:
The data which has been used here passed through all of the previous criteria which was
applied on the two sets of data (in English language and Spanish language) firstly as an initial
form and then to compare between the features themselves to calculate the accuracy of
results.
With respect to the first feature, the highest result of accuracy was 0.64 "ET" MACHINE
LEARNING.

16
For the second feature, the highest result of accuracy was 0.72 "SVM".
As for the third feature, the highest result of accuracy was 0.69 "SVM".
As for the fourth feature, the highest result of accuracy was 0.66 "LR"
So the highest result of accuracy is for data in English "SVM".

As for the data in Spanish, the accuracy ratio for the first feature was "SVM" 0.750.7
The second relative advantage is "ET" 0.75.
And the third feature is "RF" 0.74.
And finally, the fourth feature is "LR" 0.57.

According to the results shown above, for English language Extra Trees have been selected
as the best classifier for the style features, SVM for both N-Grams, Tweet Embedding, and
Logistic Regression for Sentiment Analysis. Likewise, for Spanish language, SVM has been
selected as the best classifier for the style features, Extra Trees for N-Grams, Random Forest
for Tweet Embedding, and Logistic Regression for Sentiment Analysis.
Also, for the TRAINING DATA, TESTING DATA part, a comparison of 10-fold cross-
validation was conducted using the ratio for Training Set = 0.73 for English language, while
0.77 for the Spanish language. The AVG = 0.75.
The percentage of the Test Set was for the English language, while
0.76 for the Spanish language
AVG FOR TESTINF SET 0.73
As for the method in the second research, after using the Tweeter Scraper application and
applying the VADER sentiment model.
The results appeared as shown in Figure 10, where the news was classified as true or not on
the basis of VADER Polar emotionality and valence-based technology, extracted text.
They are categorized as positive and/or negative value for each Twitter text determining
whether it is a rumor, or a fact based on the condition. If the negative value is greater than the
value (0.5), then it is marked as common.
I think that the method in the first article is more accurate and comprehensive in analyzing
the text and is also used for analyzing large data and it gives ratios closer to reality.
CONCLOUSION

Within this research, there was a review for the most important research studies that are

17
related to the Robots technology in general, and also the most important research studies that
are related to and summarizing the important methods which are used and classified as fake
news; how data are collected, the stages they pass through and the mechanisms that are
applied for them. Two methods have been discussed and reviewed, a comparison between
these two methods have been conducted, the stages that these fake news go through it, and
how users' emotions and feelings are analyzed, and how news' publishers are feeling, and also
how they determine the most accurate and best news from this fake news.
In the future, the researchers hope that there will be a developed application that recognizes
fake news to educate the community and to protect the culture from this type of bad news due
to the reaction that is resulted from publishing the fake news and what will be its impact upon
the society

Reference:
Abhishek Anil Chintkuntlawar, Rutuj Sanjay Shahare2, Ayesha Hanif Pathan3, Himanshu
Nitin Dholakiya4 & Atkar5, G 2020, 'Artificial Intelligence Chatbot Messenger,'
International Journal of Research in Engineering, Science and Management, vol. Volume-3,,
no. Issue-3, p. 4.

Adam, M, Wessel, M & Benlian, A 2020, 'AI-based chatbots in customer service and their
effects on user compliance', Electronic Markets.

Adamopoulou, E & Moussiades, L 2020a, 'Chatbots: History, technology, and applications',


Machine Learning with Applications, vol. 2.

Adamopoulou, E & Moussiades, L 2020b, 'An Overview of Chatbot Technology', in


Artificial Intelligence Applications and Innovations, pp. 373-83.

18
Anu Shrestha, FS, and Abishai Joy 2020, 'Detecting Fake News Spreaders in Social
Networks via', Notebook for PAN at CLEF 2020, p. 10.

Adamopoulou, E & Moussiades, L 2020, 'An Overview of Chatbot Technology', in Artificial


Intelligence Applications and Innovations, pp. 373-83.

Çetinkaya, YM, Toroslu, İH & Davulcu, H 2020, 'Developing a Twitter bot that can join a
discussion using state-of-the-art architectures', Social Network Analysis and Mining, vol. 10,
no. 1.

Oluwaseun Ajao1, DBaSZ 2019, 'SENTIMENT AWARE FAKE NEWS DETECTION ON


ONLINE SOCIAL NETWORKS', p. 5.

Przegalinska, A, Ciechanowski, L, Stroz, A, Gloor, P & Mazurek, G 2019, 'In bot we trust: A
new methodology of chatbot performance measures', Business Horizons, vol. 62, no. 6, pp.
785-97.

Rodrigues, UM & Xu, J 2020, 'Regulation of COVID-19 fake news infodemic in China and
India', Media International Australia, vol. 177, no. 1, pp. 125-31.

Sandeep, Thorata , VD & Jadhavb* 2020, 'A Review on Implementation Issues of Rule-based
Chatbot Systems', International Conference on Innovative Com, p. 6.

Sivasangari, AKM, K. Suthendran2 and M. Sethumadhavan1 2018, 'Isolating Rumors Using


Sentiment Analysis', Journal ofCyber Security and Mobility, vol. 7 1, p. 19.

Smutny, P & Schreiberova, P 2020, 'Chatbots for learning: A review of educational chatbots
for the Facebook Messenger', Computers & Education, vol. 151.

Vergeer, M 2020, 'Artificial Intelligence in the Dutch Press: An Analysis of Topics and
Trends', Communication Studies, vol. 71, no. 3, pp. 373-92.

'Detecting Fake News in Social Media Networks', 2018, nternational Conference on


Emerging Ubiquitous Systems and Pervasive Networks (EUSPN 2018)

19
The 9th International Conference on Emerging Ubiquitous Systems and Pervasive Networks
(EUSPN 2018).

Monther Aldwairi, AA 'Detecting Fake News in Social Media Networks', Elsevier Science
Inc, vol. 141, p. 6.

Yarovii, A, Kudriavtsev, D & Olena, P 2020, 'Improving the Accuracy of Text Message
Recognition with an Intelligent Chatbot Information System', paper presented to 2020 IEEE
15th International Conference on Computer Sciences and Information Technologies (CSIT).

ZHOU, X & ZAFARANl, R 2020, 'A Survey of Fake News:Fundamental Theories,


Detection Methods, and Opportunities', ACM, p. 37.

20

You might also like