Professional Documents
Culture Documents
Open Access
Research/ Review/Survey Paper Volume-5, Issue-8 E-ISSN: 2347-2693
1
Dept. Of Computer Science, Government Engineering College, Palakkad
2
Dept. Of Computer Science, Government Engineering College, Palakkad
3
Computational Linguistics. International Centre for Open Source and Software, Trivandrum
4
Fellow, Computational Linguistics. International Centre for Open Source and Software, Trivandrum
e-mail:jennygeorge763@ gmail.com , binurajappan@yahoo.com , rajeevrr@icfoss.in , reshmamm@icfoss.in
Abstract— Recommendation systems are information filtering system which identifies user’s interest and preference. We
propose a personalised product recommendation system which uses k-means algorithm to identify similar customers. It also
provide recommendations to other customers within the cluster with the top most purchased product. The system also find out
the purchased items by silent customers and provide notifications in the form of offers for the same or related products .The
machine learning, as well as NLP approach, is used to analyze user's queries and generate appropriate responses.
The user inputs the question to the system and the AIML
FIGURE Interpreter matches the keywords and generate the response
then, manage the conversation. The <random> tag is used to
generate random response. The AIML language is language
AI based Segmentation independent. So Malayalam chatbot can be developed based
on the AIML tags.
Clustering of similar customers is done based on
demographic variables such as age,gender,marital status etc.
and the product information is collected from database. K-
This simple AIML file is shown above. The category part
means algorithm can be used for clustering of like-minded
includes pattern and template tags which contains the input
customers. Collaborative filtering can be of two types: User-
as well as the output. The AIML chatbot can be store the
user collaborative filtering and item-item collaborative
name of the user using the tag <set name>. They can
filtering. A user-user filtering takes a particular user,find
remember the name of the user till the conversation ends.
users that are similar to that user based on similarity of
purchase and recommend items that those similar users liked. Malayalam Chatbot based on Chatterbot Python Library
In contrast,item-item filtering will take an item,find users
who purchased that item,and find other items that those users Propose a retrieval based domain specific chatbot that
or similar users also purchased. It takes items and outputs converses in Malayalam language. The chatbot is developed
other items as recommendations. based on natural language processing library with machine
People within a certain age group have many buying learning mechanism. Initially, the chatbot matches the input
needs in common. For example, babies need diapers with the queries in the database and then calculates a
and formula, special care and clothing. People over the confidence value for sentences that matched. Finally, selects
age of 65 tend to need medicine and other health care the appropriate response based on the highest confidence
products.Customers can be divided into five age groups. value among them. Here, the response generation is retrieval
based. The retrieval based method retrieve responses from
These groups are children, teenagers, young adults, the
the knowledge base based on the context. The input is given
middle aged, and the over 65s.
to a web interface. Then process the text that given by the
user. Selects the sentences which match the input text.
Returns known response to the selected matches. Then,
Segmentation by gender is an obvious way of
calculates the confidence value of each response. The system
identifying possible markets. Women and men
will return an appropriate response as the highest confidence
sometimes look for different benefits in products. In the
value. Finally, the response is presented to the user.
past some goods, such as power tools and sporting
goods, were bought mainly by men. Others such as The chatterbot is a Python package based on Machine
groceries and health care products, were purchased learning concepts. It is a language independent platform. The
mainly by women. These patterns are not staying important module of the Chatterbot library is Chatterbot's
constant over time. Women today have an increasing adapters. The input is returned from the input adapter, the
amount of personal buying power due to increased input is processed and stored by the logic and the storage
income levels. More women are buying such things as adapters. Finally, it passed to output adapter to returns the
power drills and more men are shopping for groceries responses to the user. Chatterbot includes training tools to
make simple training process. The Chatterbot training
includes loading of the dialogues into the chatterbot IV. COMPARISON
database. Several training classes are inbuilt in chatterbot
package. Here, we train the data from the dialogue corpus Since this is the first chatbot in Malayalam. So, there is no
using Corpus Trainer Class of Chatterbot. Initially, developed another Malayalam chatbot for comparison. So compare the
a Malayalam Tourism corpus in the predefined format in AIML and Chatterbot based Malayalam chatbot. The AIML
YAML file. Then, set a trainer for the Malayalam dialogue based chatbot is a rule-based one. The Chatterbot is
corpus. The chatterbot has built-in storage adapter that integrated the Machine learning and Natural language
connects different databases. The Best Match logic adapters processing concepts.
are used here to find the best response to the closest match.
Here, uses a Low confidence adapter and set a threshold AIML based chatbot used simple pattern template to
value 0.5. The Best Match adapter calculates similarity represent the user input and output and also using simple
function used to compare the input text to known responses. pattern matching algorithm. The chatbot based on chatterbot
The response is selected based on the threshold value. starts off with no knowledge of how to communicate. Since
we don't locate an appropriate database for this purpose we
manually prepared the corpus in Malayalam. We take care of
the issue of the absence of required tools by choosing a
language independent platform and picking a retrieval based
model to fill the need.