Professional Documents
Culture Documents
MACHINE LEARNING
A MINI-PROJECT REPORT
Submitted by
SHARATH J [410720104063]
MANOJ KUMAR S [410720104041]
NESAMANI K [410720104046]
KALYANRAJ S [410720104303]
PAGE \* MERGEFORMAT 1
ANNA UNIVERSITY: CHENNAI 600 025
BONAFIDE CERTIFICATE
SIGNATURE SIGNATURE
PAGE \* MERGEFORMAT 9
CERTIFICATE OF EVALUATION
Semester : 6th
Subject :
KALYAN RAJ S
4. (410720104303)
The report of the project work submitted by the above students in partial
fulfillment for the award of a Bachelor of Technology degree in COMPUTER
SCIENCE AND ENGINEERING under Anna University was evaluated and
confirmed to be the reports about the work done by the above students.
PAGE \* MERGEFORMAT 9
ACKNOWLEDGEMENT
Department, ………………………
for guiding us in all aspects of our project in each stage and providing us
with valuable suggestions.
PAGE \* MERGEFORMAT 9
ABSTRACT
Recommendation systems are a helpful and effective technique for filtering the
info. This paper could be a review of advice systems which will describe
recommendation systems; however, it works and helps in several platforms, and
also the different kinds of it with their deserves and limitations
PAGE \* MERGEFORMAT 9
TABLE OF CONTENT
CHAPTER TITLE PG
NO. NO.
ABSTRACT V
LIST OF FIGURES VII
1 INTRODUCTION 1
1.1 OBJECTIVE 2
1.2 APPLICATION 2
2 LITERATURE
2.1 LITERATURE REVIEW 3
2.2 RELATED WORKS IN THE 4
RECOMMENDATION SYSTEMS
3 DEVELOPMENT PROCESS
3.1 ARCHITECTURE 7
3.2 WORK FLOW 8
3.3 METHODS OF RECOMMENDATION SYSTEM 8
5 CONCLUSION 11
APPENDICES
SAMPLE CODE 12
5.1 SAMPLE OUTPUT 16
6 REFERENCES 17
PAGE \* MERGEFORMAT 9
LIST OF FIGURE
FIGURE TITLE PG
NO. NO.
PAGE \* MERGEFORMAT 9
CHAPTER : 1
INTRODUCTION
1
1.1 OBJECTIVE :
1.2 APPLICATION :
2
CHAPTER 2
LITERATURE
2.1 LITERATURE REVIEW :
Recommender systems have become an important research field
since the emergence of the first paper on collaborative filtering in the
mid-1990s.
In general, recommender systems are defined as the supporting
systems which help users to find information, products, or services (such
as books, movies, music, digital products, websites, and TV programs)
by aggregating and analyzing suggestions from other users, which mean
reviews from various authorities, and user attributes. However, as
academic research on recommender systems has increased significantly
over the last ten years, more research is required to be applicable in a
real-world situation.
Because the research field on recommender systems is still wide
and less mature than other research fields. Accordingly, the existing
articles on recommender systems need to be reviewed for the next
generation of recommender systems. However, it would be not easy to
confine the recommender system researches to specific disciplines,
considering the nature of the recommender system researches. So, we
reviewed all articles on recommender systems from 37 journals that
were published from 2001 to 2010. The 37 journals are selected from the
top 125 journals of the MIS Journal Rankings. Also, the literature search
was based on the descriptors "Recommender system",
"Recommendation system", "Personalization system", "Collaborative
filtering" and "Contents filtering".
The full text of each article was reviewed to eliminate the article
that was not related to recommender systems. Many articles were
excluded because articles such as Conference papers, master's and
doctoral dissertations, textbooks, unpublished working papers, non-
English publication papers, and news were unfit for our research. We
classified articles by year of publication, journals, recommendation
fields, and data mining techniques. The recommendation fields and data
mining techniques of 187 articles are reviewed and classified into eight
recommendation fields (book, document, image, movie, music,
shopping, TV program, and others) and eight data mining techniques
(association rule, clustering, decision tree, k-nearest neighbor, link
analysis, neural network, regression, and other heuristic methods). The
3
results represented in this paper have several significant implications.
First, based on previous publication rates, the interest in recommender
system-related research will grow significantly in the future. Second, 49
articles are related to movie recommendations whereas image and TV
program recommendations are identified in only 6 articles. This result
has been caused by the easy use of the Movie Lens data set. So, it is
necessary to prepare a data set of other fields. Third, recently social
network analysis has been used in various applications. However,
studies on recommender systems using social network analysis are
deficient. Henceforth, we expect that new recommendation approaches
using social network analysis will be developed in the recommender
systems. So, it will be an interesting and further research area to
evaluate the recommendation system research using social method
analysis. This result provides a trend of recommender system research
by examining the published literature and provides practitioners and
researchers with insight and future direction on recommender systems.
We hope that this research helps anyone interested in recommender
systems research to gain insight for future research.
4
Recently, various approaches for building recommendation systems have
been developed, which can utilize either collaborative filtering, content-based
filtering, or hybrid filtering. The collaborative filtering technique is the most
mature and most commonly implemented.
The system makes use of a user interface that assists users in browsing
the Internet; it can track the browsing pattern of a user to predict the pages that
they may be interested in. Pazzan. Designed an intelligent agent that attempts to
predict which web pages will interest a user by using a naive Bayesian
classifier. The agent allows a user to provide training instances by rating
different pages as either hot or cold. Jennings and Higuchi describe a neural
network that models the interests of a user in a Usenet news environment.
5
Movie/Book/News Recommendations - Suggest new content that increases
user engagement. The aim is to introduce users to new content that may interest
them and encourage them to consume more content on our platform.
Product Recommendations - Suggest a mix of old and new products. The old
products from users’ historical transactions serve as a reminder of their frequent
purchases. Also, it is important to suggest new products that the users
may like to try.
6
CHAPTER 3
DEVELOPMENT PROCESS
3.1 ARCHITECTURE :
Figure 3.1
7
3.2 WORK FLOW :
Figure 3.2
Figure 3.
8
CHAPTER 4
COLLABORATIVE FILTERING :
9
HYBRID FILTERING :
10
CHAPTER : 5
CONCLUSION
11
APPENDICES
SAMPLE CODE
import numpy as np
import pandas as pd
import difflib
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity
movies_data.shape
selected_features = ['genres','keywords','tagline','cast','director']
print(selected_features)
print(combined_features)
12
# converting the text data to feature vectors
vectorizer = TfidfVectorizer()
feature_vectors = vectorizer.fit_transform(combined_features)
print(feature_vectors)
"""Cosine Similarity"""
similarity = cosine_similarity(feature_vectors)
print(similarity)
print(similarity.shape)
# creating a list with all the movie names given in the dataset
list_of_all_titles = movies_data['title'].tolist()
print(list_of_all_titles)
# finding the close match for the movie name given by the user
find_close_match = difflib.get_close_matches(movie_name,
list_of_all_titles)
print(find_close_match)
close_match = find_close_match[0]
print(close_match)
13
# getting a list of similar movies
similarity_score = list(enumerate(similarity[index_of_the_movie]))
print(similarity_score)
len(similarity_score)
i=1
list_of_all_titles = movies_data['title'].tolist()
find_close_match = difflib.get_close_matches(movie_name,
list_of_all_titles)
close_match = find_close_match[0]
similarity_score = list(enumerate(similarity[index_of_the_movie]))
14
sorted_similar_movies = sorted(similarity_score, key = lambda x:x[1],
reverse = True)
i=1
15
5.1 SAMPLE OUTPUT :
16
CHAPTER : 6
REFERENCE
17