Report Major Project by Shubham

1
AMITY UNIVERSITY UTTAR PRADESH

GREATER NOIDA CAMPUS
AMITY SCHOOL OF ENGINEERING & TECHNOLOGY
2019 - 2023
MAJOR PROJECT REPORT
“Music Recommendation System”
Submitted as a partial fulfilment toward the award of

Bachelor of Technology
(COMPUTER SCIENCE AND ENGINEERING)
Submitted by :
Shubham Yadav ( A41105219048 )
Submitted to :
Prof. Pradeep Kumar Kushwaha
( Faculty Guide )
2
CERTIFICATE
I hereby certify that the work which is being offered in entitled

Major Project Report, in partial fulfilment of the requirement for
the award of B.Tech in (CSE) and submitted to the Department
of ASET is a genuine file of my very own and executed all
through a duration from Jan - Mar 2023 beneath the
supervision of,
Prof. Pradeep Kushwaha, Amity University Greater Noida
Computer Science Department.
The matter provided on this undertaking record has not been

submitted via me for the award of another diploma someplace
else.
SHUBHAM YADAV
(A41105219048)
This is certified that the above statement made with the aid of
the candidate is accurate to the exceptional of my expertise.
Date:
Signature of Faculty Guide

Mr. Pradeep Kushwaha
3
ACKNOWLEDGEMENT
I would love to express my sincerest gratitude and

indebtedness to the person that gave me ethical and technical
help & whose kind assistance has been instrumental inside the
crowning glory of my project. It offers me sizeable satisfaction
to very own my humble gratitude to my faculty guide Mr.
Pradeep Kushwaha for this integral steering and for supplying
necessary thoughts to perform this challenge.
I would, then the like to vicinity on record my best regards and

private sense of gratitude to Prof. (Dr.) AJAY RANA (Director
General -SD &VT), Prof J S Jassi ( Dean Academic ), Prof.
Pradeep Kushwaha (Programme Coordinator) and all the
professors of our Computer Science Department. I
additionally thank all the faculties of Amity University for his or
her cautious and precious steerage which has been
extraordinarily treasured for my look at both theoretically and
almost.
I understand this possibility as a huge milestone in my

professional improvement. I will try to apply abilities and
understanding won in the course of this era in a great feasible
manner, and I will retain paintings, to obtain my preferred
professional targets.
Sincerely
SHUBHAM YADAV
(A41105219048)
Computer Science Department, (ASET)
4
TABLE OF CONTENT
S No. CONTENT PAGE No.
1 Certificate 2
2 Acknowledgement 3
3 Abstract 5-6
4 Introduction 7-8
5 Literature Review 9-12
6 Methodology & Structural 13-17
Design
7 Graphical Representations 18-19
8 Algorithms 26
9 Implementation 27-29
10 Features and 30-33

Capabilities Required
11 Quality Attributes 34-35
12 Data Collection 36-45
13 Conclusion and future 50
scope(s)
14 Reference 51
5
ABSTRACT
Recommender systems are crucial in the world of e-commerce,

serving as a major marketplace that connects people from all over
the globe. These systems make it easy for users to access and
browse the market from anywhere. They play a significant role in
ensuring that commerce runs smoothly, by recommending products
based on users' preferences and analyzing their tastes. In this study,
we focus on developing a recommender system for music
applications, taking into account the diverse musical preferences of
different users. To achieve this, we utilize a combination of a
popularity algorithm, classification, and collaborative filtering. Finally,
we evaluate the effectiveness of our system using various evaluation
metrics.
In today's world, music streaming platforms have become a
ubiquitous source of entertainment, with millions of users accessing
them daily. With such a vast collection of songs and genres available,
it can be overwhelming for users to choose what to listen to. A music
recommendation system aims to address this issue by analyzing user
preferences and recommending songs or playlists based on their
listening history. In this project, we propose a music recommendation
system that utilizes collaborative filtering and content-based filtering
techniques to generate personalized song recommendations for
users. The system analyzes user behavior and preferences, such as
song ratings and listening history, to create a profile of the user's
musical taste. It then applies machine learning algorithms to find
similar songs or artists that the user might enjoy. The system also
considers other factors such as song popularity and user feedback to
improve the recommendations. We evaluate the performance of the
system using various metrics such as precision, recall, and F1-score
and compare it with other existing recommendation systems. Our
results show that our music recommendation system can effectively
provide personalized recommendations to users, leading to
increased user satisfaction and engagement.
6
Keywords— E-commerce, recommender system, collaborative

filtering, memory-based recommendation, model-based
recommendation, music information retrieval.
7
INTRODUCTION
In the digital age, music is a popular entertainment medium that is

created by humans to express emotions and ideas through melody,
harmony, and rhythm. It can be categorized into different genres
such as pop, rock, jazz, blues, and folk. With the advent of smart
phones, it has become easier to listen to music both offline and
online. However, the abundance of digital music available can lead to
information fatigue and make it difficult to sort through. That's why
developing a music recommender system that automatically
searches music libraries and suggests suitable songs to users can be
very useful. Music streaming applications like Spotify and Pandora
already have features that recommend music to users. The challenge
of the music recommender system is to continuously find attractive
new music that aligns with users' preferences. This requires a
personalized recommender system that effectively reflects personal
preferences and can be adjusted to cater to different audiences.
Implementing such a system is more complicated than a general
recommender system. Our research approach is based on
comparing the similarity of audio signal features using Convolutional
Recurrent Neural Networks (CRNNs) for feature extraction and
similarity distance. CRNNs are a combination of Convolutional Neural
Networks (CNNs) and Recurrent Neural Networks (RNNs), which can
work with time series data and predict temporal sequence prediction
problems. CNNs are useful for predicting high-level song features
such as chords and beats as they allow hierarchical structure,
including intermediate features on multiple timescales. Our method
was inspired by the work of Choi et al. The research process for
creating a personalized music recommender system involves several
stages. The first stage involves identifying the problem and reviewing
relevant literature. The second stage involves collecting music data,
and the third stage is focused on cleaning and selecting the data. In
the fourth stage, the audio data is preprocessed. Next, in the fifth
stage, a neural network model is developed. In the sixth stage, a
recommender system model is developed. Finally, in the seventh
stage, the music recommender system application is designed,
implemented, and evaluated.
8
The widespread use of computer networks has facilitated the retrieval
of various types of media such as videos, books, and music, among
others. Music, in particular, is a significant aspect of people's lives
and they often engage in listening to it. However, the challenge is
managing the vast amount of music available to society. To address
this, an effective music recommender system should be able to
automatically identify user preferences and generate playlists
accordingly. The proposed system detects music plagiarism by
comparing the similarity between input music and query music that is
suspected of being plagiarized. The development of recommender
systems also provides an opportunity for the music industry to
aggregate users who share an interest in music. To create the best
music recommendation system, customized prediction algorithms
such as KNN and machine learning are utilized to predict users'
music preferences. Music is highly valued across all cultures and the
number of songs available for listening is immense. The collaborative
filtering algorithm, based on users' listening behavior and ratings, has
been found to perform well for music recommendation. The content-
based method can also be used to detect plagiarism by checking the
similarity between songs. Mood prediction can be used to
recommend songs to users based on their interests and emotional
state. Music recommender systems offer benefits to both users and
providers by keeping users engaged, reducing the burden of choice,
and providing the opportunity to discover new music.
9
Literature Review
A music recommendation system is a technology that uses algorithms

and data analysis to suggest music to listeners based on their personal
preferences and listening history. This system collects information about
the user's playlists, favorite artists, and genres, as well as other factors
such as the popularity and release date of songs. The goal is to provide
a personalized listening experience and introduce users to new artists,
songs, and genres. There are two main types of music recommendation
systems: collaborative filtering, which analyzes the listening habits of a
group of users, and content-based filtering, which focuses on the
attributes of individual tracks. Music recommendation systems are
commonly used by music streaming services like Spotify, Apple Music,
and Pandora to improve the user experience and keep users engaged.
They have become an important aspect of the music industry, enabling
artists to reach new audiences and listeners to discover new music.
Lops, De Gemmis, and Semeraro (2010) delve into the key concepts of
content-based recommender systems in their article "Content-based
Recommender Systems: State of the Art and Trends". They discuss the
different methods for expressing structured and unstructured data
retrieval and analysis. The article is divided into three parts. The first part
focuses on the fundamental concepts of content-based recommender
systems, including their architecture and advantages and disadvantages.
The second part examines traditional and innovative methods for
encoding objects and user profiles in various application sectors. Finally,
the article discusses the matching algorithm used to recommend
personalized music to listeners. Two main approaches to the matching
algorithm are collaborative filtering and content-based filtering, with
collaborative filtering being a technique used in recommendation
systems to provide customized suggestions to users based on their
preferences and the preferences of other users. R. Burke's article,
"Hybrid Recommender Systems: Survey and Experiments," discusses
different recommendation techniques and their strengths and
10
weaknesses. The article compares these techniques and shows that

they are more effective when combined than when analyzed individually.
This finding has led to research on hybrid recommendation systems that
integrate various techniques for better performance. In 2012, Luo
Zhenghua explored the use of association rules in personalized
recommendation systems for book sales in e-commerce business
systems. The implementation of such systems involves special
processes for information input, knowledge processing, association rule
existence, and personalized recommendation. The results of the study
showed excellent performance on the website that implemented the
system.
TYPES OF RECOMMENDER SYTEMS:-
Recommender systems have been widely studied and can be classified

into three categories: collaborative filtering, content-based, and context-
based. Collaborative filtering relies on user reviews to build user and
item profiles for making recommendations. This method is considered
the most basic and has been popularized by the Netflix Competition, but
it has some disadvantages. Content-based systems focus on the
features of products and create user profiles based on previous reviews
and item profiles based on features and reviews. This approach helps
overcome sparsity problems in collaborative filtering. Context-based
systems extend the user/item approach to include contextual
information, eliminating the need for users to fill out personal details.
 COLLABORATIVE FILTERING:-
Collaborative filtering is a technique used in recommender systems
to provide personalized recommendations to users based on their
preferences and behavior. This technique works by analyzing user
data, such as their past purchases or ratings, and comparing it to
data from other users to find patterns and similarities. Based on
these patterns, the system can suggest items that a user might like.
Collaborative filtering can be used in various domains such as e-
11
commerce, social networks, and music streaming services.
Fig.1 Listen by her recommend to him
Memory-based Collaborative Filtering:

Memory-based Collaborative Filtering is a type of recommendation
system that uses past user-item interactions to predict
recommendations. It determines the similarity between users or
items and uses that information to make suggestions. While it's a
straightforward and easy-to-implement approach, it can be
challenging to scale and handle new users or items.
 Model-based Collaborative Filtering:
Model-based Collaborative Filtering (CF) is a method used in
recommendation systems that utilizes machine learning models to
estimate user preferences or ratings for a specific item. Its primary
objective is to forecast the preferences or ratings of a user for a
particular item based on the preferences or ratings of other users
for that same item. This technique is typically compared to memory-
based collaborative filtering, which computes user or item similarity
directly from the user-item matrix. Model-based CF can better
handle vast amounts of data and sparse data, and can also capture
intricate associations between users and items.
12
 Hybrid Collaborative Filtering:

A hybrid collaborative filtering (CF) is a type of recommendation
system that aims to enhance the accuracy and diversity of
recommendations by combining two or more recommendation
approaches. One example of a hybrid system is the combination of
content-based filtering and collaborative filtering to provide
recommendations. While content-based filtering uses item
attributes to recommend similar items, collaborative filtering relies
on user behavior and preferences to make recommendations. By
utilizing both approaches, a hybrid CF system can take advantage
of the strengths of each method to offer more accurate and diverse.
13
 Hybrid Approach for Collaborative Filtering: The authors Gilbert

Badaro, Hazem Hajj, Wassim El-Hajj, and Lama Nachman
published a paper in 2013 that discusses a hybrid approach for
collaborative filtering in recommender systems. They explain that
this technique combines user- and item-based collaborative
filtering to improve the accuracy of recommendations and
address the problem of sparse data. The proposed method uses
a weighted combination of both approaches to predict the ratings
of unrated items in the user-item ranking matrix. The authors
conducted an analysis to show that their approach outperforms
fully user-based or item-based collaborative filtering methods.
 CONTENT-BASED RECOMMENDER SYSTEM:

A Content-based Recommender System is a recommendation
system that suggests items based on their properties or
attributes. It examines the content or characteristics of items that
a user has previously enjoyed and utilizes this data to
recommend similar items. The main idea behind content-based
recommendation is that if a user likes a certain item, they are
more likely to enjoy other items that share similar features or
properties. Machine learning algorithms and natural language
processing techniques can be utilized to extract relevant
attributes or features of items to build this type of recommender
system. Content-based recommendation is commonly employed
in applications such as recommending books, movies, music, or
articles based on a user's previous preferences.
Anand Shanker Tewari, Abhay Kumar, and Asim Gopal proposed

a new method for book recommendation systems by combining
content-based filtering, collaborative filtering, and association rule
mining. They suggest that parameters like book content and
quality can be considered by doing collaborative filtering of
ratings given by other consumers. Their aim is to suggest books
to customers based on their interests. This method works offline
14
and stores recommendations in the buyer's internet profile. It

determines the category of books that the customer has
previously purchased from their internet history.
The information flow in a content-based recommendation system
is shown in Figure 2. The system takes relevant entities and their
relations for an item as input and extracts major features of the
item from an item ontology. These features, along with the user's
ratings and user modeling data, are then used to generate
recommended items as output.
Fig 2. Content Based Recommender System
 CONTEXT BASED RECOMMENDER SYSTEM:

A context-based recommendation system is designed to provide
better recommendations to a user by considering situational
information about a specific group of people. This information is
gathered through techniques such as GPS and social data, with
the aim of improving the user's profile. Contextual data includes
factors such as location, time, weather, and social data, which
can be either dynamic or static. Dynamic factors change over
time and may be refined through user feedback, while static
factors remain stable. For example, when buying a cell phone,
the contextual factors could include the time and purpose of the
purchase, which are only considered for that specific purpose.
The challenge with dynamic factors is to determine when to
switch to a different underlying context model. The
15
recommendation system only considers these factors for the

entire purchasing purpose.
 Non-Personalized Recommender Systems: An article by Anil
Poriya, Neev Patel, Tanvi Bhagat, and Rekha Sharma, Ph.D
explains that many websites nowadays heavily rely on
recommender systems, but some of these systems are non-
personalized and provide the same recommendations for all
users. Two common techniques for non-personalized
recommendations are collective opinion recommenders and
basic product association recommenders, which are based on
average scores or provide recommendations within a specific
context. However, since these recommendations [9] lack
personalization, they are unappealing to users. Therefore,
cooperative filtering is used to overcome this limitation and
provide personalized recommendations. The advantage of
cooperative filtering is that it doesn't require item knowledge and
is domain-independent, with low machine time for model-based
approaches
16
Methodology & Structural Design
 Machine learning
Machine learning is a subfield of artificial intelligence that focuses

on enabling machines to learn from data and make predictions or
decisions without being explicitly programmed. The learning
process involves algorithms that identify patterns in data and
adjust their parameters to improve their predictions over time.
There are several types of machine learning, including

supervised learning, unsupervised learning, and reinforcement
learning. In supervised learning, the algorithm learns from
labeled data and is trained to predict outputs based on new
inputs. Unsupervised learning, on the other hand, involves
learning from unlabeled data and identifying patterns or clusters
in the data. Reinforcement learning is a type of learning where
the algorithm learns from feedback provided by the environment,
and its goal is to maximize a reward function.
Machine learning is being used in various applications, such as

image recognition, natural language processing, and predictive
modeling. It has become an essential tool for businesses and
organizations to extract valuable insights from their data and
improve their decision-making processes.
 Supervised Machine Learning:-

Supervised learning is a type of machine learning where the
algorithm learns from labeled data. In supervised learning, a
dataset is provided which has both input and output
variables, and the algorithm tries to learn the mapping
function between the input and the output variables. The
goal of supervised learning is to predict the output variable
for new input data. It is called supervised because the
17
algorithm is trained on labeled data, where the correct

answer or output is already known, and the algorithm is
supervised by a teacher or a labeled dataset.
Supervised learning is a type of machine learning that is
used when the output is classified or labeled. These
algorithms analyze past data (known as training data) to
make predictions about future events in new data. To
accurately predict test data, a large dataset is required to
identify patterns. The algorithm can be further trained by
comparing training outputs to actual outputs and adjusting
for errors. In supervised learning, the ML algorithm is given
a small training dataset to work with, which gives it a basic
understanding of the problem and provides labeled
parameters required for the solution. The algorithm then
establishes cause-and-effect relationships between
variables, finding relationships between the parameters
given. The algorithm learns from the final dataset in the
same way as the training dataset, improving itself and
discovering new patterns and relationships.
 Unsupervised Machine Learning:

Unsupervised learning algorithms are utilized when the
output is unknown and there are no labels available. These
algorithms analyze data to identify hidden patterns without
any prior knowledge of what the final output should look like.
Unlike supervised learning, where labeled data is used to
find the exact relationship between data points,
unsupervised learning discovers hidden structures between
data points without any input from humans. This makes
unsupervised learning more adaptable as it can dynamically
change hidden structures to fit the data. It offers more post-
deployment development compared to supervised learning
algorithms which have a defined and set problem statement.
18
 Reinforcement Machine Learning:

Reinforcement learning is a type of machine learning
algorithm that uses a trial and error approach to optimize its
output. The algorithm is placed in an environment with an
interpreter and a reward system. After each iteration, the
interpreter evaluates the algorithm's output and provides
feedback in the form of rewards or punishments. The
algorithm is then encouraged to optimize its output to
maximize its rewards. In many cases, the reward system is
tied to the effectiveness of the solution, expressed as a
percentage value. The algorithm continues to iterate until it
finds the best possible solution for the highest possible
reward.
 Semi-Supervised Machine Learning:

Semi-supervised machine learning algorithms work with
both labeled and unlabeled data, but the amount of
unlabeled data is generally much larger. This type of
machine learning is considered a combination of supervised
and unsupervised learning algorithms. By using both labeled
and unlabeled data, the accuracy of the learning process
can be improved.
 Classification:
Classification refers to a type of problem in machine
learning where the objective is to identify the category or
class that a given input belongs to. This involves training a
model using a set of labeled data, where the correct output
category is known for each input. Once the model has been
trained, it can be used to predict the category of new,
unlabeled data points based on their features and the
patterns learned during training.
19
 Regression:
Regression is a type of machine learning algorithm that is
used to estimate or predict continuous numeric output
values based on the input data. It involves analyzing the
relationship between a dependent variable and one or more
independent variables to create a mathematical model. The
model generated can then be used to make predictions on
new input data by estimating the output value based on the
input values.
 Python:
Python is a high-level, interpreted programming language that is
widely used for web development, data analysis, artificial
intelligence, scientific computing, and many other applications. It
was created by Guido van Rossum and first released in 1991.
Python emphasizes code readability and ease of use, making it a
popular choice for beginners and experienced programmers
alike. It supports multiple programming paradigms, including
procedural, object-oriented, and functional programming. Python
has a large and active community, which has developed
numerous libraries and frameworks to extend its functionality and
ease the development process.
 Features of Python: The key features of Python are:
1. Easy to Learn: Python is easy to learn and has a simple syntax

that is similar to the English language. This makes it a popular
choice for beginners.
2. Interpreted: Python is an interpreted language, which means that
you can execute the code line by line, making it easier to debug
and test.
3. Cross-Platform: Python can run on various platforms, including
Windows, macOS, Linux, and Unix.
20
4. Open Source: Python is open source, which means that it is free

to use and distribute.
5. Object-Oriented: Python is an object-oriented language, which
means that it supports object-oriented programming (OOP)
concepts like encapsulation, inheritance, and polymorphism.
6. Large Standard Library: Python comes with a large standard
library that provides support for a wide range of tasks, from web
development to machine learning.
7. Third-Party Libraries: Python has a large number of third-party
libraries that can be easily installed and used in your projects.
8. High-Level Language: Python is a high-level language, which
means that it abstracts away low-level details, making it easier to
write and understand code.
9. Dynamic Typing: Python is a dynamically typed language, which
means that the type of a variable is determined at runtime,
making it more flexible and easier to use.
10. Versatile: Python can be used for a wide range of tasks,
including web development, data analysis, scientific computing,
machine learning, and more.
 Python libraries:
There are many Python libraries available for different tasks
and purposes. Here are some popular libraries and their
brief descriptions:
 NumPy: NumPy is the fundamental library for

numerical and scientific computing in Python. It
provides powerful tools for linear algebra, random
number generation, Fourier transforms, and more.
NumPy arrays are used extensively in data science
and machine learning applications.
 Pandas: Pandas is a library for data manipulation and
analysis. It provides tools for reading and writing data
in various formats, cleaning and transforming data, and
21
performing complex queries on data. Pandas has

become a standard tool in the data science ecosystem.
 Matplotlib: Matplotlib is a plotting library for Python. It
provides a variety of plots, including line, scatter, bar,
and histogram plots. Matplotlib can be used to create
publication-quality figures, and it is highly
customizable.
 OpenCV: OpenCV is a library for computer vision and
image processing. It provides tools for image and video
capture, image filtering and transformation, object
detection and recognition, and more. OpenCV is widely
used in industry and academia for a variety of
applications, including robotics, surveillance, and
medical imaging.
 TensorFlow: TensorFlow is a library for building and
training deep learning models. It provides tools for
creating neural networks, including convolutional and
recurrent networks. TensorFlow can be used for a
variety of applications, including image and speech
recognition, natural language processing, and robotics.
 Jupyter Notebook:
Jupyter Notebook is an open-source web application that allows
users to create and share documents containing live code,
equations, visualizations, and narrative text. It supports various
programming languages, including Python, R, Julia, and others.
Jupyter Notebook allows users to write and run code in a web-
based environment, making it easy to explore data, experiment
with different algorithms, and share findings with others.
One of the primary benefits of Jupyter Notebook is that it

promotes interactive computing. Users can write code and see
the results in real-time, making it easier to experiment and
explore data. It also allows for the integration of data
22
visualizations and documentation, which can help users

communicate their findings more effectively.
 Google Colab:
Google Colaboratory, also known as Google Colab, is a cloud-
based platform for working with Jupyter Notebooks. It provides
free access to computing resources, including GPUs, and allows
users to run code, create and share notebooks, and collaborate
with others in real-time.
Google Colab runs on Google's cloud infrastructure and is
integrated with other Google services, such as Google Drive and
Google Sheets. It allows users to write and run Python code in a
browser-based environment, without needing to install any
software or set up a local development environment. This makes
it accessible to anyone with an internet connection and a web
browser.
Structural Design:-
A music recommendation system architecture typically involves

several components working together to provide personalized music
recommendations to users. Here is a general overview of the system
architecture:
1. Data Ingestion: The system begins by ingesting data from
various sources such as music streaming platforms, user
listening history, user profile data, and metadata about songs and
artists.
2. Data Processing and Feature Extraction: Once the data is
ingested, the system processes and cleans the data, extracts
relevant features from it, and transforms it into a format that can
be used for building the recommendation model.
3. Recommendation Engine: The recommendation engine is the
core of the system. It uses various algorithms and techniques to
generate personalized recommendations for users based on their
23
listening history, preferences, and behavior.

4. User Interface: The user interface is the frontend of the system,
through which users interact with the system. It can be a web or
mobile application that displays the recommended songs,
playlists, or artists to users.
5. Feedback Loop: The system should continuously gather
feedback from users, such as whether they liked or disliked a
recommended song, and use this feedback to improve the
recommendations in the future.
6. Model Training and Improvement: The recommendation engine
should be trained using machine learning algorithms and updated
regularly to improve the accuracy of the recommendations.
7. Database and Storage: The system requires a database to store
user data, song metadata, and other relevant information. It
should be scalable and easily accessible to the various
components of the system.
24
Graphical Representations
There are different ways to represent the architecture of a music

recommendation system graphically. One common approach is to use
a flowchart or block diagram that shows the main components and
their interactions. Another approach is to use a network diagram that
represents the relationships between the different entities in the
system, such as users, items, and features.
A flowchart for a music recommendation system typically includes the

following components: data collection, feature extraction, model
training, recommendation generation, and evaluation. The data
collection component is responsible for gathering information about
the users, items, and interactions between them. The feature
extraction component transforms the raw data into meaningful
features that can be used for modeling. The model training component
trains the recommendation algorithm on the extracted features, using
techniques such as collaborative filtering or content-based filtering.
The recommendation generation component generates a list of
recommended items for each user, based on their preferences and
behavior. The evaluation component measures the performance of the
recommendation system using metrics such as accuracy, coverage,
and diversity.
 The flowchart for a music recommendation system:
1. Data Collection: The first step is to gather data from various

sources, such as music streaming platforms, social media,
and user feedback.
2. Data Preprocessing: The collected data is preprocessed by
cleaning, filtering, and transforming it into a structured
format for analysis.
3. Feature Extraction: The system extracts features from the
preprocessed data, such as the genre, artist, album, and
user behavior.
25
4. Recommendation Model: The recommendation model uses

machine learning algorithms to analyze the extracted
features and create a personalized recommendation for the
user.
5. User Interface: The recommendation is presented to the
user through a graphical user interface, which can be a web
application, mobile app, or desktop software.
6. Feedback: The system collects feedback from the user
regarding the recommendation and uses it to improve future
recommendations.
7. System Maintenance: The system is continuously
maintained and updated to ensure optimal performance and
accuracy.
26
 Flow chart for a Music Recommendation System:-
 The Activity Diagram for a Music Recommendation System:
1. Start: User opens the music recommendation system

2. Login: User enters their login credentials or creates a new
account
3. User Profile: The system displays the user's profile, which
may include their preferences, history, and playlists
4. Music Selection: The user selects a music category or
enters a specific song, artist, or album.
5. Recommendation Engine: The system uses algorithms to
analyze the user's profile and music selection, and generate
personalized recommendations
6. Recommendations: The system displays the recommended
songs, artists, or playlists, along with a description of why
the recommendation was made
27
7. User Feedback: The user can provide feedback on the

recommendations, such as liking or disliking a song, or
saving a playlist
8. Update Profile: The system updates the user's profile based
on their feedback and activity
9. End: The user exits the music recommendation system.
 Activity Diagram for CF :-

28
 Activity Diagram for CB:-
 Sequence Diagram:-
A sequence diagram for a music recommendation system could

include the following steps:
1. User logs in to the system
2. System retrieves user data, including listening history and
preferences
3. User selects a music genre or artist
4. System retrieves recommendations based on user data and
selection
5. User listens to recommended music
6. System tracks user engagement with recommended music
7. System updates user data based on engagement with
recommended music
8. User can provide feedback on recommended music
29
9. System uses feedback to refine recommendations for future

use.
The sequence diagram would include arrows indicating the flow
of information and interactions between the user and the system
at each step. It would also include any necessary conditions or
decision points, such as the user selecting a genre or artist and
the system retrieving recommendations based on that selection.
 Sequence Diagram for CF:-

30
 Sequence Diagram for CB:-

31
Algorithms
We have employed four different algorithms to create an effective

recommendation system.
 Collaborative Filtering:-
Collaborative filtering is a commonly used algorithm for
developing a music recommendation system. It is a type of
recommendation system that uses the behavior of similar users
to make recommendations for a particular user. In collaborative
filtering, the system identifies users with similar preferences and
recommends items that these similar users have liked in the past
but the current user has not interacted with yet. It can be further
divided into two types: user-based collaborative filtering and item-
based collaborative filtering.
The item-based model [Figure A] operates on the assumption

that if certain users frequently listen to certain songs together,
those songs are likely to be similar and may be listened to
together in the future by other users.
Fig A. Item Based CF

32
The user-based similarity model [Figure B] suggests that users

who have similar listening histories, which means they have
listened to the same songs in the past, are likely to have similar
interests. As a result, they are more likely to listen to the same
songs in the future as well.
Fig B. User Based CF
 Popularity Based Model:-

In a popularity-based model, the most popular items among
users are recommended to all users, regardless of their individual
preferences. This approach assumes that the most popular items
are likely to be liked by most users and therefore result in high
user satisfaction. However, it doesn't take into account individual
user preferences and can lead to recommendations that are not
personalized to the user's taste.
 SVD Model:-
The SVD (Singular Value Decomposition) model is a
collaborative filtering algorithm that is commonly used for music
recommendation. The basic idea behind SVD is to factorize a
large matrix of user-item ratings into three smaller matrices: a
33
user matrix, an item matrix, and a weight matrix.

The user matrix contains information about the users, such as
their preferences, while the item matrix contains information
about the items, such as their features. The weight matrix
contains the actual ratings given by the users for the items.
Once the matrix has been factorized, the model can use it to
predict ratings for items that the user has not yet rated. This is
done by computing the dot product of the user vector and the
item vector, which gives an estimate of the user's rating for that
item.
The SVD model is known for its accuracy in predicting ratings
and is commonly used in online music streaming platforms such
as Spotify and Pandora to recommend songs to users based on
their listening history and preferences.
 KNN:- K-nearest neighbors (KNN) is a type of machine learning

algorithm used for classification and regression. In the context of
recommendation systems, KNN is often used for collaborative
filtering. The algorithm works by finding the K nearest neighbors
to a given user or item based on similarity metrics such as cosine
similarity or Pearson correlation. It then recommends items that
the nearest neighbors have rated highly but the given user or
item has not interacted with yet. KNN can be used for both user-
based and item-based collaborative filtering.
 Evaluation Metrics:-
Evaluation metrics are quantitative measures used to assess the
performance of a system or model. In the context of music
recommendation systems, some commonly used evaluation
metrics include:
1. Precision: This metric measures the percentage of
recommended songs that are actually relevant to the user. A
high precision score indicates that the recommended songs
are closely related to the user's preferences.
34
2. Recall: This metric measures the percentage of relevant

songs that were successfully recommended to the user. A
high recall score indicates that the system is able to identify
a large proportion of relevant songs.
3. F1 Score: This metric is the harmonic mean of precision and
recall. It provides a balanced measure of both precision and
recall.
4. Mean Average Precision (MAP): This metric is used to
evaluate the ranking of recommendations. It measures the
average precision of all relevant songs in a list of
recommended songs.
5. Normalized Discounted Cumulative Gain (NDCG): This
metric is used to evaluate the ranking of recommendations
as well. It measures the quality of a recommendation list by
considering the position of each relevant song in the list.
6. Coverage: This metric measures the percentage of songs in
the music catalog that are recommended to the user. A high
coverage score indicates that the system is able to
recommend a diverse range of songs to the user.
7. Mean Reciprocal Rank (MRR): This metric measures the
quality of the top recommendation. It calculates the average
reciprocal rank of the first relevant song in the list of
recommendations.
35
 SVM:-
SVM stands for Support Vector Machines, which is a supervised
machine learning algorithm used for classification and regression
analysis. It is a powerful and versatile algorithm that can handle
both linear and non-linear data. In SVM, the data is represented
as points in space, and the algorithm tries to find the best line or
hyper-plane that separates the data into different classes. The
best line or hyper-plane is the one that maximizes the margin
between the data points of different classes. SVM has been
successfully applied in a wide range of applications, including
image classification, text classification, and bioinformatics.
36
Implementation:-
Here’s is a general code template for a collaborative

filtering-based music recommendation algorithm.
import numpy as np
import pandas as pd
# Load the music dataset

music_data = pd.read_csv("music_data.csv")
# Create a user-item matrix

user_item_matrix = music_data.pivot_table(index='user_id',
columns='song_id', values='listen_count')
# Define a function to calculate the similarity between two users

def calculate_similarity(user1, user2):
user1_data = user_item_matrix.loc[user1]
user2_data = user_item_matrix.loc[user2]
common_items =
user_item_matrix.columns[(user1_data.notna()) &
(user2_data.notna())]
if len(common_items) == 0:
return 0
user1_ratings = user1_data[common_items]
user2_ratings = user2_data[common_items]
similarity = np.dot(user1_ratings, user2_ratings) /
(np.linalg.norm(user1_ratings) * np.linalg.norm(user2_ratings))
return similarity
# Define a function to recommend music to a user

def recommend_music(user):
# Calculate the similarities between the given user and all
37
other users
similarities = []
for other_user in user_item_matrix.index:
if other_user == user:
continue
similarity = calculate_similarity(user, other_user)
similarities.append((other_user, similarity))
# Sort the list of similarities in descending order

similarities = sorted(similarities, key=lambda x: x[1],
reverse=True)
# Find the top similar users

top_similar_users = [similarity[0] for similarity in
similarities[:10]]
# Find the music items that the top similar users have listened
to but the given user has not
recommendations = []
for other_user in top_similar_users:
other_user_data = user_item_matrix.loc[other_user]
user_data = user_item_matrix.loc[user]
common_items =
user_item_matrix.columns[(user_data.isna()) &
(other_user_data.notna())]
for item in common_items:
recommendations.append((item, other_user_data[item]))
# Sort the list of recommendations in descending order

recommendations = sorted(recommendations, key=lambda x:
x[1], reverse=True)
return [recommendation[0] for recommendation in

recommendations[:10]]
38
# Test the recommendation function

recommendations = recommend_music(1234)
print(recommendations)
39
Features and Capabilities Required
Functional Requirements are a set of specifications that describe the

behavior of a system or software in terms of what it should do
User profile creation: The system should allow users to create a profile
where they can provide information such as their age, gender, music
preferences, and other relevant details.
1. Search functionality: The system should allow users to search for

music by title, artist, genre, or other relevant parameters.
2. Recommendation algorithm: The system should have a
recommendation algorithm that analyzes user data, such as
listening history, likes, and dislikes, to recommend new music
that the user might enjoy.
3. Personalized playlists: The system should allow users to create
personalized playlists based on their music preferences.
4. Social sharing: The system should allow users to share their
playlists and music recommendations with others through social
media or other communication channels.
5. User feedback: The system should allow users to rate and
provide feedback on recommended music to improve the
recommendation algorithm.
6. Cross-platform compatibility: The system should be compatible
with various platforms such as web, mobile, and desktop to
enable users to access it from different devices.
7. Integration with music streaming services: The system should be
integrated with popular music streaming services such as Spotify,
Apple Music, and YouTube to provide a seamless user
experience.
8. Privacy and security: The system should ensure the privacy and
security of user data by implementing appropriate security
measures such as encryption, authentication, and access control.
40
Quality Attributes
Non-functional requirements for a music recommendation system can

include:
1. Performance: The system should be able to process large
amounts of data and provide recommendations quickly and
efficiently.
2. Reliability: The system should be available at all times and
should not crash or produce incorrect recommendations.
3. Security: The system should be designed to protect user data
and prevent unauthorized access.
4. Scalability: The system should be able to handle a growing
number of users and data without affecting performance.
5. Usability: The system should be user-friendly and easy to
navigate, with clear instructions and explanations.
6. Compatibility: The system should be compatible with different
devices, operating systems, and browsers.
7. Maintainability: The system should be easy to maintain and
update, with clear documentation and organized code.
8. Accessibility: The system should be accessible to users with
disabilities, with features such as screen readers and alternative
input methods.
41
Data Collection
Link:-
https://www.kaggle.com/datasets/maharshipandya/-spotify-tracks-
dataset
https://www.kaggle.com/datasets/amaanansari09/most-streamed-
songs-all-time
http://mac.citi.sinica.edu.tw/~yang/pub/poddar18smc.pdf%7D
42
Conclusion and future scope(s)
Conclusion:
In the end result, a music recommendation system is a useful
application that can help users discover new songs and artists based
on their listening history and preferences. The implementation of
various algorithms such as collaborative filtering, popularity-based
models, and SVD has led to the development of efficient music
recommendation systems. The evaluation metrics such as precision,
recall, and F1 score have helped in the assessment of the
performance of the system.
Future Scope:
There are several possibilities for future improvement and research in
this music recommendation systems. One of the areas that could be
explored is the incorporation of user demographic data such as age,
gender, and location to provide more personalized recommendations.
Another potential area of research is the use of deep learning models
and natural language processing techniques to analyze user reviews
and comments on songs and artists, which can further enhance the
accuracy of recommendations. Additionally, the integration of social
media platforms such as Facebook and Twitter could provide more
information about user preferences and listening habits, which can
lead to better recommendations.
43
REFERENCE
1.https://thesai.org/Publications/ViewPaper?Volume=9&Issue=
4&Code=IJACSA&SerialNo=25
2. https://www.researchgate.net/publication/271497743_Implem
ent_of_cloud_computing_for_e-learning_system
3.Zhang, J., Chow, T. W. S., Yang, H., & Chen, N. (2021). A

hybrid model for music recommendation based on user’s
listening history and lyrics analysis. Multimedia Tools and
Applications, 80(14), 20949-20968.
4.https://ieeexplore.ieee.org/document/8807891
5.Choi, K., & Lee, J. H. (2020). Personalized music

recommendation based on collaborative filtering and deep
learning using audio features. Expert Systems with
Applications, 146, 113210.
6.Yang, X., Wang, H., Jin, B., & Guo, M. (2020). A hybrid
approach for music recommendation based on user
preferences and audio content analysis. Multimedia Tools and
Applications, 79(13), 9025-9042.
7.https://www.researchgate.net/publication/338440327_Implem
entation_Of_SaaS_Cloud_Computing_Services_On_E-Learnin
g_Applications_Case_Study_PGRI_Foundation_School
8.Chen, Z., & Qin, T. (2020). Music recommendation based on

user and song classification using deep learning. Multimedia
Tools and Applications, 79(19), 13505-13523.
9.https://www.ijser.org/paper/Solution-to-implement-e-learning-s
ystem-based-on-cloud-computing.html
44
10. Li, J., Liu, B., & Zhou, J. (2019). A deep collaborative
filtering model for music recommendation. Multimedia Tools
and Applications, 78(7), 8095-8107.
45

Report Major Project by Shubham

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Report Major Project by Shubham

Uploaded by

Copyright:

Available Formats

1

AMITY UNIVERSITY UTTAR PRADESH

MAJOR PROJECT REPORT

“Music Recommendation System”

Submitted as a partial fulfilment toward the award of

I hereby certify that the work which is being offered in entitled

The matter provided on this undertaking record has not been

Signature of Faculty Guide

I would love to express my sincerest gratitude and

I would, then the like to vicinity on record my best regards and

I understand this possibility as a huge milestone in my

S No. CONTENT PAGE No.

10 Features and 30-33

Recommender systems are crucial in the world of e-commerce,

Keywords— E-commerce, recommender system, collaborative

In the digital age, music is a popular entertainment medium that is

A music recommendation system is a technology that uses algorithms

weaknesses. The article compares these techniques and shows that

TYPES OF RECOMMENDER SYTEMS:-

Recommender systems have been widely studied and can be classified

commerce, social networks, and music streaming services.

Fig.1 Listen by her recommend to him

Memory-based Collaborative Filtering:

 Hybrid Collaborative Filtering:

 Hybrid Approach for Collaborative Filtering: The authors Gilbert

 CONTENT-BASED RECOMMENDER SYSTEM:

Anand Shanker Tewari, Abhay Kumar, and Asim Gopal proposed

and stores recommendations in the buyer's internet profile. It

Fig 2. Content Based Recommender System

 CONTEXT BASED RECOMMENDER SYSTEM:

recommendation system only considers these factors for the

Methodology & Structural Design

Machine learning is a subfield of artificial intelligence that focuses

There are several types of machine learning, including

Machine learning is being used in various applications, such as

 Supervised Machine Learning:-

algorithm is trained on labeled data, where the correct

 Unsupervised Machine Learning:

 Reinforcement Machine Learning:

 Semi-Supervised Machine Learning:

 Features of Python: The key features of Python are:

1. Easy to Learn: Python is easy to learn and has a simple syntax

4. Open Source: Python is open source, which means that it is free

 NumPy: NumPy is the fundamental library for

performing complex queries on data. Pandas has

One of the primary benefits of Jupyter Notebook is that it

visualizations and documentation, which can help users

A music recommendation system architecture typically involves

listening history, preferences, and behavior.

There are different ways to represent the architecture of a music

A flowchart for a music recommendation system typically includes the

 The flowchart for a music recommendation system:

1. Data Collection: The first step is to gather data from various

4. Recommendation Model: The recommendation model uses

 Flow chart for a Music Recommendation System:-

 The Activity Diagram for a Music Recommendation System:

1. Start: User opens the music recommendation system

7. User Feedback: The user can provide feedback on the

 Activity Diagram for CF :-

 Activity Diagram for CB:-

A sequence diagram for a music recommendation system could

9. System uses feedback to refine recommendations for future

 Sequence Diagram for CF:-

 Sequence Diagram for CB:-