Professional Documents
Culture Documents
Collaborative Filtering & Recommendation System
Collaborative Filtering & Recommendation System
Marketing Analytics
BAN 6065
Professor Jia Li
Spring 2021
Today’s Agenda
• Product Analytics
– Recommendation System
1
Netflix
• Online streaming video service
– The company started as an online DVD rental
service
– Started streaming video in 2007
– Today, more than 158 million subscribers in 190
countries
• The best stock in the past 10 years
– 10-year cumulative return: 3,522%
2
Other Examples
• Retail: Amazon
• Friends: Facebook
• Music: Spotify
• Websites: Reddit
Question Time
• Question 1
Accompanying Questions:
https://wakeforest.qualtrics.com/jfe/form/SV_397533jR7DWEmZE
(Link available at Canvas; right below the link to this video)
3
Collaborative Filtering
Key Ideas
• Intuition: Low tech way to get recommendation - ask your friends!
– Some of your friends have better “taste” than others (likely-minded)
4
Input Data
• Explicit (Questioning)
– Explicit rating (1 -5 numerical ratings)
– Favorites (Likes): 1 (liked), 0 (No vote), -1 (disliked)
• Implicit (Behavioral)
– Purchase: 1 (bought), 0 (didn’t buy)
– Clicks: 1 (clicked), 0 (didn’t click)
– Reads: 1 (read), 0 (didn’t read)
– Watching a Video: 1 (watched), 0 (didn’t watch)
– Hybrid: 2 (bought), 1 (browsed), 0 (didn’t buy)
5
Collaborative Filtering Tasks
1. Finding Similar Users: Calculating Similarities
6
Cosine similarity
1 0 1 1
0 1 0 1
1 0 0 1
0
1 1 1
0 1 0 1
7
Recommend movies to Toby based on
the user ranking
8
Recommending Items – 2/2
Lady in Snake Just Superm
Customer You, Me, The Night
Similarity the on a My an
ID and Dupree Listener
Water Plane Luck Returns
Michael 0.99 2.5 3.5 3.0 3.5 2.5 3.0
Jay 0.38 3.5 3.5 3.0
July 0.89 3.5 3.0 4.0 2.5 4.5
Peter 0.92 3.0 4.0 2.0 3.0
Stephen 0.66 3.0 4.0 5.0 3.0
Weighted
5.24 10.26
Rating Sum
Sum of
1.91 2.80
Weight
Normalized
2.74 3.66
Rating
9
Item-based Collaborative Filtering
Example
Product Product Product
1 2 3
Customer1 2 3
Customer2 5 2
Customer3 3 3 1
Customer4 2 2
Item-item similarity is computed by looking into co-rated
items only.
Question Time
• Question 2
• Question 3
• Question 4
Accompanying Questions:
https://wakeforest.qualtrics.com/jfe/form/SV_397533jR7DWEmZE
(Link available at Canvas; right below the link to this video)
10
Problems with Collaborative Filtering
• When data are sparse, correlations (weights) are based on
very few common items -> unreliable
Content-based Recommendations
1. Defined features and feature values
4. Recommend items that are similar to those that a user liked in the
past
11
Example of Content-based approach
• Movie content
• Genre, actors, director, movie summary, ….
12
Question Time
• Question 5
Accompanying Questions:
https://wakeforest.qualtrics.com/jfe/form/SV_397533jR7DWEmZE
(Link available at Canvas; right below the link to this video)
13
Contest Rules
• If the grand prize as not yet reached, progress prizes
of USD50,000 per year would be awarded for the
best result so far, as long as it had >1% improvement
over the previous year.
Initial Results
• The contest went live on October 2, 2006.
14
Progress During the Contest
• By June 2007, over 20,000 teams had registered from
over 150 countries.
Competition intensified …
• The 2008 progress prize went to Team BellKor which
contained researchers from the original BellKor team
as well as the Team BigChaos.
15
Last Call announced …
• On June 26, 2009, the Team BellKor’s Pragmatic
Chaos submitted a 10.05% improvement over
Cinematch.
16
Winner is declared!
• On September 18, 2009, a winning team was
announced
• BellKor’s Pragmatic
Chaos won the
competition and the
USD1,000,000 grand
prize.
17