2015/2/27
Scaling-up Item-based Collaborative Filtering
Recommendation Algorithm based on Hadoop
Jing Jiang, Jie Lu, Guangquan Zhang, Guodong Long. 2011 IEEE World Congress on Services
outline
✤ Collaborative Filtering
✤ scaling-up item-based CF
✤ experimentation and evaluation
Collaborative Filtering
✤ Collaborative filtering (CF) techniques have achieved
widespread success in E-commerce nowadays.
Collaborative Filtering
✤ Collaborative filtering is a method of making
automatic predictions (filtering) about the interests of
a user by collecting preferences or taste information
from many users (collaborating). (Source: Wikipedia)
Collaborative Filtering
1. Weight all users with respect to similarity with active user
2. Select a subset of users to use as a set of predictors
3. Compute a prediction from a weighted combination of selected
neighbors’ ratings
Simple computation example (each user's ratings as a vector):
Nathan [5,1,5]
Joe [5,2,5]
John [2,5,2.5]
Al [2,2,4]
Use cosine similarity to compare Nathan with each other user:
cos(Nathan, Joe) = 0.99
cos(Nathan, John) = 0.64
cos(Nathan, Al) = 0.91
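The similarity values above can be reproduced with a few lines of Python. This is a minimal sketch, assuming the three numbers per user are ratings of the same three items in the same order; the function name `cosine` and the dictionary layout are illustrative choices, not part of the slides.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length rating vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Rating vectors copied from the slide.
ratings = {
    "Nathan": [5, 1, 5],
    "Joe": [5, 2, 5],
    "John": [2, 5, 2.5],
    "Al": [2, 2, 4],
}

# Similarity of every other user to the active user, Nathan.
similarities = {
    name: cosine(ratings["Nathan"], vec)
    for name, vec in ratings.items()
    if name != "Nathan"
}
for name, sim in similarities.items():
    print(f"cos(Nathan, {name}) = {sim:.4f}")
```

The computed value for John is about 0.649, which the slide shows truncated to 0.64.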
Predicted rating = (0.99*4 + 0.64*3 + 0.91*2) / (0.99 + 0.64 + 0.91) = 3.03
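The weighted combination can be checked in code. This is a sketch only: `weighted_prediction` is a hypothetical helper name, and the pairing of each similarity with a rating (4, 3, 2) follows the order of the terms in the slide's formula.

```python
def weighted_prediction(neighbors):
    """Similarity-weighted average of the neighbors' ratings.

    neighbors: list of (similarity, rating) pairs for the selected users.
    """
    numerator = sum(sim * rating for sim, rating in neighbors)
    denominator = sum(sim for sim, _ in neighbors)
    return numerator / denominator

# (similarity to Nathan, neighbor's rating of the target item), from the slide.
neighbors = [(0.99, 4), (0.64, 3), (0.91, 2)]
prediction = weighted_prediction(neighbors)
print(round(prediction, 2))  # 3.03
```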
Collaborative Filtering
✤ User-Based CF: compute similarity based on users
✤ Item-Based CF: compute similarity based on items
Collaborative Filtering
✤ User-Based CF
Compute similarity based on users.
To predict user A's rating of item4:
user B's rating of item4 is 5
user F's rating of item4 is 1
rating(user A, item4) = (5 * similarity(user A, user B) + 1 * similarity(user A, user F)) / (similarity(user A, user B) + similarity(user A, user F))
Collaborative Filtering
✤ Item-Based CF
Compute similarity based on items.
To predict user A's rating of item4:
user A's rating of item2 is 1
user A's rating of item3 is 1
rating(user A, item4) = (1 * similarity(item2, item4) + 1 * similarity(item3, item4)) / (similarity(item2, item4) + similarity(item3, item4))
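The item-based prediction can be sketched as a small function. The similarity values 0.8 and 0.5 below are hypothetical, since the slide does not give them, and `predict_item_based` is an illustrative name rather than anything from the paper.

```python
def predict_item_based(user_ratings, item_sims, target):
    """Similarity-weighted average of the user's own ratings of other items.

    user_ratings: {item: rating} for items the user has rated
    item_sims: {(item, target): similarity}
    """
    numerator = 0.0
    denominator = 0.0
    for item, rating in user_ratings.items():
        sim = item_sims.get((item, target))
        if sim is None:
            continue  # no similarity known for this pair
        numerator += rating * sim
        denominator += sim
    if denominator == 0:
        return None  # prediction is undefined without usable neighbors
    return numerator / denominator

# User A's known ratings, from the slide.
user_a = {"item2": 1, "item3": 1}
# Hypothetical similarity values; the slide does not state them.
sims = {("item2", "item4"): 0.8, ("item3", "item4"): 0.5}
prediction = predict_item_based(user_a, sims, "item4")
print(prediction)
```

Because user A rated both neighboring items 1, the prediction is 1 for any positive similarity values: a weighted average of identical ratings is that rating.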
scaling-up item-based CF
The CF algorithm is divided into two steps:
1. Similarity computation
2. Prediction and recommendation
Item similarity is measured with the Pearson correlation, which ranges from -1 to 1. Its numerator is the covariance term of the two items' ratings.
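For reference, the standard form of the Pearson correlation between two items i and j is given below. The slide's own formula did not survive extraction, so this is the textbook definition rather than a copy of it; the symbol U (the set of users who rated both items) is an assumption, while the mean ratings of items i and j match the Ri and Rj used in the worked example.

```latex
\mathrm{sim}(i, j) =
  \frac{\sum_{u \in U} (R_{u,i} - \bar{R}_i)(R_{u,j} - \bar{R}_j)}
       {\sqrt{\sum_{u \in U} (R_{u,i} - \bar{R}_i)^2}\,
        \sqrt{\sum_{u \in U} (R_{u,j} - \bar{R}_j)^2}}
```

The numerator is the (unnormalized) covariance of the two rating columns, and the denominator scales the result into the range -1 to 1.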
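scaling-up item-based CF
Similarity computation
        apple  milk  toast
sam     2      0     4
john    5      5     3
tim     2      4     ?
Rows are users (u). Here item i = apple and item j = toast.
Mean rating of item i: Ri = (2+5+2)/3 = 3
Mean rating of item j: Rj = (4+3)/2 = 3.5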
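The item means and a Pearson similarity for this table can be computed as follows. This is a sketch under two assumptions: each item mean is taken over all users who rated that item (as in Ri = (2+5+2)/3), and the sums in the correlation run only over users who rated both items. The function names are illustrative.

```python
import math

# Rating table from the slide; None marks the unknown rating (tim, toast).
ratings = {
    "sam":  {"apple": 2, "milk": 0, "toast": 4},
    "john": {"apple": 5, "milk": 5, "toast": 3},
    "tim":  {"apple": 2, "milk": 4, "toast": None},
}

def item_mean(item):
    """Mean of the known ratings for one item."""
    values = [r[item] for r in ratings.values() if r[item] is not None]
    return sum(values) / len(values)

def pearson(item_i, item_j):
    """Pearson correlation over users who rated both items."""
    mean_i, mean_j = item_mean(item_i), item_mean(item_j)
    co_raters = [
        r for r in ratings.values()
        if r[item_i] is not None and r[item_j] is not None
    ]
    cov = sum((r[item_i] - mean_i) * (r[item_j] - mean_j) for r in co_raters)
    var_i = sum((r[item_i] - mean_i) ** 2 for r in co_raters)
    var_j = sum((r[item_j] - mean_j) ** 2 for r in co_raters)
    if var_i == 0 or var_j == 0:
        return 0.0  # assumption: treat a constant column as uncorrelated
    return cov / math.sqrt(var_i * var_j)

print(item_mean("apple"), item_mean("toast"))  # 3.0 3.5
print(pearson("apple", "toast"))
```

With only two co-raters (sam and john), the apple/toast correlation comes out strongly negative, about -0.95, which shows how unstable the measure is on very small overlaps.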
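scaling-up item-based CF
Similarity computation
For the same ratings, with user u = sam, item j = apple and item i = toast:
Mean rating of user u: Ru(sam) = (2+0+4)/3 = 2
Mean rating of item j: Rj = (2+5+2)/3 = 3
Mean rating of item i: Ri = (4+3)/2 = 3.5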
scaling-up item-based CF
The three computation-intensive parts are:
(1) computing the average rating for each item
(2) computing the similarity between item pairs
(3) computing predicted items for the target user
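Part (1) maps naturally onto a map/reduce pair keyed on the item. The following is a minimal in-memory sketch, not Hadoop code: `run_job`, `map_by_item`, and `reduce_average` are hypothetical names, and the records reuse the toy apple/milk/toast ratings from the earlier slides.

```python
from collections import defaultdict

# (user, item, rating) records; the unknown rating (tim, toast) is omitted.
records = [
    ("sam", "apple", 2), ("sam", "milk", 0), ("sam", "toast", 4),
    ("john", "apple", 5), ("john", "milk", 5), ("john", "toast", 3),
    ("tim", "apple", 2), ("tim", "milk", 4),
]

def map_by_item(record):
    """Map: emit (item, rating) so all ratings of one item reach one reducer."""
    _user, item, rating = record
    yield item, rating

def reduce_average(item, item_ratings):
    """Reduce: average rating for one item."""
    return item, sum(item_ratings) / len(item_ratings)

def run_job(records, mapper, reducer):
    """Tiny in-memory stand-in for the shuffle phase of a MapReduce job."""
    groups = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)
    return dict(reducer(key, values) for key, values in groups.items())

item_means = run_job(records, map_by_item, reduce_average)
print(item_means)  # {'apple': 3.0, 'milk': 3.0, 'toast': 3.5}
```

Keying on the item means each reducer needs only one item's ratings, so the work splits across nodes without shared state. Parts (2) and (3) would follow the same pattern with different keys.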
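[Figures: MapReduce data flow for the three parts. Recoverable labels: the rating of item i by user j; a map keyed on item i; a similarity step defined over the set of users who rated both item k and item l; a map keyed on user j.]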
experimentation and evaluation
The cluster had 3 nodes, each with an Intel P4 CPU, 1 GB RAM, and an 80 GB disk.
All machines were connected through one 100 Mbps switch.
