Introduction
Matrix Factorization Methods
Netflix Prize Competition
Conclusion
MATRIX FACTORIZATION TECHNIQUE FOR
RECOMMENDER SYSTEMS
Oluwashina Aladejubelo
Université Joseph Fourier,
Grenoble, France
June 6, 2015
Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
About Me
Bachelor of Science, Ambrose Alli University, Nigeria
(2004-2008)
IT Business Analyst, Virgin Nigeria Airlines (2009-2011)
Team Lead/Software Architect, Speckless Innovations
Limited (2011-2014)
Master of Informatics (M2 MOSIG), Université Joseph
Fourier, Grenoble (2014-2015)
Master's thesis on "Distributed Large-Scale Learning" with
Prof. Massih-Reza Amini
Overview
1 Introduction
2 Matrix Factorization Methods
3 Netflix Prize Competition
4 Conclusion
1 Introduction
Recommender Systems
Content Filtering Approach
Collaborative Filtering Approach
Content vs Collaborative Filtering
2 Matrix Factorization Methods
Matrix Factorization Model (MFM)
Stochastic Gradient Descent
Alternating Least Squares
Adding Biases
Additional Input Source
Temporal Dynamics
Varying confidence levels
3 Netflix Prize Competition
4 Conclusion
Recommender Systems
Recommender systems analyze patterns of user interest in
products to provide personalized recommendations
They seek to predict the rating or preference that a user would
give to an item
Such systems are very useful for entertainment products such
as movies, music, and TV shows.
Many customers will view the same movie and each customer
is likely to view numerous different movies.
A huge volume of data arises from customer feedback, which can
be analyzed to provide recommendations
Content Filtering Approach
creates a profile for each user or product to characterize its nature
programs then associate users with matching products
requires gathering external information that may not be available
Collaborative Filtering Approach
depends on past user behaviour, e.g. previous transactions or product ratings
does not rely on the creation of explicit profiles
the two primary areas of collaborative filtering are neighborhood methods and latent factor models
neighborhood methods are based on computing the relationships between items or, alternatively, between users
latent factor models try to explain the ratings by characterizing both items and users on, say, 20 to 100 factors inferred from the rating patterns
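To make the neighborhood idea concrete, here is a minimal sketch of an item-to-item relationship, assuming cosine similarity over the users who rated both items. The similarity measure, the toy ratings, and the function name `item_similarity` are illustrative assumptions, not something the slides specify.

```python
from math import sqrt

# Hypothetical toy data: user -> {item: rating}
ratings = {
    "ann": {"A": 5, "B": 4},
    "bob": {"A": 4, "B": 5, "C": 1},
    "cat": {"A": 1, "C": 5},
}


def item_similarity(ratings, i, j):
    """Cosine similarity between items i and j over users who rated both."""
    common = [u for u, r in ratings.items() if i in r and j in r]
    if not common:
        return 0.0
    dot = sum(ratings[u][i] * ratings[u][j] for u in common)
    norm_i = sqrt(sum(ratings[u][i] ** 2 for u in common))
    norm_j = sqrt(sum(ratings[u][j] ** 2 for u in common))
    return dot / (norm_i * norm_j)


sim_ab = item_similarity(ratings, "A", "B")  # rated alike by ann and bob
sim_ac = item_similarity(ratings, "A", "C")  # rated in opposite ways
```

In this toy data, items A and B come out as closer neighbors than A and C, so a user who liked A would be pointed to B first.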
Content vs Collaborative Filtering
collaborative filtering addresses data aspects that are difficult to profile
it is generally more accurate than content filtering
it suffers from the cold start problem (new product or new user), in which case content filtering is better
Matrix Factorization Model (MFM)
some of the most successful realizations of latent factor
models are based on matrix factorization
it characterizes both items and users by vectors of factors
inferred from item rating patterns
high correspondence between item and user factors leads to a
recommendation
MFM maps both users and items to a joint latent factor space of dimensionality $f$
user-item interactions are modeled as inner products in that space
each item $i$ is associated with a vector $q_i \in \mathbb{R}^f$
each user $u$ is associated with a vector $p_u \in \mathbb{R}^f$
the approximate user rating is given by
$$\hat{r}_{ui} = q_i^T p_u \tag{1}$$
carelessly addressing only the relatively few known entries is highly prone to overfitting
observed ratings can be modeled directly, with regularization, as follows
$$\min_{q^*, p^*} \sum_{(u,i) \in \kappa} (r_{ui} - q_i^T p_u)^2 + \lambda (\|q_i\|^2 + \|p_u\|^2) \tag{2}$$
$\kappa$ is the set of $(u, i)$ pairs for which $r_{ui}$ is known
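As a rough illustration of equations (1) and (2), the sketch below evaluates the prediction and the regularized squared error on a tiny hand-made example. It assumes the regularization term sits inside the sum over known ratings. The factor values, the dictionary layout, and the names `predict` and `objective` are hypothetical.

```python
def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def predict(q_i, p_u):
    """Equation (1): the estimated rating is the inner product q_i^T p_u."""
    return dot(q_i, p_u)


def objective(known, Q, P, lam):
    """Equation (2), with the penalty applied once per known rating (assumed)."""
    total = 0.0
    for (u, i), r in known.items():
        err = r - predict(Q[i], P[u])
        total += err ** 2 + lam * (dot(Q[i], Q[i]) + dot(P[u], P[u]))
    return total


# Hypothetical factors with f = 2 and two known ratings.
P = {"u1": [1.0, 0.5], "u2": [0.2, 1.0]}
Q = {"i1": [2.0, 1.0], "i2": [0.5, 3.0]}
known = {("u1", "i1"): 3.0, ("u2", "i2"): 3.0}

loss = objective(known, Q, P, lam=0.1)
```

Only the pairs in `known` contribute, which mirrors the restriction of the sum to the set $\kappa$.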
Stochastic Gradient Descent (SGD) - Simon Funk; 2006
SGD can be used to solve equation (2)
for each given training case, the system predicts $r_{ui}$ and computes the prediction error
$$e_{ui} = r_{ui} - q_i^T p_u$$
it then modifies the parameters by a magnitude proportional to the learning rate $\gamma$ in the opposite direction of the gradient, yielding
$$q_i \leftarrow q_i + \gamma (e_{ui} p_u - \lambda q_i)$$
$$p_u \leftarrow p_u + \gamma (e_{ui} q_i - \lambda p_u)$$
it combines implementation ease with a relatively fast running time
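The update rules above can be sketched as a small training loop. This is a minimal illustration rather than Simon Funk's implementation: the toy ratings, the initialization range, the hyperparameter values, and the names `sgd_train` and `rmse` are all assumptions, and the regularization weight in the updates is taken to be $\lambda$ from equation (2).

```python
import random
from math import sqrt


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def rmse(known, Q, P):
    """Root-mean-square error over the known ratings."""
    sq = sum((r - dot(Q[i], P[u])) ** 2 for (u, i), r in known.items())
    return sqrt(sq / len(known))


def sgd_train(known, f=2, gamma=0.05, lam=0.02, epochs=1000, seed=0):
    rng = random.Random(seed)
    users = sorted({u for u, _ in known})
    items = sorted({i for _, i in known})
    # Small random starting factors (assumed initialization).
    P = {u: [rng.uniform(0.0, 0.5) for _ in range(f)] for u in users}
    Q = {i: [rng.uniform(0.0, 0.5) for _ in range(f)] for i in items}
    start = rmse(known, Q, P)
    cases = sorted(known.items())
    for _ in range(epochs):
        rng.shuffle(cases)  # visit training cases in random order
        for (u, i), r in cases:
            e = r - dot(Q[i], P[u])  # prediction error e_ui
            q_old = Q[i]
            # Both updates use the values from before this step.
            Q[i] = [q + gamma * (e * p - lam * q) for q, p in zip(q_old, P[u])]
            P[u] = [p + gamma * (e * q - lam * p) for q, p in zip(q_old, P[u])]
    return P, Q, start


# Hypothetical ratings on a 1 to 5 scale.
known = {("u1", "i1"): 5.0, ("u1", "i2"): 3.0, ("u2", "i1"): 4.0, ("u2", "i3"): 1.0}
P, Q, rmse_before = sgd_train(known)
rmse_after = rmse(known, Q, P)
```

On such a small example the training error should drop well below its starting value. On real data the learning rate and regularization weight would normally be tuned on held-out ratings.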
Alternating Least Squares
because both $q_i$ and $p_u$ are unknown, equation (2) is not convex
if we fix one of the unknowns, the optimization problem becomes quadratic and can be solved optimally
when all $p_u$ are fixed, the system recomputes the $q_i$ by solving a least-squares problem, and vice versa
each step decreases the objective in equation (2) until convergence
massively parallelizable, since each $q_i$ (or each $p_u$) is computed independently of the others
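The alternation can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions: the penalty in equation (2) is counted once per known rating, so each row solves a ridge system scaled by its number of ratings. The toy data and the helper names `solve`, `least_squares_row`, and `als` are hypothetical.

```python
import random


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[k]] for k, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (M[r][n] - sum(M[r][c] * x[c] for c in range(r + 1, n))) / M[r][r]
    return x


def least_squares_row(pairs, lam, f):
    """Best vector for one row, given (fixed_vector, rating) pairs."""
    A = [[lam * len(pairs) if a == b else 0.0 for b in range(f)] for a in range(f)]
    rhs = [0.0] * f
    for vec, r in pairs:
        for a in range(f):
            rhs[a] += r * vec[a]
            for b in range(f):
                A[a][b] += vec[a] * vec[b]
    return solve(A, rhs)


def objective(known, Q, P, lam):
    return sum((r - dot(Q[i], P[u])) ** 2 + lam * (dot(Q[i], Q[i]) + dot(P[u], P[u]))
               for (u, i), r in known.items())


def als(known, f=2, lam=0.1, sweeps=10, seed=0):
    rng = random.Random(seed)
    users = sorted({u for u, _ in known})
    items = sorted({i for _, i in known})
    P = {u: [rng.uniform(0.1, 1.0) for _ in range(f)] for u in users}
    Q = {i: [rng.uniform(0.1, 1.0) for _ in range(f)] for i in items}
    history = [objective(known, Q, P, lam)]
    for _ in range(sweeps):
        for i in items:  # all p_u fixed: recompute each q_i independently
            Q[i] = least_squares_row([(P[u], r) for (u, j), r in known.items() if j == i], lam, f)
        for u in users:  # all q_i fixed: recompute each p_u independently
            P[u] = least_squares_row([(Q[j], r) for (v, j), r in known.items() if v == u], lam, f)
        history.append(objective(known, Q, P, lam))
    return P, Q, history


known = {("u1", "i1"): 5.0, ("u1", "i2"): 3.0, ("u2", "i1"): 4.0, ("u2", "i3"): 1.0}
P, Q, history = als(known)
```

The two inner loops touch each row independently, which is the property that makes the method easy to parallelize. `history` records the objective after every sweep and should never increase.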
Adding Biases
rating values are also affected by biases that are independent of any user-item interaction
a first-order approximation of the bias involved in rating $r_{ui}$ is
$$b_{ui} = \mu + b_i + b_u \tag{3}$$
$\mu$ denotes the overall average rating; $b_u$ and $b_i$ are the observed deviations of user $u$ and item $i$, respectively, from that average
therefore,
$$\hat{r}_{ui} = \mu + b_i + b_u + q_i^T p_u \tag{4}$$
equation (2) also becomes
$$\min_{q^*, p^*, b^*} \sum_{(u,i) \in \kappa} (r_{ui} - \mu - b_u - b_i - q_i^T p_u)^2 + \lambda (\|q_i\|^2 + \|p_u\|^2 + b_u^2 + b_i^2) \tag{5}$$
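A short sketch of equations (3) and (4), together with one gradient step on equation (5), may help here. The slides do not spell out the bias update, so the update below is an assumption obtained by applying the same SGD pattern to the bias terms. All numbers and function names are illustrative.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def baseline(mu, b_i, b_u):
    """Equation (3): the bias-only estimate."""
    return mu + b_i + b_u


def predict(mu, b_i, b_u, q_i, p_u):
    """Equation (4): baseline plus the user-item interaction."""
    return baseline(mu, b_i, b_u) + dot(q_i, p_u)


def sgd_step(r_ui, mu, b_i, b_u, q_i, p_u, gamma, lam):
    """One assumed SGD step on equation (5) for a single rating."""
    e = r_ui - predict(mu, b_i, b_u, q_i, p_u)
    new_b_i = b_i + gamma * (e - lam * b_i)
    new_b_u = b_u + gamma * (e - lam * b_u)
    new_q = [q + gamma * (e * p - lam * q) for q, p in zip(q_i, p_u)]
    new_p = [p + gamma * (e * q - lam * p) for q, p in zip(q_i, p_u)]
    return new_b_i, new_b_u, new_q, new_p


# Illustrative numbers: average 3.7, item rated 0.5 above average,
# user rating 0.3 below average.
mu, b_i, b_u = 3.7, 0.5, -0.3
q_i, p_u = [0.1, 0.2], [0.3, -0.1]

base = baseline(mu, b_i, b_u)
before = predict(mu, b_i, b_u, q_i, p_u)
b_i2, b_u2, q_i2, p_u2 = sgd_step(5.0, mu, b_i, b_u, q_i, p_u, gamma=0.05, lam=0.02)
after = predict(mu, b_i2, b_u2, q_i2, p_u2)
```

With these numbers the baseline estimate is 3.9, and a single step on an observed rating of 5 moves the prediction closer to 5.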
Additional Input Sources
the cold start problem can arise when a user supplies very few ratings, making it difficult to draw conclusions about their taste
behavioural information such as purchase and browsing history can be used as implicit feedback
let $N(u)$ denote the set of items for which user $u$ expressed an implicit preference
a new set of item factors is introduced, where item $i$ is associated with $x_i \in \mathbb{R}^f$
a user who showed a preference for items in $N(u)$ is characterized by the vector
$$\sum_{i \in N(u)} x_i$$
normalizing the sum, we have
$$|N(u)|^{-0.5} \sum_{i \in N(u)} x_i$$
another information source is user attributes, e.g. demographics such as gender, age, and income level
let $A(u)$ denote the set of attributes of user $u$
a distinct factor vector $y_a \in \mathbb{R}^f$ corresponds to each attribute, so a user is described through the set of user-associated attributes:
$$\sum_{a \in A(u)} y_a$$
the matrix factorization model should integrate all signal sources, with an enhanced user representation:
$$\hat{r}_{ui} = \mu + b_i + b_u + q_i^T \Big[ p_u + |N(u)|^{-0.5} \sum_{i \in N(u)} x_i + \sum_{a \in A(u)} y_a \Big] \tag{6}$$
items can get a similar treatment
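The enhanced user representation in equation (6) can be sketched as below. The factor values, the attribute name `age_25_34`, and the function names are hypothetical and serve only to show how the three signal sources are combined.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def enhanced_user_vector(p_u, implicit_items, X, attributes, Y):
    """Bracketed term of equation (6): p_u + |N(u)|^-0.5 * sum(x_i) + sum(y_a)."""
    vec = list(p_u)
    if implicit_items:
        scale = len(implicit_items) ** -0.5
        for i in implicit_items:
            for k in range(len(vec)):
                vec[k] += scale * X[i][k]
    for a in attributes:
        for k in range(len(vec)):
            vec[k] += Y[a][k]
    return vec


def predict(mu, b_i, b_u, q_i, user_vec):
    return mu + b_i + b_u + dot(q_i, user_vec)


# Hypothetical data with f = 2.
p_u = [0.5, 0.0]
X = {name: [0.1, 0.2] for name in ("i1", "i2", "i3", "i4")}  # implicit item factors x_i
Y = {"age_25_34": [0.1, -0.1]}                               # attribute factors y_a
N_u = ["i1", "i2", "i3", "i4"]                               # items with implicit preference
A_u = ["age_25_34"]                                          # user attributes

user_vec = enhanced_user_vector(p_u, N_u, X, A_u, Y)
rating = predict(mu=3.5, b_i=0.2, b_u=-0.1, q_i=[1.0, 2.0], user_vec=user_vec)
```

A user with no explicit ratings still gets a usable vector from the implicit and attribute terms, which is how this representation eases the cold start problem.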
Temporal Dynamics
in reality, customers' inclinations evolve, leading them to redefine their taste
it is therefore important to accommodate these temporal effects, which reflect the dynamic, time-drifting nature of user-item interactions
the following terms vary over time: item biases, $b_i(t)$; user biases, $b_u(t)$; and user preferences, $p_u(t)$
equation (4) therefore becomes
$$\hat{r}_{ui}(t) = \mu + b_i(t) + b_u(t) + q_i^T p_u(t) \tag{7}$$
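The slides do not say how the time-dependent terms are parameterized, so the following is only one possible sketch of equation (7). It assumes an item bias with a per-time-bin offset and a user bias and preference vector that drift linearly around the user's mean rating date. Every field name and number here is hypothetical.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def predict_at(t, mu, item, user, q_i):
    """Evaluate equation (7) at time t (in days) under the assumed parameterization."""
    # Assumed item bias: static part plus an offset for the time bin containing t.
    b_i_t = item["b"] + item["bin_offsets"].get(int(t // item["bin_width"]), 0.0)
    # Assumed user bias and preferences: linear drift around the mean rating date.
    dt = t - user["t_mean"]
    b_u_t = user["b"] + user["alpha"] * dt
    p_u_t = [p + d * dt for p, d in zip(user["p"], user["drift"])]
    return mu + b_i_t + b_u_t + dot(q_i, p_u_t)


item = {"b": 0.2, "bin_offsets": {0: 0.0, 1: -0.1}, "bin_width": 30}
user = {"b": 0.1, "alpha": 0.01, "t_mean": 20, "p": [1.0, 0.0], "drift": [0.0, 0.01]}
q_i = [0.5, 1.0]  # item factors are kept static, as in equation (7)

early = predict_at(10, 3.6, item, user, q_i)
late = predict_at(40, 3.6, item, user, q_i)
```

The same user-item pair receives different predictions on day 10 and day 40, which is the behaviour equation (7) is meant to capture.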
Varying Confidence Level
one-off factors such as massive advertising can influence observed ratings without reflecting long-term characteristics
hence the need for a weighting scheme, or confidence, attached to each observation
confidence can stem from available numerical values that describe the frequency of actions, e.g. how much time the user watched a show
in matrix factorization, less weight is given to less meaningful actions
if the confidence in observing $r_{ui}$ is denoted as $c_{ui}$, then the model enhances equation (5) to account for confidence as follows
$$\min_{q^*, p^*, b^*} \sum_{(u,i) \in \kappa} c_{ui} (r_{ui} - \mu - b_u - b_i - q_i^T p_u)^2 + \lambda (\|q_i\|^2 + \|p_u\|^2 + b_u^2 + b_i^2) \tag{8}$$
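As a small worked instance of equation (8), the sketch below evaluates the confidence-weighted objective for two ratings. It assumes the confidence multiplies only the squared error and that the penalty is counted once per known rating. The data and the name `weighted_objective` are hypothetical.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def weighted_objective(known, conf, mu, b_user, b_item, Q, P, lam):
    """Equation (8): each squared error is scaled by its confidence c_ui."""
    total = 0.0
    for (u, i), r in known.items():
        err = r - mu - b_user[u] - b_item[i] - dot(Q[i], P[u])
        penalty = dot(Q[i], Q[i]) + dot(P[u], P[u]) + b_user[u] ** 2 + b_item[i] ** 2
        total += conf[(u, i)] * err ** 2 + lam * penalty
    return total


# Hypothetical data: the first rating is trusted more than the second.
known = {("u1", "i1"): 5.0, ("u1", "i2"): 2.0}
conf = {("u1", "i1"): 2.0, ("u1", "i2"): 0.5}
b_user = {"u1": 0.5}
b_item = {"i1": -0.5, "i2": 0.0}
P = {"u1": [1.0, 1.0]}
Q = {"i1": [1.0, 0.0], "i2": [0.0, 1.0]}

loss = weighted_objective(known, conf, 3.0, b_user, b_item, Q, P, lam=0.1)
```

The second rating has the larger raw error, but its low confidence of 0.5 halves its contribution to the loss.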
Netflix Prize Competition
in 2006, Netflix announced a contest to improve the state of its recommender system
the training data comprised more than 100 million ratings, spanning about 500,000 anonymous customers and their ratings of more than 17,000 movies
each movie was rated on a scale of 1 to 5 stars
the test data comprised about 3 million ratings
the target was a root-mean-square error (RMSE) at least 10 percent better than that of Netflix's own algorithm
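To illustrate the evaluation criterion, the sketch below computes RMSE for two sets of predictions and the relative improvement of one over the other. The ratings and predictions are made-up toy values, not Netflix data, and the function names are hypothetical.

```python
from math import sqrt


def rmse(predicted, actual):
    """Root-mean-square error between predicted and actual ratings."""
    squared = [(p - a) ** 2 for p, a in zip(predicted, actual)]
    return sqrt(sum(squared) / len(squared))


def improvement(baseline_rmse, candidate_rmse):
    """Relative RMSE reduction of the candidate over the baseline."""
    return (baseline_rmse - candidate_rmse) / baseline_rmse


actual = [5.0, 3.0, 4.0, 1.0]
baseline_pred = [4.0, 4.0, 4.0, 2.0]    # stand-in for the existing system
candidate_pred = [4.5, 3.5, 4.0, 1.5]   # stand-in for a contest entry

gain = improvement(rmse(baseline_pred, actual), rmse(candidate_pred, actual))
meets_target = gain >= 0.10  # the contest asked for at least 10 percent
```

Because RMSE squares each error before averaging, large individual misses are penalized more heavily than many small ones.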
Conclusion
matrix factorization techniques have become a dominant
methodology within collaborative filtering recommenders
experience with the Netflix competition has shown that they
deliver accuracy superior to classical nearest-neighbor
techniques
they integrate many crucial aspects of the data, such as
multiple forms of feedback, temporal dynamics and confidence
levels.
Reference
Y. Koren, R. Bell and C. Volinsky: Matrix Factorization Techniques
for Recommender Systems, AT&T Labs-Research, 2009
THANK YOU!
Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems

More Related Content

PPTX
Multiclass classification of imbalanced data
PDF
Data Clustering with R
PPTX
Cannonical correlation
PPTX
Machine learning 101
PDF
Module 3: Linear Regression
PDF
Recommendation System Explained
PPT
4.4 text mining
PPTX
Supervised learning
Multiclass classification of imbalanced data
Data Clustering with R
Cannonical correlation
Machine learning 101
Module 3: Linear Regression
Recommendation System Explained
4.4 text mining
Supervised learning

What's hot (20)

PPTX
Machine Learning with R
ODP
Multiple Linear Regression II and ANOVA I
PDF
Hyperparameter Optimization for Machine Learning
PPTX
Regression analysis on SPSS
PPTX
Ml3 logistic regression-and_classification_error_metrics
PPTX
Optimization/Gradient Descent
PDF
Talent Search and Recommendation Systems at LinkedIn: Practical Challenges an...
PPTX
decision tree regression
PPT
Pagerank Algorithm Explained
PDF
Graph lecture
PDF
Ordinal Logistic Regression
PPTX
Branch And Bound and Beam Search Feature Selection Algorithms
PDF
Applied Machine Learning for Ranking Products in an Ecommerce Setting
PDF
Lecture 2 Basic Concepts in Machine Learning for Language Technology
PPTX
Singular Value Decomposition (SVD).pptx
PDF
Python NumPy Tutorial | NumPy Array | Edureka
PPTX
Presentation on Regression Analysis
PPTX
Soziale Interaktion als Erfolgsfaktor des Lernens mit digitalen Medien
PDF
Confusion Matrix Explained
PDF
An Introduction to Deep Learning
Machine Learning with R
Multiple Linear Regression II and ANOVA I
Hyperparameter Optimization for Machine Learning
Regression analysis on SPSS
Ml3 logistic regression-and_classification_error_metrics
Optimization/Gradient Descent
Talent Search and Recommendation Systems at LinkedIn: Practical Challenges an...
decision tree regression
Pagerank Algorithm Explained
Graph lecture
Ordinal Logistic Regression
Branch And Bound and Beam Search Feature Selection Algorithms
Applied Machine Learning for Ranking Products in an Ecommerce Setting
Lecture 2 Basic Concepts in Machine Learning for Language Technology
Singular Value Decomposition (SVD).pptx
Python NumPy Tutorial | NumPy Array | Edureka
Presentation on Regression Analysis
Soziale Interaktion als Erfolgsfaktor des Lernens mit digitalen Medien
Confusion Matrix Explained
An Introduction to Deep Learning
Ad

Viewers also liked (17)

PDF
آموزش محاسبات عددی - بخش دوم
PDF
Neighbor methods vs matrix factorization - case studies of real-life recommen...
PDF
Intro to Factorization Machines
PPTX
Factorization Machines with libFM
PDF
Collaborative Filtering with Spark
PPT
Matrix factorization
PPTX
Recommender system introduction
PPT
Lecture 6 lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
PDF
Nonnegative Matrix Factorization
PPTX
Recommender Systems
PDF
Introduction to Matrix Factorization Methods Collaborative Filtering
PDF
Beginners Guide to Non-Negative Matrix Factorization
PDF
Recommender Systems
PPT
Recommendation system
PPTX
Collaborative Filtering Recommendation System
PDF
Building a Recommendation Engine - An example of a product recommendation engine
PDF
Recommender system algorithm and architecture
آموزش محاسبات عددی - بخش دوم
Neighbor methods vs matrix factorization - case studies of real-life recommen...
Intro to Factorization Machines
Factorization Machines with libFM
Collaborative Filtering with Spark
Matrix factorization
Recommender system introduction
Lecture 6 lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
Nonnegative Matrix Factorization
Recommender Systems
Introduction to Matrix Factorization Methods Collaborative Filtering
Beginners Guide to Non-Negative Matrix Factorization
Recommender Systems
Recommendation system
Collaborative Filtering Recommendation System
Building a Recommendation Engine - An example of a product recommendation engine
Recommender system algorithm and architecture
Ad

Similar to Matrix Factorization Technique for Recommender Systems (20)

PDF
International Journal of Computational Engineering Research(IJCER)
PDF
Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...
PDF
Pay-as-you-go Reconciliation in Schema Matching Networks
PDF
IRJET- Boosting Response Aware Model-Based Collaborative Filtering
PDF
Multimodal interactions in recommender systems (Bracis 2014)
PDF
AIRLINE FARE PRICE PREDICTION
PDF
Selecting Best Tractor Ranking Wise by Software using MADM(Multiple –Attribut...
PDF
Next directions in Mahout's recommenders
PDF
Water Quality Index Calculation of River Ganga using Decision Tree Algorithm
PDF
Ankit presentation
PDF
IRJET- Online Course Recommendation System
PDF
CSE545_Porject
PDF
Probabilistic Collaborative Filtering with Negative Cross Entropy
PDF
IRJET- Performance Evaluation of Various Classification Algorithms
PDF
IRJET- Performance Evaluation of Various Classification Algorithms
PDF
IRJET- Analysis of Vehicle Number Plate Recognition
PDF
IRJET - House Price Prediction using Machine Learning and RPA
PDF
Training language models to follow instructions with human feedback.pdf
PDF
Cross Domain Recommender System using Machine Learning and Transferable Knowl...
PDF
Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...
International Journal of Computational Engineering Research(IJCER)
Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...
Pay-as-you-go Reconciliation in Schema Matching Networks
IRJET- Boosting Response Aware Model-Based Collaborative Filtering
Multimodal interactions in recommender systems (Bracis 2014)
AIRLINE FARE PRICE PREDICTION
Selecting Best Tractor Ranking Wise by Software using MADM(Multiple –Attribut...
Next directions in Mahout's recommenders
Water Quality Index Calculation of River Ganga using Decision Tree Algorithm
Ankit presentation
IRJET- Online Course Recommendation System
CSE545_Porject
Probabilistic Collaborative Filtering with Negative Cross Entropy
IRJET- Performance Evaluation of Various Classification Algorithms
IRJET- Performance Evaluation of Various Classification Algorithms
IRJET- Analysis of Vehicle Number Plate Recognition
IRJET - House Price Prediction using Machine Learning and RPA
Training language models to follow instructions with human feedback.pdf
Cross Domain Recommender System using Machine Learning and Transferable Knowl...
Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...

Recently uploaded (20)

PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PPTX
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PPTX
Institutional Correction lecture only . . .
PDF
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PDF
01-Introduction-to-Information-Management.pdf
PPTX
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PDF
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PPTX
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PPTX
Cell Types and Its function , kingdom of life
PPTX
PPH.pptx obstetrics and gynecology in nursing
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
Classroom Observation Tools for Teachers
Renaissance Architecture: A Journey from Faith to Humanism
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
Microbial diseases, their pathogenesis and prophylaxis
FourierSeries-QuestionsWithAnswers(Part-A).pdf
Institutional Correction lecture only . . .
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
102 student loan defaulters named and shamed – Is someone you know on the list?
01-Introduction-to-Information-Management.pdf
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
human mycosis Human fungal infections are called human mycosis..pptx
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Cell Types and Its function , kingdom of life
PPH.pptx obstetrics and gynecology in nursing
O5-L3 Freight Transport Ops (International) V1.pdf
Pharmacology of Heart Failure /Pharmacotherapy of CHF
Classroom Observation Tools for Teachers

Matrix Factorization Technique for Recommender Systems

  • 1. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion MATRIX FACTORIZATION TECHNIQUE FOR RECOMMENDER SYSTEMS Oluwashina Aladejubelo Universite Joseph Fourier, Grenoble, France June 6, 2015 Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 2. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion About Me Bachelor of Science, Ambrose Alli University, Nigeria (2004-2008) IT Business Analyst, Virgin Nigeria Airlines (2009-2011) Team Lead/Software Architect, Speckless Innovations Limited (2011-2014) Master of Informatics (M2 MOSIG), Universit Joseph Fourier, Grenoble (2014-2015) Master Thesis on ”Distributed Large-Scale Learning” with Pr. Massih-Reza Amini. Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 3. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion Overview 1 Introduction 2 Matrix Factorization Methods 3 Netflix Prize Competition 4 Conclusion Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 4. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion 1 Introduction Recommender Systems Content Filtering Approach Collaborative Filtering Approach Content vs Collaborative Filtering 2 Matrix Factorization Methods Matrix Factorization Model (MFM) Stochastic Gradient Descent Alternating Least Squares Adding Biases Additional Input Source Temporal Dynamics Varying confidence levels 3 Netflix Prize Competition 4 Conclusion Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 5. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion Recommender Systems Recommender systems analyze patterns of user interest in products to provide personalized recommendations They seek to predict the rating or preference that user would give to an item Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 6. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion Recommender Systems Such systems are very useful for entertainment products such as movies, music, and TV shows. Many customers will view the same movie and each customer is likely to view numerous different movies. Huge volume of data arise from customer feedbacks which can be analyzed to provide recommendations Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 7. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion Content Filtering Approach creating profile for each user or product to characterize its nature. programs associate users with matching products. it requires gathering external information that may not be available Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 8. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion Collaborative Filtering Approach depends on past user behaviour, e.g. previous transactions or product rating does not rely on creation of explicit profiles Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 9. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion Collaborative Filtering Approach the primary areas of collaborative filtering are neighborhood methods and latent factor models neighborhood is based on computing the relationships between items or users latent factor models tries to explain by characterizing both items and users on say, 20 to 100 factors inferred from the ratings patterns Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 10. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion Content vs Collaborative Filtering Collaborative filtering address data aspects that are difficult to profile. it is generally more accurate suffers from cold startup problem (new product / new user) in which case content filtering is better Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 11. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion 1 Introduction Recommender Systems Content Filtering Approach Collaborative Filtering Approach Content vs Collaborative Filtering 2 Matrix Factorization Methods Matrix Factorization Model (MFM) Stochastic Gradient Descent Alternating Least Squares Adding Biases Additional Input Source Temporal Dynamics Varying confidence levels 3 Netflix Prize Competition 4 Conclusion Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 12. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion Matrix Factorization Model (MFM) some of the most successful realizations of latent factor models are based on matrix factorization it characterizes both items and users by vectors of factors inferred from item rating patterns high correspondence between item and user factors leads to a recommendation Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 13. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion Matrix Factorization Model (MFM) MFM maps both users & items to a joint latent factor space of dimensionality f the user-item interactions are modeled as inner products in space f each item i is associated with a vector qi ∈ Rf each user u is associated with a vector pu ∈ Rf Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 14. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion Matrix Factorization Model (MFM) the approximate user rating is given by ˆrui = qT i Pu (1) carelessly addressing only the relatively few known entries is highly prone to overfitting observed ratings can be modeled directly with regularization as follows minq∗,p∗ (u,i)∈κ (rui − qT i pu)2 + λ(||qi ||2 + ||pu||2 ) (2) κ is a set of (u, i) pairs for which rui is known Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 15. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion Stochastic Gradient Descent (SGD) - Simon Funk; 2006 SGD approach can be used for solving the equation (2) For each given training case, the system predicts rui and computes the prediction error eui = rui − qT i pu it modifies the parameters by a magnitude proportional to γ in the opposite direction of the gradient, yielding∈ Rf qi ← qi + γ.(eui .pu − γ.qi ) pu ← pu + γ.(eui .qi − γ.pu) combines ease with a relatively fast runtime Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 16. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion Alternating least squares Because both qi and pu are unknown, equation (2) is not convex if we fix one of the unknowns the quadratic optimization can be solved optimally when all pu are fixed the system recomputes the qi by solving a least-squares problem and vice versa each step decreases the minimization problem until convergence massively parallelizable Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 17. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion Adding Biases rating values are also affected by biases independent of any interaction a first-order approximation of the bias involved in rating rui is bui = µ + bi + bu (3) µ denotes the average rating, bu and bi are the observed deviations of user u on item i therefore, ˆr = µ + bi + bu + qT i pu (4) equation(2) also becomes, minq∗,p∗,b∗ (u,i)∈κ (rui −µ−bu−bi −qT i pu)2 +λ(||qi ||2 +||pu||2 +b2 u+b2 i ) (5) Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 18. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion Additional Input Sources cold start problem could be as a result of user supplying very few ratings-difficulty to conclude on their taste behavioural information such as purchase and browsing history can be used for implicit feedback let’s say N(u) denotes the set of itels for which user u expressed an implicit preference a new set of item factors is given by xi ∈ Rf Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 19. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion Additional Input Sources a user who showed a preference for items in N(u) is characterized by the vector i∈N(u) xi normalizing the sum we have, |N(u)|−0.5 i∈N(u) xi another information source is known as user attribute, e.g. demographics, gender, age, income level and so on let A(u) denote set of attributes of a user u Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 20. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion Additional Input Sources a distinct factor vector ya ∈ Rf corresponds to each attribute to describe a user through the set of user-associated attributes: a∈A(u) ya the matrix factorization model should intergrate all signal sources, with ehanced representation: ˆrui = µ + bi + bu + qT i [pu + |N(u)−0.5 i∈N(u) xi + a∈A(u) ya] (6) items can get a similar treatment Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 21. Introduction Matrix Factorization Methods Netflix Prize Competition Conclusion Temporal Dynamics in reality customers’ inclinations evolve, leading them to redefine their taste it is therefore important to accommodate this temporal effects reflecting the dynamic, time-drifting nature of user-item interactions the following terms vary over time: item biases, bi (t); user biases, bu(t); and user preferences, pu(t) equation (4) therefore becomes, ˆr(t) = µ + bi (t) + bu(t) + qT i pu(t) (7) Oluwashina Aladejubelo Matrix Factorization Techniques for Recommender Systems
  • 22. Varying Confidence Level: Other factors, such as massive advertising, can influence observed ratings in ways that do not reflect long-term characteristics, hence the need for a weighting scheme, or confidence. Confidence can stem from available numerical values that describe the frequency of actions, e.g. how much time the user watched a show. In matrix factorization, less weight is given to less meaningful actions.
  • 23. Varying Confidence Level: If the confidence in observing $r_{ui}$ is denoted $c_{ui}$, then the model enhances equation (5) to account for confidence as follows: $\min_{q^*, p^*, b^*} \sum_{(u,i) \in \kappa} c_{ui} (r_{ui} - \mu - b_u - b_i - q_i^T p_u)^2 + \lambda (\|q_i\|^2 + \|p_u\|^2 + b_u^2 + b_i^2)$ (8).
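The confidence-weighted objective in equation (8) can be evaluated with a short function. This is a minimal sketch, not a training routine: the name `weighted_objective` and the tuple layout `(u, i, r_ui, c_ui)` are hypothetical, and the regularization term is assumed to be applied once per observed rating, inside the sum.

```python
def dot(a, b):
    """Inner product of two equal-length lists."""
    return sum(x * y for x, y in zip(a, b))


def weighted_objective(ratings, mu, b_u, b_i, p, q, lam):
    """Sketch of the objective in equation (8).

    ratings: list of (u, i, r_ui, c_ui) tuples for the observed pairs
    b_u, b_i: lists of user and item biases, indexed by u and i
    p, q:     lists of user and item factor vectors, indexed by u and i
    lam:      regularization strength (lambda)
    """
    total = 0.0
    for u, i, r, c in ratings:
        err = r - mu - b_u[u] - b_i[i] - dot(q[i], p[u])
        reg = dot(q[i], q[i]) + dot(p[u], p[u]) + b_u[u] ** 2 + b_i[i] ** 2
        # The squared error is scaled by the confidence c_ui;
        # the regularization term is not (an assumption, see lead-in).
        total += c * err ** 2 + lam * reg
    return total


ratings = [(0, 0, 4.0, 1.0),   # fitted exactly, full confidence
           (0, 1, 4.0, 0.5)]   # error of 1.0, half confidence
b_u = [0.0]
b_i = [0.5, 0.0]
p = [[1.0, 0.0]]
q = [[0.5, 0.0], [0.0, 1.0]]

loss_no_reg = weighted_objective(ratings, 3.0, b_u, b_i, p, q, lam=0.0)
loss_reg = weighted_objective(ratings, 3.0, b_u, b_i, p, q, lam=0.1)
print(loss_no_reg, loss_reg)
```

Setting every $c_{ui}$ to 1 gives back the unweighted objective under the same assumption about the regularization term.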
  • 25. Netflix Prize Competition: In 2006, Netflix announced a contest to improve the state of its recommender system. The training data comprised 100 million ratings spanning 500,000 anonymous customers and 17,000 movies. Each movie was rated on a scale of 1 to 5 stars. The test data comprised 3 million ratings. The goal was a root-mean-square error (RMSE) at least 10 percent better than that of Netflix's own algorithm.
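The evaluation criterion can be made concrete with a small sketch of RMSE and the relative improvement over a baseline. The function names and the toy numbers are illustrative assumptions, not contest data.

```python
import math


def rmse(predicted, actual):
    """Root-mean-square error between two equal-length lists of ratings."""
    squared = [(p - a) ** 2 for p, a in zip(predicted, actual)]
    return math.sqrt(sum(squared) / len(squared))


def improvement(baseline_rmse, new_rmse):
    """Relative RMSE improvement over the baseline (0.10 means 10 percent)."""
    return (baseline_rmse - new_rmse) / baseline_rmse


toy_error = rmse([2.0, 4.0], [4.0, 2.0])   # both predictions are off by 2 stars
gain = improvement(1.0, 0.85)               # hypothetical baseline and new RMSE
print(toy_error, gain, gain >= 0.10)
```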
  • 28. Conclusion: Matrix factorization techniques have become a dominant methodology within collaborative filtering recommenders. Experience with the Netflix competition has shown that they deliver accuracy superior to classical nearest-neighbor techniques. They integrate many crucial aspects of the data, such as multiple forms of feedback, temporal dynamics, and confidence levels.
  • 29. Reference: Y. Koren, R. Bell and C. Volinsky, "Matrix Factorization Techniques for Recommender Systems", AT&T Labs-Research, 2009.
  • 30. THANK YOU!