Shallow & Deep Latent Models
for Recommender Systems
Anoop Deoras, Dawen Liang
PRS Workshop, Netflix
06/08/2018
@adeoras, @dawen_liang
Theme of the talk
● Personalization and Recommendations at Netflix
● Discuss evolution of latent models in the Recommender System space
● Showcase some experimental results and interesting findings
● Take away points
Personalization
● Recommender systems are a means to an end.
● Our primary goal:
○ Maximize Netflix members’ enjoyment of the shows they select
■ Enjoyment integrated over time
○ Minimize the time it takes to find those shows
■ Interaction cost integrated over time
● Personalization
● How?
From what shows to recommend
Ordering of the titles in each row is personalized
... To how to construct the page
Selection and placement of the row types is personalized
... To what images to select
Personalized images (Profile 1 vs. Profile 2)
Personalization
● When the catalog size is very large, recommendations are the only saving grace.
● A good recommender system should consider:
○ What is recommended
○ How it is recommended
○ When it is recommended
○ Where it is recommended
Personalization
● We try to model
○ User’s taste
○ Context
■ Time
■ Device
■ Country
■ Language
■ …
○ Difference in local tastes
■ What is popular in the US may not be popular in India
■ Not available != Not Popular
Latent Models for Recommendation
Latent Models
● Shallow
○ Latent Factor Models -- Matrix Factorization (MF)
○ Latent Dirichlet Allocation (LDA)
● Deep
○ Variational Autoencoder
○ Feedforward Neural Networks
○ Sequential Neural Networks (RNNs)
○ Convolutional Neural Networks
Shallow Models
Latent Factor Model (Explicit Feedback)
[Figure: sparse observed-ratings matrix (#users × #items, example entries from 1.0 to 5.0) ≈ user latent factors (#users × K) * item latent factors (K × #items)]
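The factorization in the figure can be sketched in a few lines. This is a minimal illustration, not the method used in the talk: it fits the two factor matrices by stochastic gradient descent on the observed ratings only. The rating positions, `factorize`, `mse`, and every hyperparameter are assumptions made up for this example.

```python
import random

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def mse(ratings, U, V):
    """Mean squared error over the observed (user, item, rating) triples."""
    return sum((r - dot(U[u], V[i])) ** 2 for u, i, r in ratings) / len(ratings)

def factorize(ratings, n_users, n_items, k=2, lr=0.05, reg=0.01, epochs=200, seed=0):
    """Fit user factors U (n_users x k) and item factors V (n_items x k) with SGD."""
    rng = random.Random(seed)
    U = [[rng.gauss(0.0, 0.1) for _ in range(k)] for _ in range(n_users)]
    V = [[rng.gauss(0.0, 0.1) for _ in range(k)] for _ in range(n_items)]
    for _ in range(epochs):
        for u, i, r in ratings:
            err = r - dot(U[u], V[i])  # residual on one observed rating
            for f in range(k):
                uf, vf = U[u][f], V[i][f]
                U[u][f] += lr * (err * vf - reg * uf)
                V[i][f] += lr * (err * uf - reg * vf)
    return U, V

# Hypothetical placement of the six ratings shown on the slide.
ratings = [(0, 0, 1.0), (0, 1, 2.0), (1, 0, 3.0), (1, 1, 4.0), (2, 1, 3.0), (2, 2, 5.0)]
U0, V0 = factorize(ratings, 3, 3, epochs=0)  # untrained factors, for comparison
U, V = factorize(ratings, 3, 3)
print(mse(ratings, U0, V0), mse(ratings, U, V))
```

Unobserved entries are simply skipped here, which is the usual treatment for explicit feedback; a missing rating is not read as a zero.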
Latent Factor Model (Implicit Feedback)
[Figure: binary observed-plays matrix (#users × #items) ≈ user latent factors (#users × K) * item latent factors (K × #items)]
Gaussian matrix factorization
[Figure: binary observed-plays matrix (#users × #items) ≈ user latent factors (#users × K) * item latent factors (K × #items), annotated with a confidence term]
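The slide does not spell out how the confidence enters the objective. One common formulation for implicit feedback, assumed here, weights the squared error of every user-item cell by a confidence that is larger for played items, for example c = 1 + alpha * play. The function name and the value of `alpha` are hypothetical.

```python
def confidence_weighted_loss(plays, preds, alpha=2.0):
    """Sum of c * (play - prediction)^2 over every cell, with c = 1 + alpha * play.

    Unlike the explicit-feedback case, zeros are treated as (low-confidence)
    observations rather than as missing values.
    """
    loss = 0.0
    for play_row, pred_row in zip(plays, preds):
        for p, y in zip(play_row, pred_row):
            c = 1.0 + alpha * p  # assumed confidence form, not taken from the slide
            loss += c * (p - y) ** 2
    return loss

plays = [[1, 0]]
preds = [[0.5, 0.5]]
print(confidence_weighted_loss(plays, preds))
```

With `alpha=0` every cell counts equally; raising `alpha` makes an error on a played item cost more than the same error on an unplayed one.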
Topic Models (Latent Dirichlet Allocation)
[Figure: binary observed-plays matrix (#users × #items) ≈ user latent factors (#users × K) * item latent factors (K × #items), annotated with the number of plays of user ‘u’]
Deep Nonlinear Models
Deep Latent Factor Model
[Figure: binary observed-plays matrix (#users × #items) → DNN → user latent factors (#users × K)]
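Variational Autoencoders
[Figure: the encoder f_ψ (inference model) maps a user's observed plays r_u to the latent taste vector z_u; the decoder f_θ (generative model) maps z_u back to r_u; encoder and decoder are neural networks (DNN)]
Liang et al. (2018), Variational Autoencoders for Collaborative Filtering, WWW.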
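The equations for the two models did not survive in this transcript. As a reference point, the formulation in the cited paper (Liang et al., 2018) can be written as follows, with the paper's symbols renamed to match the slide's labels (r_u for a user's plays, f_ψ for the encoder). Treat the exact notation as an assumption about what the slide displayed; N_u denotes the total number of plays of user u.

```latex
% Generative model (decoder f_\theta)
z_u \sim \mathcal{N}(0, I_K), \qquad
\pi(z_u) = \operatorname{softmax}\big(f_\theta(z_u)\big), \qquad
r_u \sim \operatorname{Mult}\big(N_u, \pi(z_u)\big)

% Inference model (encoder f_\psi)
q_\psi(z_u \mid r_u) = \mathcal{N}\big(\mu_\psi(r_u), \operatorname{diag}(\sigma_\psi^2(r_u))\big),
\qquad [\mu_\psi(r_u), \sigma_\psi(r_u)] = f_\psi(r_u)
```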
Why Multinomial?
● Commonly used in Language Models and Economics
● Close proxy to the top-N ranking loss
○ The likelihood (cross-entropy) rewards the model for putting probability mass on the non-zero entries
○ The items have to compete for a limited budget (since the item probabilities must sum to 1)
● Effectively ranking non-zero entries higher
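The budget argument is easy to check numerically. The sketch below, with made-up logits, computes the multinomial log-likelihood of one row of the play matrix under a softmax. Because the probabilities sum to 1, mass placed on unplayed items is taken away from played ones and lowers the likelihood. Function names are illustrative only.

```python
import math

def softmax(logits):
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def multinomial_log_likelihood(plays, logits):
    """sum_i plays[i] * log pi_i, dropping the multinomial coefficient,
    which does not depend on the logits."""
    probs = softmax(logits)
    return sum(x * math.log(p) for x, p in zip(plays, probs))

plays = [1, 0, 0, 1, 0]  # one row of the binary play matrix
# Logits that favour the played items vs. logits that favour the unplayed ones.
good = multinomial_log_likelihood(plays, [2.0, 0.0, 0.0, 2.0, 0.0])
bad = multinomial_log_likelihood(plays, [0.0, 2.0, 2.0, 0.0, 2.0])
print(good, bad)
```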
Why VAEs (or rather, Bayesian)?
● Generalize linear latent factor models:
○ Recover LDA as a special linear case
● No ‘Fold-In’ necessary
○ Only evaluate inference and generative functions (amortized inference)
● Per user, RecSys is more of a “small data” than a “big data” problem
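To make the "no fold-in" point concrete: once the encoder and decoder are trained, scoring a new user is a single forward pass, with no per-user optimization. The toy sketch below assumes hand-set weights (`ENC_W`, `DEC_W`), a tanh encoder, and a point estimate of z_u without the variance term; none of this comes from the talk.

```python
import math

# Hypothetical "trained" weights: items 0-1 load on factor 0, items 2-3 on factor 1.
ENC_W = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]  # items x K
DEC_W = [[2.0, 2.0, 0.0, 0.0], [0.0, 0.0, 2.0, 2.0]]      # K x items

def encode(plays):
    """Inference function: play vector -> point estimate of the taste vector z_u."""
    k = len(ENC_W[0])
    return [math.tanh(sum(plays[i] * ENC_W[i][f] for i in range(len(plays))))
            for f in range(k)]

def decode(z):
    """Generative function: taste vector -> one logit per item."""
    n_items = len(DEC_W[0])
    return [sum(z[f] * DEC_W[f][i] for f in range(len(z))) for i in range(n_items)]

def recommend(plays, top_n=1):
    """Rank the items the user has not played by their decoder logits."""
    logits = decode(encode(plays))
    unseen = [i for i, p in enumerate(plays) if p == 0]
    return sorted(unseen, key=lambda i: -logits[i])[:top_n]

print(recommend([1, 0, 0, 0]))
```

A matrix factorization model would instead have to solve for the new user's factor vector before it could score anything, which is the fold-in step the slide refers to.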
Next Play Models
Neural Multi Class Models
[Figure: two feed-forward architectures that estimate P(next-video | <user, cntxt>) with a soft-max over the entire vocabulary: N-GRAM (inputs play(t-n) ... play(t-1) plus cntxt) and BoW-n (a bag of the same n plays plus cntxt)]
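A BoW-n style model can be sketched as follows: average the embeddings of the last n plays, append the context features, and apply a soft-max over the whole catalog. The embedding table, output weights, and single context feature are invented for illustration, and the sketch has no hidden layer, so it is a simplification of whatever network the slide depicts.

```python
import math

def softmax(logits):
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def next_play_probs(history, context, item_emb, w_out):
    """P(next-video | history, context) for every video in the vocabulary."""
    k = len(item_emb[0])
    # Bag of words: the order of the plays is discarded by averaging.
    bow = [sum(item_emb[i][f] for i in history) / len(history) for f in range(k)]
    features = bow + list(context)
    logits = [sum(w * x for w, x in zip(row, features)) for row in w_out]
    return softmax(logits)

def next_play_loss(probs, target):
    """Cross-entropy for the video that was actually played next."""
    return -math.log(probs[target])

# Hypothetical parameters: 3 videos, 2-d embeddings, 1 context feature.
item_emb = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
w_out = [[0.5, -0.5, 0.1], [-0.5, 0.5, 0.1], [0.5, 0.5, -0.2]]
probs = next_play_probs(history=[0, 1], context=[1.0], item_emb=item_emb, w_out=w_out)
print(probs, next_play_loss(probs, target=2))
```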
Neural Multi Class Models
[Figure: RNN family: play(t-1), cntxt and state(t-1) feed a recurrent unit that produces state(t), followed by a soft-max over the entire vocabulary. CNN family: convolutions over the play sequence play(t-n) ... play(t-1), combined with cntxt, followed by a soft-max over the entire vocabulary. Both estimate P(next-video | <user, cntxt>)]
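For the recurrent variant, the key difference from a bag of words is that the state update makes the encoding depend on play order. A minimal Elman-style update, with hypothetical weight matrices and the context input omitted for brevity, might look like this:

```python
import math

def rnn_step(state, play_emb, w_state, w_in):
    """state(t) = tanh(W_state @ state(t-1) + W_in @ emb(play(t-1)))."""
    k = len(state)
    return [
        math.tanh(
            sum(w_state[r][c] * state[c] for c in range(k))
            + sum(w_in[r][c] * play_emb[c] for c in range(len(play_emb)))
        )
        for r in range(k)
    ]

def encode_history(history_embs, w_state, w_in):
    """Run the recurrence over the plays, oldest first, starting from a zero state."""
    state = [0.0] * len(w_state)
    for emb in history_embs:
        state = rnn_step(state, emb, w_state, w_in)
    return state

# Hypothetical weights and embeddings, chosen only to make the effect visible.
w_state = [[0.5, 0.0], [0.0, 0.5]]
w_in = [[1.0, 0.0], [0.0, 1.0]]
a, b = [1.0, 0.0], [0.0, 1.0]
print(encode_history([a, b], w_state, w_in))
print(encode_history([b, a], w_state, w_in))
```

The two calls see the same plays in opposite order and end in different states, which an averaged bag of embeddings could not distinguish. The final state would then feed the soft-max over the vocabulary.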
Why Conditional Models?
● Directly maximizes the likelihood of the user's next play
● No ‘Fold-In’ necessary
○ Only need to evaluate forward graph
● Enables encoding of temporal and sequential information seamlessly
● Rich literature around model adaptation and bootstrapping
Model Comparisons
Results (internal Netflix dataset)
Interpreting a CNN CF Model
● Deeper CNN layers have discovered higher level features in images:
○ Edges
○ Faces etc
● What would a CNN learn if it were trained on a user-item interaction dataset?
○ Can it discover semantic topics?
Interpreting a CNN CF Model
Horror Filter
Kids Filter
Narcotics Filter
Thanks to Ko-Jen Hsiao for the CNN viz
Concluding Remarks
Take Away Points
● Shallow models
○ Presented a unified view of various latent factor models
○ Discussed limited modeling capacity ⇒ inferior prediction power
● Deep models
○ Encoding of rich nonlinear user-item interactions ⇒ superior prediction power
○ Discussed how VAEs can be thought of as nonlinear LDA
○ Showcased how ‘Next Play models’ directly model the task at hand
Thank you
Anoop Deoras: adeoras@netflix.com
Dawen Liang: dliang@netflix.com
