Paris ML meetup
Machine Learning @ Netflix
(and some lessons learned)
Yves Raimond (@moustaki)
Research/Engineering Manager
Search & Recommendations
Algorithm Engineering
Netflix evolution
Netflix scale
● > 69M members
● > 50 countries
● > 1000 device types
● > 3B hours/month
● 36% of peak US downstream traffic
Recommendations @ Netflix
● Goal: Help members find content
to watch and enjoy to maximize
satisfaction and retention
● Over 80% of what people watch
comes from our recommendations
● Top Picks, Because you Watched,
Trending Now, Row Ordering,
Evidence, Search, Search
Recommendations, Personalized
Genre Rows, ...
Models & Algorithms
▪ Regression (linear, logistic, elastic net)
▪ SVD and other Matrix Factorizations
▪ Factorization Machines
▪ Restricted Boltzmann Machines
▪ Deep Neural Networks
▪ Markov Models and Graph Algorithms
▪ Clustering
▪ Latent Dirichlet Allocation
▪ Gradient Boosted Decision Trees/Random Forests
▪ Gaussian Processes
▪ …
Some lessons learned
Build the offline experimentation
framework first
When tackling a new problem
● What offline metrics can we compute that capture the online improvements we’re
actually trying to achieve?
● How should the input data to that evaluation be constructed (train, validation,
test)?
● How fast and easy is it to run a full cycle of offline experimentation?
○ Minimize time to first metric
● How replicable is the evaluation? How shareable are the results?
○ Provenance (see Dagobah)
○ Notebooks (see Jupyter, Zeppelin, Spark Notebook)
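The questions above can be made concrete with a very small offline evaluation loop. The sketch below is only an illustration, not the framework used at Netflix: it assumes play events are `(member_id, item_id, timestamp)` tuples, splits them by time into train, validation and test, and uses recall@k as a stand-in offline metric. The names `temporal_split` and `recall_at_k` are hypothetical.

```python
from typing import Dict, List, Set, Tuple

# Assumed event shape: (member_id, item_id, timestamp)
Event = Tuple[str, str, int]


def temporal_split(
    events: List[Event], train_end: int, valid_end: int
) -> Tuple[List[Event], List[Event], List[Event]]:
    """Split events by time so that evaluation only looks at the future."""
    train = [e for e in events if e[2] < train_end]
    valid = [e for e in events if train_end <= e[2] < valid_end]
    test = [e for e in events if e[2] >= valid_end]
    return train, valid, test


def recall_at_k(
    recommended: Dict[str, List[str]],
    held_out: Dict[str, Set[str]],
    k: int,
) -> float:
    """Average, over members, of the share of held-out items found in the top k."""
    per_member = []
    for member, relevant in held_out.items():
        if not relevant:
            continue  # nothing to recover for this member
        top_k = recommended.get(member, [])[:k]
        hits = len(set(top_k) & relevant)
        per_member.append(hits / len(relevant))
    return sum(per_member) / len(per_member) if per_member else 0.0
```

Keeping the split and the metric in two small functions like this is one way to shorten the time to first metric: any candidate model only has to produce a ranked list per member to be scored.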
When tackling an old problem
● Same…
○ Are the metrics that were designed when experimentation in that space first started still appropriate now?
Think about distribution from the
outermost layers
1. For each combination of hyper-parameters
(e.g. grid search, random search, Gaussian processes…)
2. For each subset of the training data
a. Multi-core learning (e.g. HogWild)
b. Distributed learning (e.g. ADMM, distributed L-BFGS, …)
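A minimal sketch of distributing at the outermost layer first: each hyper-parameter combination is an independent job, so the jobs can be farmed out without touching the learning algorithm itself. Everything here is illustrative; `train_and_evaluate` is a hypothetical stand-in for a full train-plus-offline-metric run, and the thread pool would typically be replaced by a process pool or a cluster scheduler for real, CPU-bound training.

```python
import itertools
from concurrent.futures import Executor, ThreadPoolExecutor
from typing import Dict, Iterator, List, Tuple

Params = Dict[str, float]


def train_and_evaluate(params: Params) -> float:
    """Hypothetical placeholder: train one model and return its offline metric."""
    lr = params["learning_rate"]
    reg = params["regularization"]
    # Toy score that peaks at learning_rate=0.1, regularization=0.01.
    return -((lr - 0.1) ** 2) - ((reg - 0.01) ** 2)


def grid(space: Dict[str, List[float]]) -> Iterator[Params]:
    """Enumerate every combination of the listed hyper-parameter values."""
    keys = sorted(space)
    for values in itertools.product(*(space[key] for key in keys)):
        yield dict(zip(keys, values))


def search(space: Dict[str, List[float]], executor: Executor) -> Tuple[Params, float]:
    """Run one independent job per combination and keep the best score."""
    candidates = list(grid(space))
    scores = list(executor.map(train_and_evaluate, candidates))
    best = max(range(len(candidates)), key=scores.__getitem__)
    return candidates[best], scores[best]


if __name__ == "__main__":
    space = {"learning_rate": [0.01, 0.1, 1.0], "regularization": [0.001, 0.01]}
    with ThreadPoolExecutor(max_workers=4) as pool:
        print(search(space, pool))
```

Because the jobs share nothing, there is no communication between them during training, which is what makes this layer the cheapest one to parallelize.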
When to use distributed learning?
● The impact of communication overhead when building distributed ML
algorithms is non-trivial
● Is your data big enough that the distribution offsets the communication overhead?
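The trade-off in the last question can be put as back-of-the-envelope arithmetic. The model below is a deliberately crude assumption, not a measurement: compute is taken to parallelize perfectly across workers, and every synchronization round is taken to cost a fixed number of seconds. All function and parameter names are hypothetical.

```python
def distributed_time(
    compute_seconds: float,
    workers: int,
    sync_rounds: int,
    seconds_per_sync: float,
) -> float:
    """Estimated wall-clock time: ideal parallel compute plus communication."""
    return compute_seconds / workers + sync_rounds * seconds_per_sync


def worth_distributing(
    compute_seconds: float,
    workers: int,
    sync_rounds: int,
    seconds_per_sync: float,
) -> bool:
    """True when the estimate beats running everything on a single machine."""
    estimate = distributed_time(compute_seconds, workers, sync_rounds, seconds_per_sync)
    return estimate < compute_seconds
```

Under these assumptions, a 100-second job on 10 workers with 50 one-second synchronization rounds is estimated at 60 seconds, so distribution pays off. A 10-second job with the same communication pattern is estimated at 51 seconds, slower than a single machine.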
Example: Uncollapsed Gibbs sampler for LDA
(more details here)
Design production code to be
experimentation-friendly
Example development process
[Diagram, elements as they appear on the slide]
● Offline: Idea, Data, Offline Modeling (R, Python, MATLAB, …), Iterate, Final model
● Production: Implement in production system (Java, C++, …), Production environment (A/B test), Actual output
● Pitfalls shown: missing post-processing logic, performance issues, code discrepancies, data discrepancies
Avoid dual implementations
[Diagram: Experiment code (Experiment) and Production code (Production) both sit on top of a Shared Engine]
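One way to read the shared-engine idea is sketched below, under assumptions: a single `SharedEngine` class owns the scoring and ranking logic, and both the offline experiment and the production request handler are thin wrappers around it. The class, the linear scoring, and both wrapper functions are hypothetical illustrations, not the actual Netflix design.

```python
from typing import Dict, List, Tuple

Features = Dict[str, float]


class SharedEngine:
    """Single implementation of scoring and ranking, used offline and online."""

    def __init__(self, weights: Dict[str, float]) -> None:
        self.weights = dict(weights)

    def score(self, features: Features) -> float:
        # Simple linear score; unknown features contribute nothing.
        return sum(self.weights.get(name, 0.0) * value for name, value in features.items())

    def rank(self, candidates: Dict[str, Features]) -> List[str]:
        return sorted(candidates, key=lambda item: self.score(candidates[item]), reverse=True)


def offline_experiment(
    engine: SharedEngine, sessions: List[Tuple[Dict[str, Features], str]]
) -> float:
    """Experiment code: share of logged sessions where the top-ranked item was played."""
    if not sessions:
        return 0.0
    hits = sum(1 for candidates, played in sessions if engine.rank(candidates)[0] == played)
    return hits / len(sessions)


def serve_request(engine: SharedEngine, candidates: Dict[str, Features], limit: int) -> List[str]:
    """Production code: same ranking call, truncated to what the page can show."""
    return engine.rank(candidates)[:limit]
```

Because both paths call `engine.rank`, any post-processing added to the engine is picked up by the experiment and by production at the same time, which is the kind of code discrepancy the previous slide warns about.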
To be continued...
We’re hiring!
Yves Raimond (@moustaki)
