SlideShare a Scribd company logo
Training Deep AutoEncoders for
Collaborative Filtering
Marlesson Santana
Prof. Dr. Anderson Soares
• Introduction
• Objective
• Recommender Model
• Results and discussion
• Conclusion
Outline
2
RecSys are a subclass of information filtering system that seek
to predict the ‘rating’ or ‘preference’ that user would give to an
item.
Sites like amazon, Netflix and Spotify use RecSys to suggest
items to users.
Introduction: What is a RecSys ?
3
RecSys can be divided into different types:
● Content-Based
● Collaborative filter
● Cold-start
● Hybrid
Introduction: Types of RecSys
4
The classic CF problem is what tries to infer the missing entries
in an mxn matrix, whose (i,j) entry describes the ratings given
by the ith user to the jth item.
Introduction: Collaborative Filter
5
There are many non deep learning types of approaches to CF,
Matrix factorization techniques is the most popular. Several
recent approaches use autoencoders for RecSys, including the
current state-of-the-art.
Introduction: RecSys with DeepLearning
6
This paper proposes a Deep Autoencoder Model for
Collaborative Filtering for rating prediction task in
recommender system.
Objective
7
The dataset used was the
original Netflix Prize,
separated into subsets of
training and testing with
differing periods
Recommender Model: Dataset
8
Encoder and decoder parts
of the autoencoder consist
of feed-forward neural
networks with classical fully
connected layers
computing:
l = f(W * x + b)
Recommender Model: Model
9
During forward pass or
inference, the model takes
user represented by his
vector of ratings from the
training set x ∈ ℝn, where
n is number of items
x = [s1, s2, s3,..., sn]
Recommender Model: Model
10
The optimize Masked Mean Squared Error loss (MMSE)
where, ri is actual rating, yi is reconstructed and mi is a mask
function such that mi = 1 if ri != 0 else mi = 0.
Recommender Model: Loss function
11
Results and discussion: Activation functions
12
There are two properties
which seems to separate
activations:
● non-zero negative part
● unbounded positive part
Results and discussion: Size of hidden units
13
Single layer autoencoder with 128, 256, 512 and 1024
hidden units in the coding layer. A: training RMSE per
epoch; B: evaluation RMSE per epoch.
Results and discussion: Dropout
14
This model quickly over-fits
if trained with no
regularization.
Very high values of drop
probability (0.8) turned out
to be the best
Results and discussion: Comparison
15
Results and discussion: Inference of RecSys
16
1. Given sparse x, x ∈ ℝn
2. Compute dense
f(x) = decode(encode(x))
3. Filters only f(x)i where xi = 0
4. Recommends the highest value
of f(x)i
• This paper demonstrated how very deep autoencoders can
be successfully trained for RecSys
• On the task of future rating prediction, the model
outperforms other approaches even without using
additional temporal signals
• It has been demonstrated the importance of the activation
function and the regularization by dropout
Conclusion
17
Obrigado(a)!
18

More Related Content

PPTX
sentiment analysis using support vector machine
PPTX
Basics of Machine Learning
PPTX
K nearest neighbor
PDF
K - Nearest neighbor ( KNN )
DOCX
Structure in c sharp
PDF
Classification Based Machine Learning Algorithms
PDF
Meetup 29042015
PPTX
Unsupervised learning clustering
sentiment analysis using support vector machine
Basics of Machine Learning
K nearest neighbor
K - Nearest neighbor ( KNN )
Structure in c sharp
Classification Based Machine Learning Algorithms
Meetup 29042015
Unsupervised learning clustering

What's hot (20)

PPTX
Support Vector Machine ppt presentation
PPTX
Presentation on unsupervised learning
PPT
Lecture 2
PPT
Towards a theory of data entangelement
PPT
Designing a Proof GUI for Non-Experts
PDF
Introduction to conventional machine learning techniques
PPTX
A Fairness-aware Machine Learning Interface for End-to-end Discrimination Dis...
PPTX
Support vector machine-SVM's
PPTX
Decision Trees
PDF
Support vector machines
PPTX
Structures in c language
PDF
Lecture 8: Decision Trees & k-Nearest Neighbors
PPT
ppt slides
PPTX
Intro to Machine Learning for non-Data Scientists
PPT
Machine learning by Dr. Vivek Vijay and Dr. Sandeep Yadav
PPTX
Basic of Structure,Structure members,Accessing Structure member,Nested Struct...
PPTX
Machine Learning using Support Vector Machine
PPTX
Linear Regression, Machine learning term
PDF
IB Math Functions
Support Vector Machine ppt presentation
Presentation on unsupervised learning
Lecture 2
Towards a theory of data entangelement
Designing a Proof GUI for Non-Experts
Introduction to conventional machine learning techniques
A Fairness-aware Machine Learning Interface for End-to-end Discrimination Dis...
Support vector machine-SVM's
Decision Trees
Support vector machines
Structures in c language
Lecture 8: Decision Trees & k-Nearest Neighbors
ppt slides
Intro to Machine Learning for non-Data Scientists
Machine learning by Dr. Vivek Vijay and Dr. Sandeep Yadav
Basic of Structure,Structure members,Accessing Structure member,Nested Struct...
Machine Learning using Support Vector Machine
Linear Regression, Machine learning term
IB Math Functions
Ad

Similar to Training deep auto encoders for collaborative filtering (20)

PDF
Introduction to Machine Learning with SciKit-Learn
PDF
Discovering User's Topics of Interest in Recommender Systems @ Meetup Machine...
PPTX
Unit - 1 - Introduction of the machine learning
PDF
Current clustering techniques
PPTX
Spark MLlib - Training Material
PDF
LR2. Summary Day 2
PPTX
Rapid object detection using boosted cascade of simple features
PPT
01-introductionto Object ooriented Programming in JAVA CS.ppt
PDF
Unit4_AML_MTech that has many ML concepts covered
PDF
GRASP_Designing Objects With Responsibilities.pdf
PDF
Integrating Artificial Intelligence with IoT
PPT
01-introduction OOPS concepts in C++ JAVA
PPTX
acmsigtalkshare-121023190142-phpapp01.pptx
PDF
Methodological study of opinion mining and sentiment analysis techniques
PPTX
Keynote at IWLS 2017
PDF
Hands-on - Machine Learning using scikitLearn
PPTX
Classification: MNIST, training a Binary classifier, performance measure, mul...
PPTX
Recommendation Systems
PPTX
Machine Learning for Recommender Systems in the Job Market
PPTX
My Three Ex’s: A Data Science Approach for Applied Machine Learning
Introduction to Machine Learning with SciKit-Learn
Discovering User's Topics of Interest in Recommender Systems @ Meetup Machine...
Unit - 1 - Introduction of the machine learning
Current clustering techniques
Spark MLlib - Training Material
LR2. Summary Day 2
Rapid object detection using boosted cascade of simple features
01-introductionto Object ooriented Programming in JAVA CS.ppt
Unit4_AML_MTech that has many ML concepts covered
GRASP_Designing Objects With Responsibilities.pdf
Integrating Artificial Intelligence with IoT
01-introduction OOPS concepts in C++ JAVA
acmsigtalkshare-121023190142-phpapp01.pptx
Methodological study of opinion mining and sentiment analysis techniques
Keynote at IWLS 2017
Hands-on - Machine Learning using scikitLearn
Classification: MNIST, training a Binary classifier, performance measure, mul...
Recommendation Systems
Machine Learning for Recommender Systems in the Job Market
My Three Ex’s: A Data Science Approach for Applied Machine Learning
Ad

More from Marlesson Santana (11)

PDF
UFG - Topic Modeling in Text
PDF
Ciência de Dados: a revolução na tomada de decisões
PDF
Data science com python - frameworks e melhores práticas
PDF
Sistemas de Recomendação - O que são? Como funcionam? Do que se alimentam?
PDF
Campus Party - Sistemas de Recomendação
PDF
Machine Learning: Do Notebook ao modelo em produção
PDF
Ciência de Dados: a revolução na tomada de decisões
PDF
Deep Learning para Sistemas de Recomendação
PDF
DDC - workshop deep learning
PDF
Introdução a Redes Neurais Recorrentes
PDF
Mineração de Dados em Redes Sociais
UFG - Topic Modeling in Text
Ciência de Dados: a revolução na tomada de decisões
Data science com python - frameworks e melhores práticas
Sistemas de Recomendação - O que são? Como funcionam? Do que se alimentam?
Campus Party - Sistemas de Recomendação
Machine Learning: Do Notebook ao modelo em produção
Ciência de Dados: a revolução na tomada de decisões
Deep Learning para Sistemas de Recomendação
DDC - workshop deep learning
Introdução a Redes Neurais Recorrentes
Mineração de Dados em Redes Sociais

Recently uploaded (20)

PDF
Approach and Philosophy of On baking technology
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Electronic commerce courselecture one. Pdf
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Encapsulation theory and applications.pdf
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
Tartificialntelligence_presentation.pptx
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPT
Teaching material agriculture food technology
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
cuic standard and advanced reporting.pdf
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Approach and Philosophy of On baking technology
Digital-Transformation-Roadmap-for-Companies.pptx
Building Integrated photovoltaic BIPV_UPV.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
Electronic commerce courselecture one. Pdf
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Encapsulation theory and applications.pdf
Mobile App Security Testing_ A Comprehensive Guide.pdf
Tartificialntelligence_presentation.pptx
The Rise and Fall of 3GPP – Time for a Sabbatical?
Reach Out and Touch Someone: Haptics and Empathic Computing
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Teaching material agriculture food technology
Diabetes mellitus diagnosis method based random forest with bat algorithm
Unlocking AI with Model Context Protocol (MCP)
MYSQL Presentation for SQL database connectivity
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
cuic standard and advanced reporting.pdf
gpt5_lecture_notes_comprehensive_20250812015547.pdf

Training deep auto encoders for collaborative filtering

  • 1. Training Deep AutoEncoders for Collaborative Filtering Marlesson Santana Prof. Dr. Anderson Soares
  • 2. • Introduction • Objective • Recommender Model • Results and discussion • Conclusion Outline 2
  • 3. RecSys are a subclass of information filtering system that seek to predict the ‘rating’ or ‘preference’ that user would give to an item. Sites like amazon, Netflix and Spotify use RecSys to suggest items to users. Introduction: What is a RecSys ? 3
  • 4. RecSys can be divided into different types: ● Content-Based ● Collaborative filter ● Cold-start ● Hybrid Introduction: Types of RecSys 4
  • 5. The classic CF problem is what tries to infer the missing entries in an mxn matrix, whose (i,j) entry describes the ratings given by the ith user to the jth item. Introduction: Collaborative Filter 5
  • 6. There are many non deep learning types of approaches to CF, Matrix factorization techniques is the most popular. Several recent approaches use autoencoders for RecSys, including the current state-of-the-art. Introduction: RecSys with DeepLearning 6
  • 7. This paper proposes a Deep Autoencoder Model for Collaborative Filtering for rating prediction task in recommender system. Objective 7
  • 8. The dataset used was the original Netflix Prize, separated into subsets of training and testing with differing periods Recommender Model: Dataset 8
  • 9. Encoder and decoder parts of the autoencoder consist of feed-forward neural networks with classical fully connected layers computing: l = f(W * x + b) Recommender Model: Model 9
  • 10. During forward pass or inference, the model takes user represented by his vector of ratings from the training set x ∈ ℝn, where n is number of items x = [s1, s2, s3,..., sn] Recommender Model: Model 10
  • 11. The optimize Masked Mean Squared Error loss (MMSE) where, ri is actual rating, yi is reconstructed and mi is a mask function such that mi = 1 if ri != 0 else mi = 0. Recommender Model: Loss function 11
  • 12. Results and discussion: Activation functions 12 There are two properties which seems to separate activations: ● non-zero negative part ● unbounded positive part
  • 13. Results and discussion: Size of hidden units 13 Single layer autoencoder with 128, 256, 512 and 1024 hidden units in the coding layer. A: training RMSE per epoch; B: evaluation RMSE per epoch.
  • 14. Results and discussion: Dropout 14 This model quickly over-fits if trained with no regularization. Very high values of drop probability (0.8) turned out to be the best
  • 15. Results and discussion: Comparison 15
  • 16. Results and discussion: Inference of RecSys 16 1. Given sparse x, x ∈ ℝn 2. Compute dense f(x) = decode(encode(x)) 3. Filters only f(x)i where xi = 0 4. Recommends the highest value of f(x)i
  • 17. • This paper demonstrated how very deep autoencoders can be successfully trained for RecSys • On the task of future rating prediction, the model outperforms other approaches even without using additional temporal signals • It has been demonstrated the importance of the activation function and the regularization by dropout Conclusion 17