2015-12-10
Eliezer de Souza da Silva (State-space models, Dynamic PMF via HDP)
Tomasz Kuśmierczyk (Tensor factorization)
Session 3: Time variant models
Tensor factorization
State-space models
Dynamic Bayesian PMF (via HDP)
Approximate and Scalable Inference for Complex Probabilistic Models in Recommender Systems
Part 1: Models and Representations
Literature / Sources
● Temporal Collaborative Filtering with Bayesian Probabilistic Tensor Factorization -- Xiong, L., Chen, X., Huang, T. K., Schneider, J. G., & Carbonell, J. G. 2010. SDM Proceedings.
● Dynamic Matrix Factorization: A State-Space Approach -- John Z. Sun, Kush R. Varshney and Karthik Subbian. 2012. ICASSP.
● Dynamic Bayesian Probabilistic Matrix Factorization -- Sotirios P. Chatzis. 2014. AAAI.
Temporal Collaborative Filtering with Bayesian Probabilistic Tensor Factorization
Matrix Factorization (previous cases)
- Ratings matrix (normalized): N users x M items
- User factors: N x D
- Item factors: M x D
- Latent dimensions: 1 to D
Generalization to tensors (multi-way data)
- P-mode tensor of dimensions M1 x … x MP (example: observations x measurements x time x equipment).
- Multiple relationships between multidimensional variables
- Focus on the 3-way case (canonical decomposition / parallel factor analysis, CP)
CP Tensor Factorization (current case: 3-way analysis)
- Ratings tensor (normalized): N users x M items x K contexts
- User factors: N x D
- Item factors: M x D
- Context value factors: K x D
- Latent dimensions: 1 to D
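To make the shapes above concrete, here is a minimal NumPy sketch of how a CP model would score a (user, item, context) triple: the prediction is the sum over the D latent dimensions of the element-wise product of the three factor rows. The function names and the random factor matrices are illustrative assumptions, not taken from the slides.

```python
import numpy as np

def cp_predict(U, V, T, i, j, k):
    """Predicted rating for user i, item j, context k under a CP model."""
    # Sum over the D latent dimensions of the element-wise triple product.
    return float(np.sum(U[i] * V[j] * T[k]))

def cp_reconstruct(U, V, T):
    """Full N x M x K tensor of predicted ratings."""
    return np.einsum("id,jd,kd->ijk", U, V, T)

# Hypothetical sizes and random factors, for illustration only.
rng = np.random.default_rng(0)
N, M, K, D = 4, 5, 3, 2
U = rng.normal(size=(N, D))   # user factors
V = rng.normal(size=(M, D))   # item factors
T = rng.normal(size=(K, D))   # context (time) factors
R_hat = cp_reconstruct(U, V, T)
```

With K = 1 and a context factor of all ones, this reduces to the matrix factorization case from the previous slides.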
Temporal ...
● One additional type of context: time
(3D tensor instead of the 2D matrix R)
● In practice:
○ ECCO sales: two context values per season (early/late season)
○ Netflix, MovieLens: one context value per month
MAP Approach: what's new to PMF
argmax_{U,V,T,T0} log p(U, V, T, T0 | R)
= argmax_{U,V,T,T0} [ log p(R | U, V, T, T0) + log p(U, V, T, T0) ]
MAP Approach
● Four params (lambdas)
● SGD
● Block Coordinate Descent
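As a rough illustration of what SGD or block coordinate descent would minimize, the sketch below writes the negative log-posterior as a squared error plus four quadratic penalties. It assumes a Gaussian likelihood, zero-mean Gaussian priors on U and V, a random-walk prior linking consecutive time factors, and a Gaussian prior on T0. The names lam_U, lam_V, lam_dT, lam_0 and mu0 are hypothetical, and mapping them onto the "four params" of the slide is an assumption.

```python
import numpy as np

def map_objective(obs, U, V, T, T0, lam_U, lam_V, lam_dT, lam_0, mu0=None):
    """Negative log-posterior up to constants, under the assumed Gaussian model.

    obs: iterable of (i, j, k, r) observed ratings.
    U: (N, D), V: (M, D), T: (K, D), T0: (D,) initial time factor.
    """
    if mu0 is None:
        mu0 = np.zeros_like(T0)  # assumed prior mean of T0
    # Data term from log p(R | U, V, T, T0): squared error on observed entries.
    sq_err = sum((r - np.sum(U[i] * V[j] * T[k])) ** 2 for i, j, k, r in obs)
    # Prior terms from log p(U, V, T, T0).
    prev = np.vstack([T0, T[:-1]])      # T_{k-1} for each k, with T0 before T[0]
    smooth = np.sum((T - prev) ** 2)    # random-walk penalty on time factors
    return 0.5 * (sq_err
                  + lam_U * np.sum(U ** 2)
                  + lam_V * np.sum(V ** 2)
                  + lam_dT * smooth
                  + lam_0 * np.sum((T0 - mu0) ** 2))

# Tiny hand-checkable case: one user, one item, two time steps, D = 1.
U = np.ones((1, 1))
V = np.ones((1, 1))
T = np.ones((2, 1))
T0 = np.zeros(1)
value = map_objective([(0, 0, 0, 3.0)], U, V, T, T0, 1.0, 1.0, 1.0, 1.0)
```

In this toy case the prediction is 1, the error is 2, and the penalties on U, V and the first time step each contribute 0.5, so `value` is 3.5.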
Bayesian approach
- Predictions for unobserved ratings: integrate over all parameters
- A posteriori distribution of the parameters, given the observed evidence
Bayesian approach: expectation over the posterior distribution
Bayesian approach: MCMC estimate
- Sample from the posterior distribution
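The MCMC estimate replaces the integral over parameters with an average over posterior samples. A minimal sketch, assuming the samples are (U, V, T) triples from a CP model: in a real run they would come from a sampler such as Gibbs sampling, while here two hand-made stand-in "samples" are used so the arithmetic is easy to follow.

```python
import numpy as np

def mc_predict(samples, i, j, k):
    """Monte Carlo estimate of the predictive mean for entry (i, j, k).

    samples: list of (U, V, T) factor triples, assumed to be posterior draws.
    """
    preds = [np.sum(U[i] * V[j] * T[k]) for U, V, T in samples]
    return float(np.mean(preds))

# Stand-in "posterior samples" (not produced by a real sampler), with D = 1.
ones = np.ones((1, 1))
samples = [(ones, ones, ones), (2 * ones, 2 * ones, 2 * ones)]
estimate = mc_predict(samples, 0, 0, 0)  # average of 1 and 8
```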
Linear state-space approach
- User latent factors are time dependent
- User latent factors are hidden states in a state-space system
- Item latent factors are stationary
- Ratings are time dependent and observed
- Gaussian assumptions for the dynamics allow exact inference
Kalman filters: combining new information
- System dynamics
- Prediction
- Kalman gain
- Update
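The prediction, Kalman gain and update steps can be sketched with the standard textbook Kalman filter equations. The symbols A (transition), H (measurement), Q (process noise covariance) and R (measurement noise covariance) follow common convention and are assumed to match the slides' notation; the scalar example values are made up.

```python
import numpy as np

def kalman_predict(x, P, A, Q):
    """Propagate the state mean x and covariance P through the dynamics."""
    x_pred = A @ x
    P_pred = A @ P @ A.T + Q
    return x_pred, P_pred

def kalman_update(x_pred, P_pred, y, H, R):
    """Combine the prediction with a new measurement y."""
    S = H @ P_pred @ H.T + R                 # innovation covariance
    K = P_pred @ H.T @ np.linalg.inv(S)      # Kalman gain
    x_new = x_pred + K @ (y - H @ x_pred)    # correct the mean by the residual
    P_new = (np.eye(len(x_pred)) - K @ H) @ P_pred
    return x_new, P_new, K

# Scalar example: prior N(0, 1), static dynamics, unit measurement noise.
x, P = np.array([0.0]), np.array([[1.0]])
A = np.array([[1.0]])
H = np.array([[1.0]])
Q = np.array([[0.0]])
R = np.array([[1.0]])
x_pred, P_pred = kalman_predict(x, P, A, Q)
x_new, P_new, K = kalman_update(x_pred, P_pred, np.array([2.0]), H, R)
```

In the scalar example the prior and the measurement are equally uncertain, so the gain is 0.5 and the updated mean lands halfway between the prediction (0) and the measurement (2).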
PMF meets Kalman
- Stationary item factors, time-dependent ratings, time-dependent user features
- Parameters are time-independent
- Initial state: iid zero-mean Gaussian for all users, with a common scaling of preferences σU
- Process noise (time evolution of user preferences) and measurement noise (estimation of the rating from user and item latent factors) are iid zero-mean Gaussians with scales σQ and σR
- Transitions (A) and measurements (item latent factors H) can be estimated by maximizing the log-likelihood.
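The assumptions above describe a generative model that can be simulated directly. The sketch below draws one user's latent trajectory and ratings. It assumes isotropic noise with standard deviations sigma_U, sigma_Q and sigma_R, and, as a simplification, that the user rates every item at every time step. The function name and all numeric values are hypothetical.

```python
import numpy as np

def simulate_user(A, H, sigma_U, sigma_Q, sigma_R, steps, rng):
    """Simulate one user's latent states and ratings over time.

    A: (D, D) transition matrix, H: (M, D) stationary item factors.
    Returns states of shape (steps, D) and ratings of shape (steps, M).
    """
    D, M = A.shape[0], H.shape[0]
    x = sigma_U * rng.standard_normal(D)              # initial user state
    states, ratings = [], []
    for _ in range(steps):
        x = A @ x + sigma_Q * rng.standard_normal(D)  # process noise
        y = H @ x + sigma_R * rng.standard_normal(M)  # measurement noise
        states.append(x)
        ratings.append(y)
    return np.array(states), np.array(ratings)

# Illustrative values only.
rng = np.random.default_rng(0)
A = 0.9 * np.eye(2)                 # slowly decaying user preferences
H = rng.standard_normal((3, 2))     # three items, two latent dimensions
states, ratings = simulate_user(A, H, 1.0, 0.1, 0.5, steps=10, rng=rng)
```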
PMF meets Kalman: learning the parameters
- EM with expected joint likelihood maximization
- Other approaches: minimizing the residual prediction error, maximizing the prediction likelihood, maximizing the
measurement likelihood, optimizing the performance after smoothing.
Dynamic Bayesian Probabilistic Matrix Factorization
- User patterns change over time
- Groups of users share latent structure (clustering of user features)
- Capture the dynamics of the generative process of the group structure
- dHDP: dynamic hierarchical Dirichlet process
Dirichlet distribution
Dirichlet process
- Distribution over distributions (each draw is a discrete distribution with infinitely many atoms)
- Clustering effect: the rich get richer
- Chinese restaurant process.
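The rich-get-richer effect is easy to see in a simulation of the Chinese restaurant process: each new customer joins an existing table with probability proportional to its current size, or opens a new table with probability proportional to a concentration parameter. The sketch below uses the common symbol alpha for that parameter; the function name and the example values are illustrative assumptions.

```python
import numpy as np

def chinese_restaurant_process(n, alpha, rng):
    """Seat n customers; return each customer's table and the table sizes."""
    tables = []   # tables[t] = number of customers at table t
    seats = []    # seats[c] = table index chosen by customer c
    for _ in range(n):
        # Existing tables weighted by size, a new table weighted by alpha.
        weights = np.array(tables + [alpha], dtype=float)
        probs = weights / weights.sum()
        choice = int(rng.choice(len(probs), p=probs))
        if choice == len(tables):
            tables.append(1)       # open a new table
        else:
            tables[choice] += 1    # join an existing table
        seats.append(choice)
    return seats, tables

rng = np.random.default_rng(0)
seats, tables = chinese_restaurant_process(100, alpha=1.0, rng=rng)
```

Larger alpha values tend to produce more tables, which corresponds to more clusters of users.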
Probabilistic Models in Recommender Systems: Time Variant Models
Hierarchical Dirichlet Process (HDP)
HDP for time domain
[Diagram labels: Bayesian PMF, dHDP, groups of users]
