SlideShare a Scribd company logo
2013/3/27
Transfer learning in
heterogeneous collaborative
filtering domains
Authors/ Weike Pan and Qiang Yang
Affiliation/ Dept. of CSE, Hong Kong University of Science and Technology
Source/ Journal of Artificial Intelligence (2013)
Presenter/ Allen Wu
                                                                              1
Outline
• Introduction
• Heterogeneous collaborative filtering problems




                                                   2013/3/27
• Transfer by collective factorization
• Experimental results
• Conclusion




                                                     2
Introduction
• Data sparsity is a major challenge in collaborative filtering (CF).
   • Overfitting can easily happen for prediction.




                                                                               2013/3/27
• Some auxiliary data of the form “like” or “dislike” may be more
  easily obtained.
   • It’s more convenient for users to express preference.


• How do we take advantage of auxiliary knowledge to alleviate the
  sparsity problem?

• Most existing transfer learning methods in CF consider auxiliary data from
  several perspectives.
   • User-side transfer, item-side transfer, knowledge-transfer.                 3
Probabilistic Matrix Factorization
(NIPS’08)
•




                                     2013/3/27
                                       4
Social Recommendation (CIKM’08)
•




                                  2013/3/27
                                    5
Collective Matrix Factorization (KDD’08)
•




                                           2013/3/27
                                             6
CodeBook Transfer (IJCAI’09)
•




                               2013/3/27
                                 7
Rating-matrix generative model (ICML’09)
• RMGM is derived and extended from FMM generative model,
  which can be formulated as




                                                             2013/3/27
  • The difference:
     • It learns (U, V) and (U3, V3) alternatively.
     • A soft indicator matrix is used. E.g., U [0, 1]n d.




                                                               8
Heterogeneous collaborative filtering
problems
•                   •




                                        2013/3/27
                                          9
Challenges
•




             2013/3/27
             10
Overview of solution
•




                       2013/3/27
                       11
Model formulation
• Assume a user u’s rating on an item i in the target data, rui, is
  generated from




                                                                      2013/3/27
  • user-specific latent feature vector Uu  1 d, where u=1,…,n.

  • item-specific latent feature vector Vi 1 d, where i=1,…,m.

  • some data-dependent effect denoted as B      d d.




                                                                      12
Model formulation (Cont.)
• Likelihood:
• Prior:




                                                    2013/3/27
• Posterior Likelihood Prior (Bayesian inference)
  • Log(Posterior)= Log(Likelihood Prior)




                                                    13
Model formulation
•




                    2013/3/27
                    14
Learning the TCF




                   2013/3/27
                   15
Learning U and V in CMTF
• Theorem 1. Given B and V, we can obtain the user-specific
  latent matrix U in a closed form.




                                                              2013/3/27
                                                              16
Learning U and V in CSVD
•




                           2013/3/27
                           17
Learning U and V in CSVD
(Cont.)




                           2013/3/27
                           18
•




     2013/3/27
19
Algorithm of TCF




                   2013/3/27
                   20
Data sets
•




            2013/3/27
            21
Evaluation metrics
• Summary of Data sets




                         2013/3/27
• Evaluation metrics



                         22
Baselines and parameter settings
•




                                   2013/3/27
                                   23
Performance of Moviepilot data




                                 2013/3/27
                                 24
Performance of Netfliex data




                               2013/3/27
                               25
Performance on Netflix at different
sparsity levels
• SCVD performs
  better than CMTF in




                                      2013/3/27
  all cases.




                                      26
Conclusion
• This paper investigate how to address the sparsity problem in
  CF via a transfer learning solution.




                                                                   2013/3/27
• The TCP framework is proposed to transfer knowledge from
  auxiliary data to target data to alleviates the data sparsity.

• Experimental results show that TCP performs significantly
  better than several state-of-the-art baseline algorithms.

• In the future, the “pure” cold-start problem for users without
  any rating is needed to be addressed via transfer learning.
                                                                   27
2013/3/27
Thank you for
listening.
Q&A



                28

More Related Content

PPTX
Using support vector machine with a hybrid feature selection method to the st...
PPTX
Incremental collaborative filtering via evolutionary co clustering
PPT
A scalable collaborative filtering framework based on co clustering
PPTX
Co-clustering of multi-view datasets: a parallelizable approach
PPT
Organizing the classroom small group 1
PDF
Project-Based Learning Guided Lesson Study Improve the Achievement of Learnin...
PPT
Maed 5040-5070-study of studies presentation
PPTX
Avlm 2009 Guided Indep Learning Wim
Using support vector machine with a hybrid feature selection method to the st...
Incremental collaborative filtering via evolutionary co clustering
A scalable collaborative filtering framework based on co clustering
Co-clustering of multi-view datasets: a parallelizable approach
Organizing the classroom small group 1
Project-Based Learning Guided Lesson Study Improve the Achievement of Learnin...
Maed 5040-5070-study of studies presentation
Avlm 2009 Guided Indep Learning Wim

Viewers also liked (7)

PPTX
Packard Foundation Peer Learning Group
PPT
Peer To Peer Learning 10 7 09f1
PPTX
The effect of ability grouping on students’
PPTX
Teaching (and Learning) with Peer Instruction
PPTX
OER Peer Learning Web-Based Application
PDF
Peer-to-Peer learning technologies, Visualisation and the education around th...
PPTX
Curriculum development
Packard Foundation Peer Learning Group
Peer To Peer Learning 10 7 09f1
The effect of ability grouping on students’
Teaching (and Learning) with Peer Instruction
OER Peer Learning Web-Based Application
Peer-to-Peer learning technologies, Visualisation and the education around th...
Curriculum development
Ad

Similar to Transfer learning in heterogeneous collaborative filtering domains (20)

PPTX
22PCOAM21 Data Quality Session 3 Data Quality.pptx
PDF
How useful is self-supervised pretraining for Visual tasks?
PDF
An Ecore Metamodel for the W3C PROV Provenance Data Model
PPT
Triangular Learner Model
PPTX
Pattern Recognition in Multiple Bike sharing Systems for comparability
PDF
Declarative data analysis
PDF
Cikm 2013 - Beyond Data From User Information to Business Value
PDF
Introduction to ΔQ and Network Performance Science (extracts)
PDF
Model-Based Testing: Concepts, Tools, and Techniques
PDF
Principles of Data Visualization
PPT
GRAPH-BASED RECOMMENDATION SYSTEM
PPTX
A Graph Summarization: A Survey | Summarizing and understanding large graphs
PPTX
Algorithm visualization using pygame and tkinter
PPTX
TELECOM_CHURN_PREDICTIAAAAAAAAAAAAAAAAAON[1].pptx
PDF
GDG Cloud Community Day 2022 - Managing data quality in Machine Learning
PDF
Cold-Start Management with Cross-Domain Collaborative Filtering and Tags
PDF
Introduction to Data Analytics with R
PDF
WorldCist 2013 - Behavior Assessment Framework
PPTX
241014_Thuy_Labseminar[Where to Mask: Structure-Guided Masking for Graph Mask...
22PCOAM21 Data Quality Session 3 Data Quality.pptx
How useful is self-supervised pretraining for Visual tasks?
An Ecore Metamodel for the W3C PROV Provenance Data Model
Triangular Learner Model
Pattern Recognition in Multiple Bike sharing Systems for comparability
Declarative data analysis
Cikm 2013 - Beyond Data From User Information to Business Value
Introduction to ΔQ and Network Performance Science (extracts)
Model-Based Testing: Concepts, Tools, and Techniques
Principles of Data Visualization
GRAPH-BASED RECOMMENDATION SYSTEM
A Graph Summarization: A Survey | Summarizing and understanding large graphs
Algorithm visualization using pygame and tkinter
TELECOM_CHURN_PREDICTIAAAAAAAAAAAAAAAAAON[1].pptx
GDG Cloud Community Day 2022 - Managing data quality in Machine Learning
Cold-Start Management with Cross-Domain Collaborative Filtering and Tags
Introduction to Data Analytics with R
WorldCist 2013 - Behavior Assessment Framework
241014_Thuy_Labseminar[Where to Mask: Structure-Guided Masking for Graph Mask...
Ad

Recently uploaded (20)

PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PPTX
master seminar digital applications in india
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
Computing-Curriculum for Schools in Ghana
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
Insiders guide to clinical Medicine.pdf
PDF
Microbial disease of the cardiovascular and lymphatic systems
PPTX
Institutional Correction lecture only . . .
PDF
Anesthesia in Laparoscopic Surgery in India
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
Pharma ospi slides which help in ospi learning
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PPTX
GDM (1) (1).pptx small presentation for students
human mycosis Human fungal infections are called human mycosis..pptx
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
STATICS OF THE RIGID BODIES Hibbelers.pdf
master seminar digital applications in india
Renaissance Architecture: A Journey from Faith to Humanism
Pharmacology of Heart Failure /Pharmacotherapy of CHF
Computing-Curriculum for Schools in Ghana
Supply Chain Operations Speaking Notes -ICLT Program
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Insiders guide to clinical Medicine.pdf
Microbial disease of the cardiovascular and lymphatic systems
Institutional Correction lecture only . . .
Anesthesia in Laparoscopic Surgery in India
Final Presentation General Medicine 03-08-2024.pptx
Pharma ospi slides which help in ospi learning
Abdominal Access Techniques with Prof. Dr. R K Mishra
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
2.FourierTransform-ShortQuestionswithAnswers.pdf
GDM (1) (1).pptx small presentation for students

Transfer learning in heterogeneous collaborative filtering domains