SlideShare a Scribd company logo
Temporal Diversity in Recommender Systems
  Neal Lathia1, Stephen Hailes1, Licia Capra1, Xavier Amatriain2
       1
         Dept. Computer Science, University College London
                  2
                    Telefonica Research, Barcelona

                    ACM SIGIR 2010, Geneva

                       n.lathia@cs.ucl.ac.uk
                  @neal_lathia, @xamat




             EU i-Tour Project
recommender systems

●   many examples over different web domains
●
    a lot of research: accuracy
●   multiple dimensions of usage that equate to user
    satisfaction
evaluating collaborative filtering over time

●   design a methodology to evaluate recommender systems
    that are iteratively updated; explore temporal dimension
    of filtering algorithms1




    1
    N. Lathia, S. Hailes, L. Capra. Temporal Collaborative Filtering with
    Adaptive Neighbourhoods. ACM SIGIR 2009, Boston, USA
temporal diversity

●   ...is not concerned with diversity of a single set of
    recommendations (e.g., are you recommended all six star
    wars movies at once?)
●    ...is concerned with the sequence of recommendations
    that users see (are you recommended the same items
    every week?)
contributions

●   is temporal recommendation diversity important?
●   how to measure temporal diversity and novelty?
●   how much temporal diversity do state-of-the-art CF
    algorithms provide?
●   how to improve temporal diversity?
is diversity important?
data perspective: growth & activity
Temporal Diversity in RecSys - SIGIR2010
demographics (in paper): ~104 respondents
procedure

●   claim: recommender system for “popular movies”
●   rate week 1's recommendations
     ●     movie titles, links to IMDB, DVD Covers
●   (click through buffer screen)
●   rate week 2's recommendations
●   (click through buffer screen)
●   ....
overview of the surveys
Survey 3: Random Movies

W1


W2


W3



W4



W5
Survey 3: Random Movies

W1


W2


W3



W4



W5
Survey 2: Popular Movies, Change Each Week

W1


W2


W3



W4



W5
Survey 2: Popular Movies, Change Each Week

W1


W2


W3



W4



W5
Survey 1: Popular Movies – No Change

W1


W2


W3



W4



W5
Closing Questions
Closing Questions

                    surprise, unrest, rude
                    compliments, “spot on”




                    74% important / very important
                    23% neutral




                    86% important / very important




                    95% important / very important
how did this affect the way people rated?
how did this affect the way people rated?
how did this affect the way people rated?




                                     S3 Random: Always Bad
how did this affect the way people rated?


                                     S2 Popular: Quite Good




                                     S3 Random: Always Bad
how did this affect the way people rated?


                                       S2 Popular: Quite Good
                                       S1 Starts off Quite Good




                                       S1 Ends off Bad
                                       S3 Random: Always Bad




                                 ...ANOVA details in paper...
is diversity important? (yes)
Temporal Diversity in RecSys - SIGIR2010
how to measure temporal diversity?
measuring temporal diversity




diversity = ?
measuring temporal diversity




diversity = 3/10
how much temporal diversity do state-of-the-art
CF algorithms provide?
3 algorithms – 3 influential factors


●   baseline – popularity ranking
●   item-based kNN
●   singular value decomposition


●   profile size vs. diversity
●   ratings added vs. diversity
●   time between sessions vs. diversity
profile size vs. diversity



   baseline              kNN   SVD
profile size vs. diversity



   baseline              kNN   SVD
main results


●   as profile size increases, diversity decreases
●   the more ratings added in the current session, the more
    diversity will be experienced in next session
●   more time between sessions leads to more diversity
consequences


●   want to avoid from having profiles that are too large
●   (conflict #1) want to encourage users to rate as much as
    possible
●   (conflict #2) want users to visit often, but diversity
    increases if they don't


●   how does this relate back to traditional evaluation metrics?
accuracy vs. diversity




more diverse
                                       kNN


                                       SVD
                                       baseline



                       more accurate
how to improve temporal diversity?
3 methods


●   temporal switching
●   temporal user-based switching
●   re-ranking frequent visitor's lists
temporal switching


●   “jump” between algorithms each week
temporal switching


●   “jump” between algorithms each week
re-ranking visitor's lists



  ●   (like we did in survey 2)
re-ranking visitor's lists


●   (like we did in survey 2, amazon did in 1998!)
contributions/summary

●   temporal diversity is important
●   defined (simple, extendable) metric to measure temporal
    recommendation diversity
●   analysed factors that influence diversity; most accurate
    algorithm is not the most diverse
●   hybrid-switching/re-ranking can improve diversity
Temporal Diversity in Recommender Systems
  Neal Lathia1, Stephen Hailes1, Licia Capra1, Xavier Amatriain2
       1
         Dept. Computer Science, University College London
                  2
                    Telefonica Research, Barcelona

                    ACM SIGIR 2010, Geneva

                      n.lathia@cs.ucl.ac.uk
               @neal_lathia, @xamat

            Support by:
            EU FP7 i-Tour
            Grant 234239

More Related Content

ODP
Smartphones and Health: The (Poo) Review
PDF
MobiSys Seminar - Nov 4 2008
PPT
Telefonica Lunch Seminar
PDF
IJCAI Workshop Presentation
PPT
Recsys Presentation
PPT
The Effect of Correlation Coefficients on Communities of Recommenders
PDF
Temporal Defenses for Robust Recommendations
PPTX
ICS2208 lecture7
Smartphones and Health: The (Poo) Review
MobiSys Seminar - Nov 4 2008
Telefonica Lunch Seminar
IJCAI Workshop Presentation
Recsys Presentation
The Effect of Correlation Coefficients on Communities of Recommenders
Temporal Defenses for Robust Recommendations
ICS2208 lecture7

Similar to Temporal Diversity in RecSys - SIGIR2010 (20)

PDF
Querylog-based Assessment of Retrievability Bias in a Large Newspaper Corpus
PPTX
Design of surveillance systems_UCSF_270225.pptx
PPTX
Recommendations and Discovery at StumbleUpon
PPTX
Recsys 2016 - Accuracy and Diversity in Cross-domain Recommendations for Cold...
PDF
Classification and Detection of Micro-Level Impact-CSCW2017 (Link: http://dl....
 
PPTX
Workshop on Designing Human-Centric MIR Systems
PDF
W4L1_11-667: LARGE LANGUAGE MODELS: METHODS AND APPLICATIONS - Human Evaluati...
PPTX
Opinion Dynamics of Skeptical Agents Read-Through
PDF
Recommandation systems -
PPTX
Research Methodology 4
PPT
sampling_design_good.ppt
PDF
Basic Statistical Concepts.pdf
PPTX
Srm group5 sec_a
PDF
Personal rankings of educational institutions
PDF
SIRUP - Serendipity in Recommendation through User Perception
PDF
What are the negative effects of social media?: fighting fake information
PDF
Smartphones as ubiquitous devices for behavior analysis and better lifestyle ...
PDF
Bmgt 311 chapter_13
PDF
Opinion Dynamics on Networks
PDF
IUI Step-hai 2025 workshop Keynote by Martijn Willemsen
Querylog-based Assessment of Retrievability Bias in a Large Newspaper Corpus
Design of surveillance systems_UCSF_270225.pptx
Recommendations and Discovery at StumbleUpon
Recsys 2016 - Accuracy and Diversity in Cross-domain Recommendations for Cold...
Classification and Detection of Micro-Level Impact-CSCW2017 (Link: http://dl....
 
Workshop on Designing Human-Centric MIR Systems
W4L1_11-667: LARGE LANGUAGE MODELS: METHODS AND APPLICATIONS - Human Evaluati...
Opinion Dynamics of Skeptical Agents Read-Through
Recommandation systems -
Research Methodology 4
sampling_design_good.ppt
Basic Statistical Concepts.pdf
Srm group5 sec_a
Personal rankings of educational institutions
SIRUP - Serendipity in Recommendation through User Perception
What are the negative effects of social media?: fighting fake information
Smartphones as ubiquitous devices for behavior analysis and better lifestyle ...
Bmgt 311 chapter_13
Opinion Dynamics on Networks
IUI Step-hai 2025 workshop Keynote by Martijn Willemsen
Ad

More from Neal Lathia (20)

PDF
Everything around the NLP (London.AI Feb 2021)
PDF
Using machine learning for customer service (Data Talks Club)
PDF
Using language models to supercharge Monzo’s customer support
PDF
Making Better Decisions Faster
PDF
Machine Learning, Faster
PDF
AI & Personalised Experiences
PPTX
Opportunities & Challenges in Personalised Travel
PDF
Bootstrapping a Destination Recommendation Engine
PPTX
Machine Learning for Product Managers
PDF
Mining Smartphone Data (with Python)
PDF
Happier and Healthier with Smartphone Data
PDF
Data Science in Digital Health
PDF
Using Smartphones to Measure (and Intervene in) Daily Life
PDF
Analysing Daily Behaviours with Large-Scale Smartphone Data
PDF
Cambridge Quantified Self Meetup
PDF
Data Science in #mHealth
PDF
Tube Star: Crowd-Sourced Experiences on Public Transport
PDF
Emotion Sense: From Design to Deployment
PDF
Opportunities and Challenges of Using Smartphones for Health Monitoring and I...
PDF
Using Smartphones to Research Daily Life
Everything around the NLP (London.AI Feb 2021)
Using machine learning for customer service (Data Talks Club)
Using language models to supercharge Monzo’s customer support
Making Better Decisions Faster
Machine Learning, Faster
AI & Personalised Experiences
Opportunities & Challenges in Personalised Travel
Bootstrapping a Destination Recommendation Engine
Machine Learning for Product Managers
Mining Smartphone Data (with Python)
Happier and Healthier with Smartphone Data
Data Science in Digital Health
Using Smartphones to Measure (and Intervene in) Daily Life
Analysing Daily Behaviours with Large-Scale Smartphone Data
Cambridge Quantified Self Meetup
Data Science in #mHealth
Tube Star: Crowd-Sourced Experiences on Public Transport
Emotion Sense: From Design to Deployment
Opportunities and Challenges of Using Smartphones for Health Monitoring and I...
Using Smartphones to Research Daily Life
Ad

Recently uploaded (20)

PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPT
Teaching material agriculture food technology
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Getting Started with Data Integration: FME Form 101
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PPTX
TLE Review Electricity (Electricity).pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Empathic Computing: Creating Shared Understanding
PPTX
Tartificialntelligence_presentation.pptx
PPTX
OMC Textile Division Presentation 2021.pptx
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
PPTX
cloud_computing_Infrastucture_as_cloud_p
Per capita expenditure prediction using model stacking based on satellite ima...
Teaching material agriculture food technology
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Reach Out and Touch Someone: Haptics and Empathic Computing
MIND Revenue Release Quarter 2 2025 Press Release
Programs and apps: productivity, graphics, security and other tools
Accuracy of neural networks in brain wave diagnosis of schizophrenia
Network Security Unit 5.pdf for BCA BBA.
Getting Started with Data Integration: FME Form 101
gpt5_lecture_notes_comprehensive_20250812015547.pdf
TLE Review Electricity (Electricity).pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
A comparative analysis of optical character recognition models for extracting...
Empathic Computing: Creating Shared Understanding
Tartificialntelligence_presentation.pptx
OMC Textile Division Presentation 2021.pptx
Advanced methodologies resolving dimensionality complications for autism neur...
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
cloud_computing_Infrastucture_as_cloud_p

Temporal Diversity in RecSys - SIGIR2010

  • 1. Temporal Diversity in Recommender Systems Neal Lathia1, Stephen Hailes1, Licia Capra1, Xavier Amatriain2 1 Dept. Computer Science, University College London 2 Telefonica Research, Barcelona ACM SIGIR 2010, Geneva n.lathia@cs.ucl.ac.uk @neal_lathia, @xamat EU i-Tour Project
  • 2. recommender systems ● many examples over different web domains ● a lot of research: accuracy ● multiple dimensions of usage that equate to user satisfaction
  • 3. evaluating collaborative filtering over time ● design a methodology to evaluate recommender systems that are iteratively updated; explore temporal dimension of filtering algorithms1 1 N. Lathia, S. Hailes, L. Capra. Temporal Collaborative Filtering with Adaptive Neighbourhoods. ACM SIGIR 2009, Boston, USA
  • 4. temporal diversity ● ...is not concerned with diversity of a single set of recommendations (e.g., are you recommended all six star wars movies at once?) ● ...is concerned with the sequence of recommendations that users see (are you recommended the same items every week?)
  • 5. contributions ● is temporal recommendation diversity important? ● how to measure temporal diversity and novelty? ● how much temporal diversity do state-of-the-art CF algorithms provide? ● how to improve temporal diversity?
  • 9. demographics (in paper): ~104 respondents
  • 10. procedure ● claim: recommender system for “popular movies” ● rate week 1's recommendations ● movie titles, links to IMDB, DVD Covers ● (click through buffer screen) ● rate week 2's recommendations ● (click through buffer screen) ● ....
  • 11. overview of the surveys
  • 12. Survey 3: Random Movies W1 W2 W3 W4 W5
  • 13. Survey 3: Random Movies W1 W2 W3 W4 W5
  • 14. Survey 2: Popular Movies, Change Each Week W1 W2 W3 W4 W5
  • 15. Survey 2: Popular Movies, Change Each Week W1 W2 W3 W4 W5
  • 16. Survey 1: Popular Movies – No Change W1 W2 W3 W4 W5
  • 18. Closing Questions surprise, unrest, rude compliments, “spot on” 74% important / very important 23% neutral 86% important / very important 95% important / very important
  • 19. how did this affect the way people rated?
  • 20. how did this affect the way people rated?
  • 21. how did this affect the way people rated? S3 Random: Always Bad
  • 22. how did this affect the way people rated? S2 Popular: Quite Good S3 Random: Always Bad
  • 23. how did this affect the way people rated? S2 Popular: Quite Good S1 Starts off Quite Good S1 Ends off Bad S3 Random: Always Bad ...ANOVA details in paper...
  • 26. how to measure temporal diversity?
  • 29. how much temporal diversity do state-of-the-art CF algorithms provide?
  • 30. 3 algorithms – 3 influential factors ● baseline – popularity ranking ● item-based kNN ● singular value decomposition ● profile size vs. diversity ● ratings added vs. diversity ● time between sessions vs. diversity
  • 31. profile size vs. diversity baseline kNN SVD
  • 32. profile size vs. diversity baseline kNN SVD
  • 33. main results ● as profile size increases, diversity decreases ● the more ratings added in the current session, the more diversity will be experienced in next session ● more time between sessions leads to more diversity
  • 34. consequences ● want to avoid from having profiles that are too large ● (conflict #1) want to encourage users to rate as much as possible ● (conflict #2) want users to visit often, but diversity increases if they don't ● how does this relate back to traditional evaluation metrics?
  • 35. accuracy vs. diversity more diverse kNN SVD baseline more accurate
  • 36. how to improve temporal diversity?
  • 37. 3 methods ● temporal switching ● temporal user-based switching ● re-ranking frequent visitor's lists
  • 38. temporal switching ● “jump” between algorithms each week
  • 39. temporal switching ● “jump” between algorithms each week
  • 40. re-ranking visitor's lists ● (like we did in survey 2)
  • 41. re-ranking visitor's lists ● (like we did in survey 2, amazon did in 1998!)
  • 42. contributions/summary ● temporal diversity is important ● defined (simple, extendable) metric to measure temporal recommendation diversity ● analysed factors that influence diversity; most accurate algorithm is not the most diverse ● hybrid-switching/re-ranking can improve diversity
  • 43. Temporal Diversity in Recommender Systems Neal Lathia1, Stephen Hailes1, Licia Capra1, Xavier Amatriain2 1 Dept. Computer Science, University College London 2 Telefonica Research, Barcelona ACM SIGIR 2010, Geneva n.lathia@cs.ucl.ac.uk @neal_lathia, @xamat Support by: EU FP7 i-Tour Grant 234239