Jure Leskovec (@jure)
Pinterest and Stanford
1Jure Leskove, Pinterest & Stanford University
2
Recommendations
drive whole businesses!
Jure Leskove, Pinterest & Stanford University
People and Items
3
+
–
+
+
+
–
People Items
Fundamental problem:
Making items discoverable!Jure Leskove, Pinterest & Stanford University
Understanding Products
To make relevant recommendations
we need to understand the products
and how they fit together
Discovering relationships
between products
4Jure Leskove, Pinterest & Stanford University
Product Graph
Ingest product catalogs:
 10s of millions of products
 100s of millions of descriptions, reviews
Infer product networks with multiple
types of directed relationships:
 Input:
Data about items (products)
 Output:
Network with multiple types of relationships
5Jure Leskove, Pinterest & Stanford University
Product Graph: Relations
6
Substitutes:
Purchase
instead
Complements:
Purchase
in addition
Jure Leskove, Pinterest & Stanford University
Product Graph: Description
7
: cleaner; quieter
: cheaper; high power
: well made, easy to install
: fits perfectly, great value
Jure Leskove, Pinterest & Stanford University
Product Graph: Overview
8
substitute
complement
Jure Leskove, Pinterest & Stanford University
Product Graph:What it does?
1. Understands the notions of
substitute and complement goods
is substitutable for
complements
9Jure Leskove, Pinterest & Stanford University
Product Graph:What it does?
2. Generates explanations
of why certain products are
preferred
“Good quality, soft, light
weight, the colors are
beautiful and exactly like
the picture!”
People prefer this
because:
10Jure Leskove, Pinterest & Stanford University
Product Graph:What it does?
3. Discovers micro-categories
of products
Small clusters of tightly related products:
11Jure Leskove, Pinterest & Stanford University
Product Graph:What it does?
4. Recommends
baskets of related products
Query: Suggested outfit:
Query: Suggested outfit:
12Jure Leskove, Pinterest & Stanford University
Product Graph: Overview
Building networks from products
Modeling: Can we use product data
to model product relationships?
Understanding: Can we explain
why people prefer certain products
over others?
13Jure Leskove, Pinterest & Stanford University
Problem Setting
Binary prediction task:
Given a pair of products, x and y, predict
whether they are related
(substitute/complementary)
Goal: Build a probabilistic model
that encodes
14Jure Leskove, Pinterest & Stanford University
Problem Setting
How to learn
from data
Train by maximum likelihood:
15
XComplementary
Not
Complementary
Jure Leskove, Pinterest & Stanford University
Approach
Products are described by their properties:
 Review text, Product description,
Brand, Price, …
[0.3, 0, 0, 0.3, 0, 0, 0.2, 0, 0, 0.1]
[0.1, 0, 0, 0, 0.2, 0, 0, 0.1, 0, 0.2]
Challenges:
 How do we discover right features?
 How do we explain relationships?
 How do we identify micro-categories?
16
Shoes Female
Jure Leskove, Pinterest & Stanford University
Our Solution: SCEPTRE
Link Prediction Review “topics”
Discover topics that “explain” product relations
17
Learn to discover topics that
explain the product graph
Jure Leskove, Pinterest & Stanford University
Challenges: Relation Direction
why do people who view
X eventually buy Y?
Relationships we want to learn
are not symmetric
18
Relationships: Explained by product “properties”
“baby, pajamas, pants, colorful”
Directedness: Subjective/qualitative language
“true size, fits well, items are the same color as on the picture”
Jure Leskove, Pinterest & Stanford University
Challenges: Multiple Relations
19
We want to learn multiple
relationships simultaneously
Solution: Learn multiple regressors (one for each
graph), that operate on a single set of topics
Jure Leskove, Pinterest & Stanford University
Challenges: Micro-Categories
20
Model discovers thousands of topics
but no micro-categories
Solution: Product hierarchy
Laptop charger specific topics
are only active for chargers.
These are micro-categories.
Topics at the top are common to all
electronics products, and will contain
generic electronics language
Associate each node in the category
tree with a small number of topics:
Jure Leskove, Pinterest & Stanford University
Building the Graph
C++ implementation that runs
on a single (large-memory) machine
 OpenMP to parallelize computations
Experimental results:
Active part of the Amazon catalog
 10m products
 150m reviews
 250m relationships
21Jure Leskove, Pinterest & Stanford University
Example: Product Graph
22Jure Leskove, Pinterest & Stanford University
Example: Product Graph
23Jure Leskove, Pinterest & Stanford University
Edge Prediction Accuracy
24
Substitute Complement
Men’s
Clothing
96.7% 94.1%
Women’s
Clothing
95.9% 94.1%
Books 93.8% 89.9%
Electronics 95.7% 88.8%
Movies 85.6% -
Music 90.4% -
OVERALL 94.83% 90.23%
Jure Leskove, Pinterest & Stanford University
Results: Micro-Categories
25Jure Leskove, Pinterest & Stanford University
27
How does all this fit
into Pinterest?
Jure Leskove, Pinterest & Stanford University
Connecting People & Objects
28Jure Leskove, Pinterest & Stanford University
Pins: Richly Annotated Objects
29Jure Leskove, Pinterest & Stanford University
Pins are Collected in Boards
30Jure Leskove, Pinterest & Stanford University
30+ Billion Pins
categorized by people into more than
750 Million Boards
50% of pins have been created
in the last 6 months
31
32
Discovering relationships
between objectsJure Leskove, Pinterest & Stanford University
We are hiring!
33
jure@pinterest.com
References
 Inferring Networks of Substitutable and Complementary
Products by J. McAuley, R. Pandey, J. Leskovec. ACM SIGKDD
International Conference on Knowledge Discovery and Data
Mining (KDD), 2015.
 Hidden Factors and Hidden Topics: Understanding Rating
Dimensions with Review Text by J. McAuley, J. Leskovec. ACM
Conference on Recommender Systems (RecSys), 2013.
 Learning Attitudes and Attributes from Multi-Aspect
Reviews by J. McAuley, J. Leskovec, D. Jurafsky. IEEE
International Conference On Data Mining (ICDM), 2012.
34Jure Leskove, Pinterest & Stanford University

More Related Content

PPTX
The extended project qualification 2
PPTX
Overview of Data Science and AI
PDF
Trustworthy Recommender Systems
PPTX
Westport & Fuel Systems Solution Merger Proposal
PPT
Gamification strategies
PDF
[系列活動] 人工智慧與機器學習在推薦系統上的應用
PDF
“Electronic Shopping Website with Recommendation System”
PDF
Recsys 2016
The extended project qualification 2
Overview of Data Science and AI
Trustworthy Recommender Systems
Westport & Fuel Systems Solution Merger Proposal
Gamification strategies
[系列活動] 人工智慧與機器學習在推薦系統上的應用
“Electronic Shopping Website with Recommendation System”
Recsys 2016

Similar to Inferring networks of substitute and complementary products (20)

PPTX
Connecting social media to e commerce (2)
PDF
Product Comparison Website using Web scraping and Machine learning.
PDF
Recommender systems
PDF
Building a Recommender systems by Vivek Murugesan - Technical Architect at Cr...
PPTX
The Universal Recommender
PDF
AI in Entertainment – Movie Recommendation System
PDF
IRJET- A New Approach to Product Recommendation Systems
PDF
IRJET- A New Approach to Product Recommendation Systems
PDF
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
PDF
Deep Learning for Semantic Search in E-commerce​
PPTX
Recommended System.pptx
PPT
Content based recommendation systems
PPTX
Lecture Notes on Recommender System Introduction
PDF
Recommender Systems Content and Collaborative Filtering
PDF
Frequently Bought Together Recommendations Based on Embeddings
PDF
Modern Perspectives on Recommender Systems and their Applications in Mendeley
PPTX
Personalised Recommendations in E-Commerce
PDF
Designing recommender system for your application
PDF
Embeddings! embeddings everywhere!
PDF
Real-Time Recommendations with Hopsworks and OpenSearch - MLOps World 2022
Connecting social media to e commerce (2)
Product Comparison Website using Web scraping and Machine learning.
Recommender systems
Building a Recommender systems by Vivek Murugesan - Technical Architect at Cr...
The Universal Recommender
AI in Entertainment – Movie Recommendation System
IRJET- A New Approach to Product Recommendation Systems
IRJET- A New Approach to Product Recommendation Systems
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
Deep Learning for Semantic Search in E-commerce​
Recommended System.pptx
Content based recommendation systems
Lecture Notes on Recommender System Introduction
Recommender Systems Content and Collaborative Filtering
Frequently Bought Together Recommendations Based on Embeddings
Modern Perspectives on Recommender Systems and their Applications in Mendeley
Personalised Recommendations in E-Commerce
Designing recommender system for your application
Embeddings! embeddings everywhere!
Real-Time Recommendations with Hopsworks and OpenSearch - MLOps World 2022
Ad

More from Turi, Inc. (20)

PPTX
Webinar - Analyzing Video
PDF
Webinar - Patient Readmission Risk
PPTX
Webinar - Know Your Customer - Arya (20160526)
PPTX
Webinar - Product Matching - Palombo (20160428)
PPTX
Webinar - Pattern Mining Log Data - Vega (20160426)
PPTX
Webinar - Fraud Detection - Palombo (20160428)
PPTX
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets
PDF
Pattern Mining: Extracting Value from Log Data
PPTX
Intelligent Applications with Machine Learning Toolkits
PPTX
Text Analysis with Machine Learning
PPTX
Machine Learning with GraphLab Create
PPTX
Machine Learning in Production with Dato Predictive Services
PPTX
Machine Learning in 2016: Live Q&A with Carlos Guestrin
PDF
Scalable data structures for data science
PPTX
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
PDF
Introduction to Recommender Systems
PDF
Machine learning in production
PPTX
Overview of Machine Learning and Feature Engineering
PPTX
SFrame
PPT
Building Personalized Data Products with Dato
Webinar - Analyzing Video
Webinar - Patient Readmission Risk
Webinar - Know Your Customer - Arya (20160526)
Webinar - Product Matching - Palombo (20160428)
Webinar - Pattern Mining Log Data - Vega (20160426)
Webinar - Fraud Detection - Palombo (20160428)
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets
Pattern Mining: Extracting Value from Log Data
Intelligent Applications with Machine Learning Toolkits
Text Analysis with Machine Learning
Machine Learning with GraphLab Create
Machine Learning in Production with Dato Predictive Services
Machine Learning in 2016: Live Q&A with Carlos Guestrin
Scalable data structures for data science
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
Introduction to Recommender Systems
Machine learning in production
Overview of Machine Learning and Feature Engineering
SFrame
Building Personalized Data Products with Dato
Ad

Recently uploaded (20)

PDF
The influence of sentiment analysis in enhancing early warning system model f...
PDF
Improvisation in detection of pomegranate leaf disease using transfer learni...
PPTX
AI IN MARKETING- PRESENTED BY ANWAR KABIR 1st June 2025.pptx
PDF
Comparative analysis of machine learning models for fake news detection in so...
PPTX
Modernising the Digital Integration Hub
PPT
Geologic Time for studying geology for geologist
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
A proposed approach for plagiarism detection in Myanmar Unicode text
PDF
1 - Historical Antecedents, Social Consideration.pdf
PPT
Module 1.ppt Iot fundamentals and Architecture
PDF
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
PDF
“A New Era of 3D Sensing: Transforming Industries and Creating Opportunities,...
PDF
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
PDF
Getting started with AI Agents and Multi-Agent Systems
PPTX
Configure Apache Mutual Authentication
PDF
Architecture types and enterprise applications.pdf
PPTX
TEXTILE technology diploma scope and career opportunities
PDF
sustainability-14-14877-v2.pddhzftheheeeee
PDF
Enhancing plagiarism detection using data pre-processing and machine learning...
PDF
A contest of sentiment analysis: k-nearest neighbor versus neural network
The influence of sentiment analysis in enhancing early warning system model f...
Improvisation in detection of pomegranate leaf disease using transfer learni...
AI IN MARKETING- PRESENTED BY ANWAR KABIR 1st June 2025.pptx
Comparative analysis of machine learning models for fake news detection in so...
Modernising the Digital Integration Hub
Geologic Time for studying geology for geologist
Zenith AI: Advanced Artificial Intelligence
A proposed approach for plagiarism detection in Myanmar Unicode text
1 - Historical Antecedents, Social Consideration.pdf
Module 1.ppt Iot fundamentals and Architecture
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
“A New Era of 3D Sensing: Transforming Industries and Creating Opportunities,...
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
Getting started with AI Agents and Multi-Agent Systems
Configure Apache Mutual Authentication
Architecture types and enterprise applications.pdf
TEXTILE technology diploma scope and career opportunities
sustainability-14-14877-v2.pddhzftheheeeee
Enhancing plagiarism detection using data pre-processing and machine learning...
A contest of sentiment analysis: k-nearest neighbor versus neural network

Inferring networks of substitute and complementary products

  • 1. Jure Leskovec (@jure) Pinterest and Stanford 1Jure Leskove, Pinterest & Stanford University
  • 2. 2 Recommendations drive whole businesses! Jure Leskove, Pinterest & Stanford University
  • 3. People and Items 3 + – + + + – People Items Fundamental problem: Making items discoverable!Jure Leskove, Pinterest & Stanford University
  • 4. Understanding Products To make relevant recommendations we need to understand the products and how they fit together Discovering relationships between products 4Jure Leskove, Pinterest & Stanford University
  • 5. Product Graph Ingest product catalogs:  10s of millions of products  100s of millions of descriptions, reviews Infer product networks with multiple types of directed relationships:  Input: Data about items (products)  Output: Network with multiple types of relationships 5Jure Leskove, Pinterest & Stanford University
  • 6. Product Graph: Relations 6 Substitutes: Purchase instead Complements: Purchase in addition Jure Leskove, Pinterest & Stanford University
  • 7. Product Graph: Description 7 : cleaner; quieter : cheaper; high power : well made, easy to install : fits perfectly, great value Jure Leskove, Pinterest & Stanford University
  • 8. Product Graph: Overview 8 substitute complement Jure Leskove, Pinterest & Stanford University
  • 9. Product Graph:What it does? 1. Understands the notions of substitute and complement goods is substitutable for complements 9Jure Leskove, Pinterest & Stanford University
  • 10. Product Graph:What it does? 2. Generates explanations of why certain products are preferred “Good quality, soft, light weight, the colors are beautiful and exactly like the picture!” People prefer this because: 10Jure Leskove, Pinterest & Stanford University
  • 11. Product Graph:What it does? 3. Discovers micro-categories of products Small clusters of tightly related products: 11Jure Leskove, Pinterest & Stanford University
  • 12. Product Graph:What it does? 4. Recommends baskets of related products Query: Suggested outfit: Query: Suggested outfit: 12Jure Leskove, Pinterest & Stanford University
  • 13. Product Graph: Overview Building networks from products Modeling: Can we use product data to model product relationships? Understanding: Can we explain why people prefer certain products over others? 13Jure Leskove, Pinterest & Stanford University
  • 14. Problem Setting Binary prediction task: Given a pair of products, x and y, predict whether they are related (substitute/complementary) Goal: Build a probabilistic model that encodes 14Jure Leskove, Pinterest & Stanford University
  • 15. Problem Setting How to learn from data Train by maximum likelihood: 15 XComplementary Not Complementary Jure Leskove, Pinterest & Stanford University
  • 16. Approach Products are described by their properties:  Review text, Product description, Brand, Price, … [0.3, 0, 0, 0.3, 0, 0, 0.2, 0, 0, 0.1] [0.1, 0, 0, 0, 0.2, 0, 0, 0.1, 0, 0.2] Challenges:  How do we discover right features?  How do we explain relationships?  How do we identify micro-categories? 16 Shoes Female Jure Leskove, Pinterest & Stanford University
  • 17. Our Solution: SCEPTRE Link Prediction Review “topics” Discover topics that “explain” product relations 17 Learn to discover topics that explain the product graph Jure Leskove, Pinterest & Stanford University
  • 18. Challenges: Relation Direction why do people who view X eventually buy Y? Relationships we want to learn are not symmetric 18 Relationships: Explained by product “properties” “baby, pajamas, pants, colorful” Directedness: Subjective/qualitative language “true size, fits well, items are the same color as on the picture” Jure Leskove, Pinterest & Stanford University
  • 19. Challenges: Multiple Relations 19 We want to learn multiple relationships simultaneously Solution: Learn multiple regressors (one for each graph), that operate on a single set of topics Jure Leskove, Pinterest & Stanford University
  • 20. Challenges: Micro-Categories 20 Model discovers thousands of topics but no micro-categories Solution: Product hierarchy Laptop charger specific topics are only active for chargers. These are micro-categories. Topics at the top are common to all electronics products, and will contain generic electronics language Associate each node in the category tree with a small number of topics: Jure Leskove, Pinterest & Stanford University
  • 21. Building the Graph C++ implementation that runs on a single (large-memory) machine  OpenMP to parallelize computations Experimental results: Active part of the Amazon catalog  10m products  150m reviews  250m relationships 21Jure Leskove, Pinterest & Stanford University
  • 22. Example: Product Graph 22Jure Leskove, Pinterest & Stanford University
  • 23. Example: Product Graph 23Jure Leskove, Pinterest & Stanford University
  • 24. Edge Prediction Accuracy 24 Substitute Complement Men’s Clothing 96.7% 94.1% Women’s Clothing 95.9% 94.1% Books 93.8% 89.9% Electronics 95.7% 88.8% Movies 85.6% - Music 90.4% - OVERALL 94.83% 90.23% Jure Leskove, Pinterest & Stanford University
  • 25. Results: Micro-Categories 25Jure Leskove, Pinterest & Stanford University
  • 26. 27 How does all this fit into Pinterest? Jure Leskove, Pinterest & Stanford University
  • 27. Connecting People & Objects 28Jure Leskove, Pinterest & Stanford University
  • 28. Pins: Richly Annotated Objects 29Jure Leskove, Pinterest & Stanford University
  • 29. Pins are Collected in Boards 30Jure Leskove, Pinterest & Stanford University
  • 30. 30+ Billion Pins categorized by people into more than 750 Million Boards 50% of pins have been created in the last 6 months 31
  • 31. 32 Discovering relationships between objectsJure Leskove, Pinterest & Stanford University
  • 33. References  Inferring Networks of Substitutable and Complementary Products by J. McAuley, R. Pandey, J. Leskovec. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2015.  Hidden Factors and Hidden Topics: Understanding Rating Dimensions with Review Text by J. McAuley, J. Leskovec. ACM Conference on Recommender Systems (RecSys), 2013.  Learning Attitudes and Attributes from Multi-Aspect Reviews by J. McAuley, J. Leskovec, D. Jurafsky. IEEE International Conference On Data Mining (ICDM), 2012. 34Jure Leskove, Pinterest & Stanford University