Embeddings for
Recommendation Systems
Keynote Speech
Sihem Romdhani – Data Scientist
People watch one billion hours of YouTube every
day
YouTube’s recommendations drive 70% of what we watch
=
700.000.000 hours of video / day
YouTube generates 6% of Google’s ad sales revenue.
Digital Marketing and Recommender Systems
Individualized
customer
experience
Higher
Traffic &
Increased
revenue
Product
Bundling
Engaging
Shoppers
Real-Time
Recommendations
AI-based
Recommender
Systems
Machine Learning Models for
predicting product preference.
Recommender Systems help us to
manage big amounts of customer
data, and to extract preferences on
the individual customers‘ level.
AI-based Recommender Systems
Approache
s
Collaborative filtering
Content-based filtering
Embeddings
Embeddings for
Recommendation
Systems
Background On Embeddings in
NLP
Word Embeddings
 Feature Vector representations of words
(i.e. representing text as numbers)
 Capture semantic and syntactic
relationships.
Word Embeddings
 Feature Vector representations of words
(i.e. representing text as numbers)
 Capture semantic and syntactic
relationships.
0.1 0.67 - --
0.5 0.8 - --
0.01 0.9 - --
king
man
woma
n
queen 0.6 0.19 - --
Word embedding
(e.g. word2vec, GloVe)
Similarity( 0.1 0.67 - -- 0.5 0.8 - --
)=0.8
king man
,
Word2vec and Skipgram for Learning Word
Embeddings
0.8 0.01 … 0.89Marketing
Sale
School
art
…
…
….
advertising
Embedding Size
Embeddings
“You shall know a word by the company it keeps” J.R. Firth
Random Initialization1
Word2vec and Skipgram for Learning Word
Embeddings
Untrained Model
Task:
Are the two words neighbors?
0.8 0.01 … 0.89Marketing
Sale
School
art
…
…
….
advertising
Embedding Size
Embeddings
“You shall know a word by the company it keeps” J.R. Firth
Random Initialization1 Look up
embeddings2
Marketing
school
Train the model3 Model
Prediction
4
0.45
0.55
Yes
No
Word2vec and Skipgram for Learning Word
Embeddings
Untrained Model
Task:
Are the two words neighbors?
0.8 0.01 … 0.89Marketing
Sale
School
art
…
…
….
advertising
Embedding Size
Embeddings
“You shall know a word by the company it keeps” J.R. Firth
Random Initialization1 Look up
embeddings2
Marketing
school
Train the model3 Model
Prediction
4
0.45
0.55
Yes
No
Actual Target &
Error Estimation
5
0
1
Yes
No
Update Model Parameters
Word2vec and Skipgram for Learning Word
Embeddings
Marketing is the study and management of exchange relationships.
Source: Wikipedia
Sliding window across running text
context
Input word Output word Target
is marketing 1
is the 1
Dataset
Word2vec and Skipgram for Learning Word
Embeddings
Marketing is the study and management of exchange relationships.
Source: Wikipedia
Sliding window across running text
context
Input word Output word Target
is marketing 1
is the 1
the is 1
the study 1
Dataset
Word2vec and Skipgram for Learning Word
Embeddings
Dataset
Marketing is the study and management of exchange relationships.
Source: Wikipedia
Input word Output word Target
is marketing 1
is school 0
is the 1
is art 0
the is 1
the cat 0
the study 1
Negative Sampling
Embeddings for recommendation systems
Applying
word2vec to
Recommenders
and Advertising
Researchers from the Web Search, E-
commerce and Marketplace domains
have realized that just like one can
train word embeddings by treating a
sequence of words in a sentence as
context, the same can be done for
training embeddings of user
actions by treating sequence of
user actions as context.
Applying word2vec to Recommenders and Advertising
Mihajlo Grbovic, Airbnb
 User activity around an item encodes many
abstract qualities of that item which are difficult to
capture by more direct means (e.g. How do you encode
qualities like “architecture, style and feel” of an Airbnb
listing?
 The word2vec approach has proven successful in
extracting these hidden insights.
 Being able to compare, search, and categorize items
on these abstract dimensions opens up a lot of
opportunities for smarter, better recommendations.
Applying
word2vec to
Recommenders
and Advertising
Airbnb Use Case
Airbnb Recommender Systems
 Airbnb is a marketplace that contains millions of diverse home listings
Potential guests explore the listings through search results generated from a sophisticated Machine Learning models.
Similar Listing Carousel shows listing recommendations related to the current viewed listing.
 Initially, search rankings were determined by a set of hard-coded rules based on very
basic signals
The rules were applied to every guest uniformly, rather than taking into account the unique values that could create the kind of
a personalized experience that keeps guests coming back.
 Airbnb uses machine learning to offer personalized search experience.
Listing
Embeddings in
Search Ranking
Airbnb Recommender Systems
 The listing embeddings are vector representations of Airbnb homes learned
from search sessions.
 They effectively encode many listing features, such as location, price, listing
type, architecture and listing style, all using only 32 float numbers.
 Measure similarities: Similar listings lie nearby in the embedding space.
 Improving Similar Listing Recommendations and Real-Time Personalization
in Search Ranking.
Site
Search
Airbnb Listing Embeddings
Airbnb
Homepage
Site
Search
Airbnb Listing Embeddings
Listing
#111
Airbnb
Homepage
Site
Search
Airbnb Listing Embeddings
Listing
#111
Listing
#2000
Airbnb
Homepage
Airbnb
Homepage
Site
Search
Airbnb Listing Embeddings
Listing
#111
Listing
#2000
Listing
#415
Site
Search
Click sessions
Listing
#100
Listing
#200
Listing
#300
Listing
#400
Listing
#500
Listing
#600
Airbnb Listing Embeddings
Listing
#111
Listing
#2900
Listing
#415
Listing
#100
Listing
#200
Listing
#300
Listing
#400
Listing
#500
Listing
#600
Listing
#700
Listing
#800
Listing
#900
Listing
#1000
Airbnb Listing Embeddings
Listing
#100
Listing
#200
Listing
#300
Listing
#400
Listing
#500
Listing
#600
Listing
#700
….
Airbnb Listing Embeddings
Listing
#100
Listing
#200
Listing
#300
Listing
#400
Listing
#500
Listing
#600
Listing
#700
….
Input listing Output listing class
#200 #100 1
#200 #300 1
#300 #200 1
#300 #400 1
… … …
Airbnb Listing Embeddings
Listing
#100
Listing
#200
Listing
#300
Listing
#400
Listing
#500
Listing
#600
Listing
#700
….
Input listing Output listing class
#200 #100 1
#200 #300 1
#300 #200 1
#300 #400 1
… … …
Airbnb Listing Embeddings
Listing
#100
Listing
#200
Listing
#300
Listing
#400
Listing
#500
Listing
#600
Listing
#700
….
Input listing Output listing class
#200 #100 1
#200 #71 0
#200 #300 1
#200 #417 0
#300 #200 1
#300 #33 0
#300 #400 1
0.06 0.1 … 0.47Listing #1
Listing #2
Listing #3
Listing #4
Listing #5
Listing #6
….
Listing #1000
Vector of 34 floating numbers
Listing Embeddings
Airbnb Listing Embeddings
0.06 0.1 … 0.47Listing #1
Listing #2
Listing #3
Listing #4
Listing #5
Listing #6
….
Listing #1000
Listing #3
Most Similar Listings
Listing #72
Listing #2006
Listing #1345
Listing #491
Listing #304
Vector of 34 floating numbers
Listing Embeddings
recommend
Click
Airbnb Listing Embeddings
New
Labeled
Data
Most Similar Listings
Listing #72
Listing #2006
Listing #1345
Listing #491
Listing #304
After recommending
Clicked
Clicked
Not clicked
Listing #72
Listing #2006
Listing #1345
Recommended
Input listing Output listing label
#3 #1345 0
Improving
Recommendations
Listing #3
Click
Improving
Recommendations
booked
Input listing Output listing label
#200 #100 1
#200 #300 1
#200 #1000 1
#300 #1000 1
Click sessions
Listing
#100
Listing
#200
Listing
#300
Listing
#400
Listing
#500
Listing
#600
Listing
#111
Listing
#2900
Listing
#415
Listing
#100
Listing
#200
Listing
#300
Listing
#400
Listing
#500
Listing
#600
Listing
#700
Listing
#800
Listing
#900
Listing
#1000
New
Labeled
Data
Embeddings for recommendation systems
Challenges
ML systems are not good at
generalizing when the underlying
data distribution changes.
What is needed to
build a ML product
What’s needed to
build a supervised
learning model
Robustness in AI
Covid-19 Pandemic… Sudden change in
Consumers' behavior
New behaviors derail
 old recommender
engines,
 AI inventory trackers,
 and fraud detection
systems
Building Robust Recommender Systems
The AI community needs to implement new
approaches and processes for post-deployment
monitoring like:
 building an alert system to flag changes,
 use human-in-the-loop deployments to
acquire new labels,
 assemble a robust MLOps team.
Valuable product
resilient to conditions
change
Toward Ethical AI…
Recommender Systems Vs. Data Privacy
How to build recommender systems without
compromising data privacy?
Building the
foundation for a
responsible
and innovative
Data
Economy
 Secure computation (e.g. cryptography).
 Secure hardware usage.
 Transparency and Audit.
 ML Models that can learn from small datasets
@ICDEc
sihemromdhani
@Sihem_Romdhani
ICDEc Virtual Conference 2020
romdhani.sihem@gmail.com
Sihem Romdhani
Thank you

More Related Content

PPTX
Embeddings for Recommendation Systems
PPT
SEO & SEM
PDF
Alltoit seo packages
PPT
Keyword Research In A (Not Provided) World
PPSX
Bid4Keywords National Listings
PPTX
Proving the Value of SEO
PPTX
Rule 2014 with Remarketing and RLSA
PPTX
SERP: All you need to know about #SERP
Embeddings for Recommendation Systems
SEO & SEM
Alltoit seo packages
Keyword Research In A (Not Provided) World
Bid4Keywords National Listings
Proving the Value of SEO
Rule 2014 with Remarketing and RLSA
SERP: All you need to know about #SERP

Similar to Embeddings for recommendation systems (20)

PDF
Frequently Bought Together Recommendations Based on Embeddings
PDF
Learned Embeddings for Search and Discovery at Instacart
PDF
[Airbnb Embedding] Real-time Personalization using Embeddings for Search Rank...
PDF
Recsys 2016
PDF
Power of Visualizing Embeddings
PDF
Embeddings! embeddings everywhere!
PDF
Past, present, and future of Recommender Systems: an industry perspective
PDF
Real-time personalized recommendations using product embeddings
PPTX
Search-Based Serving Architecture of Embeddings-Based Recommendations (RecSys...
PPTX
Neural Models for Information Retrieval
PDF
Machine learning advanced applications
PDF
What is Embedding in Machine Learning.pdf
PDF
solulab.com-What are Embedding in Machine Learning.pdf
PDF
Recommender Systems @ Scale - PyData 2019
PPTX
Neural Models for Information Retrieval
PDF
Real-Time Recommendations with Hopsworks and OpenSearch - MLOps World 2022
PDF
When Deep Learning Meets Recommender System
PDF
Vector search and multimodal embeddings in BigQuery
PPTX
Vectorland: Brief Notes from Using Text Embeddings for Search
PDF
Recommender Systems @ Scale, Big Data Europe Conference 2019
Frequently Bought Together Recommendations Based on Embeddings
Learned Embeddings for Search and Discovery at Instacart
[Airbnb Embedding] Real-time Personalization using Embeddings for Search Rank...
Recsys 2016
Power of Visualizing Embeddings
Embeddings! embeddings everywhere!
Past, present, and future of Recommender Systems: an industry perspective
Real-time personalized recommendations using product embeddings
Search-Based Serving Architecture of Embeddings-Based Recommendations (RecSys...
Neural Models for Information Retrieval
Machine learning advanced applications
What is Embedding in Machine Learning.pdf
solulab.com-What are Embedding in Machine Learning.pdf
Recommender Systems @ Scale - PyData 2019
Neural Models for Information Retrieval
Real-Time Recommendations with Hopsworks and OpenSearch - MLOps World 2022
When Deep Learning Meets Recommender System
Vector search and multimodal embeddings in BigQuery
Vectorland: Brief Notes from Using Text Embeddings for Search
Recommender Systems @ Scale, Big Data Europe Conference 2019
Ad

More from ICDEcCnferenece (20)

PDF
Total Experience. How to create user-focused digital products that accelerate...
PDF
Two faces of the same coin: Exploring the multilateral perspective of informa...
PDF
Spreading the Word on the Digital Economy
PDF
Readiness of Russian companies for digital transformation: what's changed?
PDF
Possibilities and limitations of the Croatian police in communication via soc...
PDF
Changes in Global Virtual Team Conflict Over Time: The Role of Openness to Li...
PDF
Cause-related marketing: towards an exploration of the factors favoring the p...
PDF
Relationship between culture and user's behavior in the context of informatio...
PDF
Examining the Impact of Digitalization on Unemployment rate During COVID-19 P...
PDF
Success Factors for the Use of Open Educational Resources - A Quantitative Su...
PDF
A Maturity Model for Open Educational Resources in Higher Education Instituti...
PDF
AI-based Business Models in Healthcare: An Empirical Study of Clinical Decisi...
PDF
Towards a better digital transformation: learning from the experience of a di...
PDF
Digital Gender Gap in EU-27 ICT Employment During COVID-19 Impact
PDF
Consumer Satisfaction using fitness technology innovation
PDF
Implementing an agile change process to improve digital transformation in hig...
PDF
What leads customer to create and participate in anti-brand community: a netn...
PDF
Closing session: awards for best papers and reviewers
PDF
Transition to Tertiary Education and eLearning in Lebanon against the backdro...
PDF
Internet of Things healthcare system for reducing economic burden
Total Experience. How to create user-focused digital products that accelerate...
Two faces of the same coin: Exploring the multilateral perspective of informa...
Spreading the Word on the Digital Economy
Readiness of Russian companies for digital transformation: what's changed?
Possibilities and limitations of the Croatian police in communication via soc...
Changes in Global Virtual Team Conflict Over Time: The Role of Openness to Li...
Cause-related marketing: towards an exploration of the factors favoring the p...
Relationship between culture and user's behavior in the context of informatio...
Examining the Impact of Digitalization on Unemployment rate During COVID-19 P...
Success Factors for the Use of Open Educational Resources - A Quantitative Su...
A Maturity Model for Open Educational Resources in Higher Education Instituti...
AI-based Business Models in Healthcare: An Empirical Study of Clinical Decisi...
Towards a better digital transformation: learning from the experience of a di...
Digital Gender Gap in EU-27 ICT Employment During COVID-19 Impact
Consumer Satisfaction using fitness technology innovation
Implementing an agile change process to improve digital transformation in hig...
What leads customer to create and participate in anti-brand community: a netn...
Closing session: awards for best papers and reviewers
Transition to Tertiary Education and eLearning in Lebanon against the backdro...
Internet of Things healthcare system for reducing economic burden
Ad

Recently uploaded (20)

PPTX
WEEK-3_TOPIC_Photographic_Rays__Its_Nature_and_Characteristics.pptx
PPTX
QA PROCESS FLOW CHART (1).pptxbbbbbbbbbnnnn
PPTX
Nationalism in India Ch-2.pptx ssssss classs 10
PPTX
Cloud Computing ppt.ppt1QU4FFIWEKWEIFRRGx
PPTX
400kV_Switchyardasdsfesfewffwefrrwewew_Training_Module.pptx
PDF
Annah la Javanaise_ The Truth Behind Gauguin’s Model.pdf
PPTX
IOT Unit 6 PPT ( ~ By Prof. Simran Ahuja ).pptx
PDF
Celebrate Krishna Janmashtami 2025 | Cottage9
PPTX
SUBANEN DANCE DUMENDINGAN DANCE LITERATURE
PPTX
Imelda Ppt Improved 1 March 2021 (1) (3) (1).pptx
PPTX
QA PROCESS FLOW CHART (1).pptxaaaaaaaaaaaa
PPTX
Chemical Reactions in Our Lives.pptxyyyyyyyyy
PPT
huyfuygkgkugi iyugib jiygi uyuyguygv uyguyv
PDF
Music-and-Arts_jwkskwjsjsjsjsjsjsjdisiaiajsjjzjz
PPTX
This is about the usage of color in universities design
PPTX
Operational Research check it out. I like this it is pretty good
PDF
15901922083_PQA.pdf................................
PPTX
GREEN BUILDINGS are eco friendly for environment
PDF
INTRODUCTION-TO-ARTS-PRELIM.pdf arts and appreciation
PDF
630895715-Romanesque-Architecture-ppt.pdf
WEEK-3_TOPIC_Photographic_Rays__Its_Nature_and_Characteristics.pptx
QA PROCESS FLOW CHART (1).pptxbbbbbbbbbnnnn
Nationalism in India Ch-2.pptx ssssss classs 10
Cloud Computing ppt.ppt1QU4FFIWEKWEIFRRGx
400kV_Switchyardasdsfesfewffwefrrwewew_Training_Module.pptx
Annah la Javanaise_ The Truth Behind Gauguin’s Model.pdf
IOT Unit 6 PPT ( ~ By Prof. Simran Ahuja ).pptx
Celebrate Krishna Janmashtami 2025 | Cottage9
SUBANEN DANCE DUMENDINGAN DANCE LITERATURE
Imelda Ppt Improved 1 March 2021 (1) (3) (1).pptx
QA PROCESS FLOW CHART (1).pptxaaaaaaaaaaaa
Chemical Reactions in Our Lives.pptxyyyyyyyyy
huyfuygkgkugi iyugib jiygi uyuyguygv uyguyv
Music-and-Arts_jwkskwjsjsjsjsjsjsjdisiaiajsjjzjz
This is about the usage of color in universities design
Operational Research check it out. I like this it is pretty good
15901922083_PQA.pdf................................
GREEN BUILDINGS are eco friendly for environment
INTRODUCTION-TO-ARTS-PRELIM.pdf arts and appreciation
630895715-Romanesque-Architecture-ppt.pdf

Embeddings for recommendation systems