SlideShare a Scribd company logo
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
Why User Behavior Insights?
KMWorld Enterprise Search
November 21, 2024
S T A V R O S M A C R A K I S ,
P R O D U C T M A N A G E R , O P E N S E A R C H @ A W S
E R I C P U G H
C O - F O U N D E R , O P E N S O U R C E C O N N E C T I O N S
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. 2
Search is strategic
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. 3
Search is strategic
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. 4
And yet it never
seems to work right
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. 5
And yet it never
seems to work right
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. 6
Why is it hard?
And yet it never
seems to work right
Search is strategic
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
mbedded
• Company search within
• Person search within email
• Applications
• Document search and recommendations (heavy on text)
• Academic documents for PhD researchers
• News for general public, for journalist
• Intranet, highly heterogenous, inconsistent formats
• Knowledge management for researchers, for sales people, for application
engineers
• Legal and regulatory, both high-precision (relevant regulation) and high-
recall (discovery)
• FAQs, call centers, troubleshooting
• Looking for items, not documents (heavy on structured data)
• E-commerce search and recommendations: looking for a thing to buy
• Anything from plumbing parts to real estate to yoga lessons
• Media – movies, TV, …
• Job search; placements
xpert search for interns, for research agents, for hedge fund managers
urant, flight search, fractional jets – real-time availability
e position
ilarity; SmugMug very different from
varied use cases
varied datasets
varied users
7
one size DoEsn’t fit aLl
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. 8
How can you fix it?
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
Conference survey
• Understand application data
• Talk to users and sponsors
• Regression testing
• Performance testing
• Log search actions
• Multilingual coverage
• Evaluate quality of results
• Privacy
9
What search needs
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
What search needs
10
Conference survey
• Talk to users and sponsors
• Privacy
• Understand application data
• Evaluate quality of results
• Log search actions
• Performance testing
• Regression testing
• Multilingual coverage
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
Sort by gaps
11
Conference survey
• Performance testing
• Multilingual coverage
• Regression testing
• Evaluate quality of results
• Talk to users and sponsors
• Privacy
• Understand application data
• Log search actions
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
What most needs improvement
12
Conference survey
• Log search actions
• Understand application data
• Evaluate quality of results
• Talk to users and sponsors
• Regression testing
• Performance testing
• Multilingual coverage
• Privacy
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
What are even big users missing?
13
Conference survey
• Log search actions
• Understand application data
• Evaluate quality of results
• Talk to users and sponsors
• Regression testing
• Performance testing
• Multilingual coverage
• Privacy
AWS customer gaps
• Search specialist engineers
• Search is a discipline
• Logging
• Quality evaluation
• Incorporating multiple signals
• Tuning
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
14
Measurement and analysis
• Behavioral data
• Online evaluation
• Offline evaluation
• A/B testing
Search processing
• Lexical search
• Semantic search
• Sparse neural
Search tuning
• Manual tuning
• Hybrid search
• Semantic reranking
• Learning to Rank (LTR)
The virtuous circle of search improvement
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
§ Query understanding
§ Query rewriting
§ Lexical search
§ Neural sparse retrieval
§ Semantic vector search
§ Multimodal search
§ LLM reranking
§ LLM summarization
How do you choose the best technique(s)?
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
Tune
16
Measurement and analysis
• Search behavior
• Online evaluation tools
• A/B testing
• Creation of judgement sets
• Offline evaluation tools
• Metrics
Search processing
• Lexical search
• ML: Semantic search
• ML: Query understanding
• ML: Sparse neural
• ML: Relevance reranking
• ML: Multimodal
Tuning
• Hybridizing
• Regression evaluation
• Manual tuning of signals
• ML: Bayesian optimization (LTB)
• ML: Semantic model fine-tuning
• ML: Learning to Rank (LTR)
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
Combined strengths
17
Semantic search Lexical search Hybrid search
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
So how do you tune and combine them?
§ Query understanding +
§ Query rewriting +
§ Lexical search +
§ Neural sparse retrieval +
§ Semantic vector search +
§ Multimodal search +
§ LLM reranking +
§ LLM summarization
Hybrids
Tuning (field boosts, …)
Retrieval Augmented Generation
(RAG)
+
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
You need to collect and analyze data
19
Measurement and analysis
• Search behavior
• Online evaluation tools
• A/B testing
• Creation of judgement sets
• Offline evaluation tools
• Metrics
Search processing
• Lexical search
• ML: Semantic search
• ML: Query understanding
• ML: Sparse neural
• ML: Relevance reranking
• ML: Multimodal
Search tuning
• Hybridizing
• Regression evaluation
• Manual tuning of signals
• ML: Bayesian optimization (LTB)
• ML: Semantic model fine-tuning
• ML: Learning to Rank (LTR)
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. 20
User Behavior
Search
Results
Conversion
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. 21
User Behavior – clicks
Search
Results
Conversion
14:23 SEARCH_CLICK
SearchID: 900354 UserID:969
Query: good to be the king louis
14:25 PLAY_CLICK
SearchID: 900254 UserID:969
ObjectID: 9393
14:24 RESULT_CLICK
SearchID: 900254 UserID:969
ObjectID: 9393
Position: 4
14:23 RESULTS
SearchID: 900354 UserID:969
Object 2323, 5498, 3434, 9393, 539
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
User Behavior – end-to-end record
22
User 969 at 14:23
Search 900254 for “good to be the king louis”
Results Object 2323, 5498, 3434, 9393, 5394, 9
Choose Result 4 = Object 9393
Play Object 9393 = UPC 490404333
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
DEMO
23
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
Build !
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
Build !
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
Metrics Data structure designed for ease of: aggregation,
filtering and grouping
Covers multiple use cases:
Quick summaries of performance using different metrics
Investigate performance regressions over time
Compare search configurations
Evaluate performance under different query sets (main sample, top queries, etc)
Deep dive into best and worst performing queries
Search Quality Metrics
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. 29
Example of relevance comparison
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
30
4. Deploy Best
Hybrid Search
Configuration
3. Run Hybrid Search
Optimizer
2. Calculate Implicit
Judgements
1. Collect User Signals
(=UBI Data)
Hybrid Search
Optimizer Based
on User Behavior
Insights Data
Start over
again!
q Interleaved A/B tests
q Bandit algorithms
q Establishing guardrails
Hybrid search optimizer…
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
User Behavior Insights
• Collect fine-grained behavioral data
• Standard schema for search, search results,
actions on search results
• Trace actions on search results causally
• Feed both manual and ML tuning
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
https://www.ubisearch.dev/

More Related Content

PDF
ANT319_Create-a-data-marketplace-with-Amazon-DataZone.pdf
PDF
Automated Workflows and AI Agents with Amazon Bedrock
PDF
stackconf 2024 | Generative AI Security — A Practical Guide to Securing Your ...
PDF
O365Con18 - How to Run a Search Project in SharePoint - Matthew McDermott
PDF
Innovate 7 Principles for effective and cost-efficient generative AI apps.pdf
PDF
Agents for Enterprise Workflows - Berkeley LLM AI Agents MOOC
PDF
[AWS Techshift] Innovation and AI/ML Sagemaker Build-in 머신러닝 모델 활용 및 Marketpl...
ANT319_Create-a-data-marketplace-with-Amazon-DataZone.pdf
Automated Workflows and AI Agents with Amazon Bedrock
stackconf 2024 | Generative AI Security — A Practical Guide to Securing Your ...
O365Con18 - How to Run a Search Project in SharePoint - Matthew McDermott
Innovate 7 Principles for effective and cost-efficient generative AI apps.pdf
Agents for Enterprise Workflows - Berkeley LLM AI Agents MOOC
[AWS Techshift] Innovation and AI/ML Sagemaker Build-in 머신러닝 모델 활용 및 Marketpl...

Similar to Why User Behavior Insights? KMWorld Enterprise Search & Discovery 2024 (7)

PPTX
Build_generative_AI_applications_with_Amazon_Bedrock_and_open_source_framewor...
PDF
Evaluating Large Language Models for Your Applications and Why It Matters
PPTX
Seth Earley Presentation on Knowledge Management Through Search-Based Applica...
PPT
SpiderIT_presentation
PPTX
How Startups can leverage big data?
PDF
Democratize Data with GovernanceDemocratize Data with GovernanceDemocratize D...
PPTX
33 Tactics to Engage and Retain More Customers - IRCE 2016
Build_generative_AI_applications_with_Amazon_Bedrock_and_open_source_framewor...
Evaluating Large Language Models for Your Applications and Why It Matters
Seth Earley Presentation on Knowledge Management Through Search-Based Applica...
SpiderIT_presentation
How Startups can leverage big data?
Democratize Data with GovernanceDemocratize Data with GovernanceDemocratize D...
33 Tactics to Engage and Retain More Customers - IRCE 2016
Ad

More from OpenSource Connections (20)

PDF
Test driven relevancy
PDF
How To Structure Your Search Team for Success
PPT
The right path to making search relevant - Taxonomy Bootcamp London 2019
PDF
Payloads and OCR with Solr
PPTX
Haystack 2019 Lightning Talk - The Future of Quepid - Charlie Hull
PDF
Haystack 2019 Lightning Talk - State of Apache Tika - Tim Allison
PPTX
Haystack 2019 Lightning Talk - Relevance on 17 million full text documents - ...
PPTX
Haystack 2019 Lightning Talk - Solr Cloud on Kubernetes - Manoj Bharadwaj
PDF
Haystack 2019 Lightning Talk - Quaerite a Search relevance evaluation toolkit...
PPTX
Haystack 2019 - Search-based recommendations at Politico - Ryan Kohl
PPTX
Haystack 2019 - Search with Vectors - Simon Hughes
PPTX
Haystack 2019 - Natural Language Search with Knowledge Graphs - Trey Grainger
PPTX
Haystack 2019 - Search Logs + Machine Learning = Auto-Tagging Inventory - Joh...
PDF
Haystack 2019 - Improving Search Relevance with Numeric Features in Elasticse...
PDF
Haystack 2019 - Architectural considerations on search relevancy in the conte...
PPTX
Haystack 2019 - Custom Solr Query Parser Design Option, and Pros & Cons - Ber...
PPTX
Haystack 2019 - Establishing a relevance focused culture in a large organizat...
PPTX
Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...
PDF
2019 Haystack - How The New York Times Tackles Relevance - Jeremiah Via
Test driven relevancy
How To Structure Your Search Team for Success
The right path to making search relevant - Taxonomy Bootcamp London 2019
Payloads and OCR with Solr
Haystack 2019 Lightning Talk - The Future of Quepid - Charlie Hull
Haystack 2019 Lightning Talk - State of Apache Tika - Tim Allison
Haystack 2019 Lightning Talk - Relevance on 17 million full text documents - ...
Haystack 2019 Lightning Talk - Solr Cloud on Kubernetes - Manoj Bharadwaj
Haystack 2019 Lightning Talk - Quaerite a Search relevance evaluation toolkit...
Haystack 2019 - Search-based recommendations at Politico - Ryan Kohl
Haystack 2019 - Search with Vectors - Simon Hughes
Haystack 2019 - Natural Language Search with Knowledge Graphs - Trey Grainger
Haystack 2019 - Search Logs + Machine Learning = Auto-Tagging Inventory - Joh...
Haystack 2019 - Improving Search Relevance with Numeric Features in Elasticse...
Haystack 2019 - Architectural considerations on search relevancy in the conte...
Haystack 2019 - Custom Solr Query Parser Design Option, and Pros & Cons - Ber...
Haystack 2019 - Establishing a relevance focused culture in a large organizat...
Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...
2019 Haystack - How The New York Times Tackles Relevance - Jeremiah Via
Ad

Recently uploaded (20)

PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPT
Teaching material agriculture food technology
PDF
Modernizing your data center with Dell and AMD
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Electronic commerce courselecture one. Pdf
PDF
Approach and Philosophy of On baking technology
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PPTX
MYSQL Presentation for SQL database connectivity
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
A Presentation on Artificial Intelligence
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Empathic Computing: Creating Shared Understanding
Understanding_Digital_Forensics_Presentation.pptx
Teaching material agriculture food technology
Modernizing your data center with Dell and AMD
Diabetes mellitus diagnosis method based random forest with bat algorithm
Electronic commerce courselecture one. Pdf
Approach and Philosophy of On baking technology
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
MYSQL Presentation for SQL database connectivity
The Rise and Fall of 3GPP – Time for a Sabbatical?
Network Security Unit 5.pdf for BCA BBA.
20250228 LYD VKU AI Blended-Learning.pptx
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Chapter 3 Spatial Domain Image Processing.pdf
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Dropbox Q2 2025 Financial Results & Investor Presentation
“AI and Expert System Decision Support & Business Intelligence Systems”
A Presentation on Artificial Intelligence
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Empathic Computing: Creating Shared Understanding

Why User Behavior Insights? KMWorld Enterprise Search & Discovery 2024

  • 1. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. Why User Behavior Insights? KMWorld Enterprise Search November 21, 2024 S T A V R O S M A C R A K I S , P R O D U C T M A N A G E R , O P E N S E A R C H @ A W S E R I C P U G H C O - F O U N D E R , O P E N S O U R C E C O N N E C T I O N S
  • 2. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. 2 Search is strategic
  • 3. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. 3 Search is strategic
  • 4. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. 4 And yet it never seems to work right
  • 5. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. 5 And yet it never seems to work right
  • 6. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. 6 Why is it hard? And yet it never seems to work right Search is strategic
  • 7. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. mbedded • Company search within • Person search within email • Applications • Document search and recommendations (heavy on text) • Academic documents for PhD researchers • News for general public, for journalist • Intranet, highly heterogenous, inconsistent formats • Knowledge management for researchers, for sales people, for application engineers • Legal and regulatory, both high-precision (relevant regulation) and high- recall (discovery) • FAQs, call centers, troubleshooting • Looking for items, not documents (heavy on structured data) • E-commerce search and recommendations: looking for a thing to buy • Anything from plumbing parts to real estate to yoga lessons • Media – movies, TV, … • Job search; placements xpert search for interns, for research agents, for hedge fund managers urant, flight search, fractional jets – real-time availability e position ilarity; SmugMug very different from varied use cases varied datasets varied users 7 one size DoEsn’t fit aLl
  • 8. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. 8 How can you fix it?
  • 9. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. Conference survey • Understand application data • Talk to users and sponsors • Regression testing • Performance testing • Log search actions • Multilingual coverage • Evaluate quality of results • Privacy 9 What search needs
  • 10. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. What search needs 10 Conference survey • Talk to users and sponsors • Privacy • Understand application data • Evaluate quality of results • Log search actions • Performance testing • Regression testing • Multilingual coverage
  • 11. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. Sort by gaps 11 Conference survey • Performance testing • Multilingual coverage • Regression testing • Evaluate quality of results • Talk to users and sponsors • Privacy • Understand application data • Log search actions
  • 12. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. What most needs improvement 12 Conference survey • Log search actions • Understand application data • Evaluate quality of results • Talk to users and sponsors • Regression testing • Performance testing • Multilingual coverage • Privacy
  • 13. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. What are even big users missing? 13 Conference survey • Log search actions • Understand application data • Evaluate quality of results • Talk to users and sponsors • Regression testing • Performance testing • Multilingual coverage • Privacy AWS customer gaps • Search specialist engineers • Search is a discipline • Logging • Quality evaluation • Incorporating multiple signals • Tuning
  • 14. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. 14 Measurement and analysis • Behavioral data • Online evaluation • Offline evaluation • A/B testing Search processing • Lexical search • Semantic search • Sparse neural Search tuning • Manual tuning • Hybrid search • Semantic reranking • Learning to Rank (LTR) The virtuous circle of search improvement
  • 15. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. § Query understanding § Query rewriting § Lexical search § Neural sparse retrieval § Semantic vector search § Multimodal search § LLM reranking § LLM summarization How do you choose the best technique(s)?
  • 16. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. Tune 16 Measurement and analysis • Search behavior • Online evaluation tools • A/B testing • Creation of judgement sets • Offline evaluation tools • Metrics Search processing • Lexical search • ML: Semantic search • ML: Query understanding • ML: Sparse neural • ML: Relevance reranking • ML: Multimodal Tuning • Hybridizing • Regression evaluation • Manual tuning of signals • ML: Bayesian optimization (LTB) • ML: Semantic model fine-tuning • ML: Learning to Rank (LTR)
  • 17. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. Combined strengths 17 Semantic search Lexical search Hybrid search
  • 18. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. So how do you tune and combine them? § Query understanding + § Query rewriting + § Lexical search + § Neural sparse retrieval + § Semantic vector search + § Multimodal search + § LLM reranking + § LLM summarization Hybrids Tuning (field boosts, …) Retrieval Augmented Generation (RAG) +
  • 19. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. You need to collect and analyze data 19 Measurement and analysis • Search behavior • Online evaluation tools • A/B testing • Creation of judgement sets • Offline evaluation tools • Metrics Search processing • Lexical search • ML: Semantic search • ML: Query understanding • ML: Sparse neural • ML: Relevance reranking • ML: Multimodal Search tuning • Hybridizing • Regression evaluation • Manual tuning of signals • ML: Bayesian optimization (LTB) • ML: Semantic model fine-tuning • ML: Learning to Rank (LTR)
  • 20. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. 20 User Behavior Search Results Conversion
  • 21. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. 21 User Behavior – clicks Search Results Conversion 14:23 SEARCH_CLICK SearchID: 900354 UserID:969 Query: good to be the king louis 14:25 PLAY_CLICK SearchID: 900254 UserID:969 ObjectID: 9393 14:24 RESULT_CLICK SearchID: 900254 UserID:969 ObjectID: 9393 Position: 4 14:23 RESULTS SearchID: 900354 UserID:969 Object 2323, 5498, 3434, 9393, 539
  • 22. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. User Behavior – end-to-end record 22 User 969 at 14:23 Search 900254 for “good to be the king louis” Results Object 2323, 5498, 3434, 9393, 5394, 9 Choose Result 4 = Object 9393 Play Object 9393 = UPC 490404333
  • 23. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. DEMO 23
  • 24. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. Build !
  • 25. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. Build !
  • 26. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. Metrics Data structure designed for ease of: aggregation, filtering and grouping Covers multiple use cases: Quick summaries of performance using different metrics Investigate performance regressions over time Compare search configurations Evaluate performance under different query sets (main sample, top queries, etc) Deep dive into best and worst performing queries Search Quality Metrics
  • 27. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
  • 28. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
  • 29. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. 29 Example of relevance comparison
  • 30. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. 30 4. Deploy Best Hybrid Search Configuration 3. Run Hybrid Search Optimizer 2. Calculate Implicit Judgements 1. Collect User Signals (=UBI Data) Hybrid Search Optimizer Based on User Behavior Insights Data Start over again! q Interleaved A/B tests q Bandit algorithms q Establishing guardrails Hybrid search optimizer…
  • 31. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. User Behavior Insights • Collect fine-grained behavioral data • Standard schema for search, search results, actions on search results • Trace actions on search results causally • Feed both manual and ML tuning
  • 32. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. https://www.ubisearch.dev/