SlideShare a Scribd company logo
Big data ° Real time
The open big data serving engine; store, search,
rank and organize big data at user serving time.
Big data maturity levels
Latent Data is produced but not systematically leveraged
Examples Credit card transaction data is stored for audit purposes.
Movie streaming events are logged.
Analysis Data is used to inform decisions made by humans
Examples Statistics on credit card fraud are gathered to create policies for flagging fraudulent transactions.
Lists of movies popular with various user segments are compiled to inform curated recommendation lists.
Learning Data is used to learn automated decisions disconnected from direct action
Examples Fraudulent credit card transactions are automatically flagged.
Lists of movie recommendations per user segment are automatically generated.
Acting Automated data-driven decisions are made in real time
Examples Fraudulent credit card transactions are automatically blocked.
Personalized movie recommendations are computed when needed by that user.
Closer look: Acting
Acting Automated data-driven decisions are made in real time
Examples Fraudulent credit card transactions are automatically blocked.
Personalized movie recommendations are computed when needed by that user.
Two types
Decisions can be made by considering a single data item:
Streaming, or stateless model evaluation
Decisions need to consider many data items:
Big data serving
Big data serving: What is required?
Real-time actions: Find data and make inferences in tens of milliseconds.
Realtime knowledge: Handle data changes at high continuous rates.
Scalable: Handle large requests rates over big data sets.
Always available: Recover from hardware failures without human intervention.
Online evolvable: Change schemas, logic, models, hardware while online.
Integrated: Data feeds from Hadoop, learned models from TensorFlow etc.
Introducing Vespa
An open source platform for big data serving
As Hadoop: Developed at Yahoo for search, now for all big data serving cases
Open source: Visit the new site at http://vespa.ai
Big data: Makes the Big Data Serving features available for everyone
Vespa at Oath / Yahoo
Oath:Tumblr, TechCrunch, Huffington Post, Aol, Engadget, Gemini, News, Sports, Finance, Mail, etc.
Hundreds of Vespa applications,
… serving over a billion users
… over 200.000 queries per second
… over billions of content items
Vespa is
A platform for low latency computations over large, evolving data sets
• Search and selection over structured and unstructured data
• Relevance scoring: NL features, advanced ML models, TensorFlow etc.
• Query time organization and aggregation of matching data
• Real-time writes at a high sustained rate
• Live elastic and auto-recovering stateful content clusters
• Processing logic container (Java)
• Managed clusters: One to hundreds of nodes
Typical use cases: text search, personalization / recommendation / targeting, real-time data display
Case study: Zedge
The primary motivations for Zedge to use Vespa are
1) simplify search and recommender systems for Zedge Android and
iOS apps, both for serving (reduce amount of custom code to maintain) and
for processing/indexing (reduce need for big data jobs by calculating more
on the fly with tensors in Vespa)
2) accelerate innovation for content discovery, e.g. easier to improve
ranking with machine learning using Vespa in combination with Tensorflow
than with e.g. our custom code recommender systems. An added bonus so
far has been that more people understand both search and recommender
systems due to the overall reduction in complexity of search and
recommender systems
- Zedge VP of Data, Amund Tveit
2017 Worldwide
Download Leaders
Comparisons
Vespa: Focus on big data serving: Large scale, efficient, ML models
ElasticSearch: Focus on analytics: Log ingestion, visualization etc.
Solr: Focused on enterprise search: Handling document formats etc.
Relational databases: Transactions, hard to scale, no IR, no relevance
NoSQl stores: Easier to scale, no transactions, no IR, no relevance
Hadoop/Cloudera/
Hortonworks:
Big Data, but not for serving
Text search, relevance,
grouping and aggregation
Analytics
Vespa Elastic Search
Big data serving
Vespa and Elastic Search use cases
Analytics vs big data serving
Analytics Big data serving
Response time in low seconds Response time in low milliseconds
Low query rate High query rate
Time series, append only Random writes
Down time, data loss acceptable HA, no data loss, online redistribution
Massive data sets (trillions of docs) are cheap Massive data sets are more expensive
Analytics GUI integration Machine learning integration
VS
Vespa architecture
Container node
Query
Application
Package
Admin &
Config
Content node
Deploy
- Configuration
- Components
- ML models
Scatter-gather
Core
sharding
models models models
1) Parallelization
2) Move execution to data nodes
3) Prepared data structures (indexes etc.)
Scalable low latency execution:
How to bound latency
Amdahl’s law:
speedup = 1 / (s + p / N)
SLA
Latency: 100ms @ 95%
Throughput: 500 qps
Utilizing increased resources to
potentially increase quality of
returned results.
Inference in Vespa
Tensor data model: Multidimensional collections of
numbers in queries, documents, models
Tensor math express all common machine-learned
models with join, map, reduce
TensorFlow and ONNX integration: Deploy
TensorFlow and ONNX (SciKit, Caffe2, PyTorch
etc.) directly on Vespa
Vespa execution engine optimized for repeated
execution of models over many data items, and
running many inferences in parallel
<application package>/models/
search music {
rank-profile song inherits default {
first-phase {
expression {
0.7 * nativeRank(artist,album,track) +
0.1 * tensorflow(tf-model-dir) +
0.1 * onnx(onnx-model-file, output) +
0.1 * xgboost(xgboost-model-file)
}
}
}
}
map(
join(
reduce(
join(
Placeholder,
Weights_1,
f(x,y)(x * y)
),
sum,
d1
),
Weights_2,
f(x,y)(x + y)
),
f(x)(max(0,x))
)Placeholder Weights_1
matmul Weights_2
add
relu
Vespa Recap
Making the best use of big data often implies making decisions in real time
Vespa is the only open source platform optimized for such big data serving
Available on https://vespa.ai
Quick start: Run a complete application (on a laptop or AWS) in 10 minutes
http://docs.vespa.ai/documentation/vespa-quick-start.html
Tutorial: Make a scalable blog search and recommendation engine from scratch
http://docs.vespa.ai/documentation/tutorials/blog-search.html

More Related Content

PDF
Introduction to Vespa – The Open Source Big Data Serving Engine, Jon Bratseth...
PDF
Big data landscape
PPTX
Real time bi solution architecture
PPTX
Introduction of big data unit 1
PDF
Big Data Ecosystem
PDF
Machine Data Analytics
PPTX
Big data unit 2
PDF
Big data ecosystem
Introduction to Vespa – The Open Source Big Data Serving Engine, Jon Bratseth...
Big data landscape
Real time bi solution architecture
Introduction of big data unit 1
Big Data Ecosystem
Machine Data Analytics
Big data unit 2
Big data ecosystem

What's hot (20)

PPT
Big Tools for Big Data
PPTX
Big data analytics: Technology's bleeding edge
PDF
Big Data Landscape 2016
PPTX
Big Data Unit 4 - Hadoop
PPTX
Fundamentals of big data analytics and Hadoop
PPTX
Big Data Analysis Patterns with Hadoop, Mahout and Solr
PPT
Hadoop mapreduce and yarn frame work- unit5
PPTX
NoSQL Type, Bigdata, and Analytics
PPTX
Data science big data and analytics
PPTX
Hadoop - An Introduction
PPTX
Big data technology unit 3
PPTX
Big Data Analytics
PDF
Future of Data - Big Data
PPTX
Big Data and Hadoop
PPTX
PPTX
Intro to bigdata on gcp (1)
PPTX
Bigdata
PPTX
DW Appliance
PPTX
Big Data Analytics & Architecture
ODP
Big Data Analytics - Introduction
Big Tools for Big Data
Big data analytics: Technology's bleeding edge
Big Data Landscape 2016
Big Data Unit 4 - Hadoop
Fundamentals of big data analytics and Hadoop
Big Data Analysis Patterns with Hadoop, Mahout and Solr
Hadoop mapreduce and yarn frame work- unit5
NoSQL Type, Bigdata, and Analytics
Data science big data and analytics
Hadoop - An Introduction
Big data technology unit 3
Big Data Analytics
Future of Data - Big Data
Big Data and Hadoop
Intro to bigdata on gcp (1)
Bigdata
DW Appliance
Big Data Analytics & Architecture
Big Data Analytics - Introduction
Ad

Similar to Big Data Serving with Vespa - Jon Bratseth, Distinguished Architect, Oath (20)

PDF
Big data serving: Processing and inference at scale in real time
PDF
Introduction Big Data
PPTX
Big Data Session 1.pptx
PPTX
Big Data Analytics PPT - S1 working .pptx
PPTX
Big data technologies with Case Study Finance and Healthcare
PPTX
Unushs susus susujss. Ssuusussjjsjsit 4.pptx
PPT
Lecture 5 - Big Data and Hadoop Intro.ppt
PDF
Big data on_aws in korea by abhishek sinha (lunch and learn)
PPTX
Lecture1 BIG DATA and Types of data in details
PPTX
Azure Data Explorer deep dive - review 04.2020
PPTX
Big data: Descoberta de conhecimento em ambientes de big data e computação na...
PPTX
TSE_Pres12.pptx
DOCX
Big data and Hadoop overview
PDF
Big Data Processing & Analytics: Improving data insight.pdf
PDF
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
PDF
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
PPTX
big data eco system fundamentals of data science
PDF
Cloud and Bid data Dr.VK.pdf
PPSX
Big Data
Big data serving: Processing and inference at scale in real time
Introduction Big Data
Big Data Session 1.pptx
Big Data Analytics PPT - S1 working .pptx
Big data technologies with Case Study Finance and Healthcare
Unushs susus susujss. Ssuusussjjsjsit 4.pptx
Lecture 5 - Big Data and Hadoop Intro.ppt
Big data on_aws in korea by abhishek sinha (lunch and learn)
Lecture1 BIG DATA and Types of data in details
Azure Data Explorer deep dive - review 04.2020
Big data: Descoberta de conhecimento em ambientes de big data e computação na...
TSE_Pres12.pptx
Big data and Hadoop overview
Big Data Processing & Analytics: Improving data insight.pdf
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
big data eco system fundamentals of data science
Cloud and Bid data Dr.VK.pdf
Big Data
Ad

More from Yahoo Developer Network (20)

PDF
Developing Mobile Apps for Performance - Swapnil Patel, Verizon Media
PDF
Athenz - The Open-Source Solution to Provide Access Control in Dynamic Infras...
PDF
Athenz & SPIFFE, Tatsuya Yano, Yahoo Japan
PDF
Athenz with Istio - Single Access Control Model in Cloud Infrastructures, Tat...
PDF
CICD at Oath using Screwdriver
PPTX
How @TwitterHadoop Chose Google Cloud, Joep Rottinghuis, Lohit VijayaRenu
PDF
The Future of Hadoop in an AI World, Milind Bhandarkar, CEO, Ampool
PPTX
Apache YARN Federation and Tez at Microsoft, Anupam Upadhyay, Adrian Nicoara,...
PPTX
Containerized Services on Apache Hadoop YARN: Past, Present, and Future, Shan...
PDF
HDFS Scalability and Security, Daryn Sharp, Senior Engineer, Oath
PPTX
Hadoop {Submarine} Project: Running deep learning workloads on YARN, Wangda T...
PDF
Moving the Oath Grid to Docker, Eric Badger, Oath
PDF
Architecting Petabyte Scale AI Applications
PPTX
Jun 2017 HUG: YARN Scheduling – A Step Beyond
PDF
Jun 2017 HUG: Large-Scale Machine Learning: Use Cases and Technologies
PPTX
February 2017 HUG: Slow, Stuck, or Runaway Apps? Learn How to Quickly Fix Pro...
PPTX
February 2017 HUG: Exactly-once end-to-end processing with Apache Apex
PPTX
February 2017 HUG: Data Sketches: A required toolkit for Big Data Analytics
PDF
October 2016 HUG: Pulsar,  a highly scalable, low latency pub-sub messaging s...
PPTX
October 2016 HUG: Architecture of an Open Source RDBMS powered by HBase and ...
Developing Mobile Apps for Performance - Swapnil Patel, Verizon Media
Athenz - The Open-Source Solution to Provide Access Control in Dynamic Infras...
Athenz & SPIFFE, Tatsuya Yano, Yahoo Japan
Athenz with Istio - Single Access Control Model in Cloud Infrastructures, Tat...
CICD at Oath using Screwdriver
How @TwitterHadoop Chose Google Cloud, Joep Rottinghuis, Lohit VijayaRenu
The Future of Hadoop in an AI World, Milind Bhandarkar, CEO, Ampool
Apache YARN Federation and Tez at Microsoft, Anupam Upadhyay, Adrian Nicoara,...
Containerized Services on Apache Hadoop YARN: Past, Present, and Future, Shan...
HDFS Scalability and Security, Daryn Sharp, Senior Engineer, Oath
Hadoop {Submarine} Project: Running deep learning workloads on YARN, Wangda T...
Moving the Oath Grid to Docker, Eric Badger, Oath
Architecting Petabyte Scale AI Applications
Jun 2017 HUG: YARN Scheduling – A Step Beyond
Jun 2017 HUG: Large-Scale Machine Learning: Use Cases and Technologies
February 2017 HUG: Slow, Stuck, or Runaway Apps? Learn How to Quickly Fix Pro...
February 2017 HUG: Exactly-once end-to-end processing with Apache Apex
February 2017 HUG: Data Sketches: A required toolkit for Big Data Analytics
October 2016 HUG: Pulsar,  a highly scalable, low latency pub-sub messaging s...
October 2016 HUG: Architecture of an Open Source RDBMS powered by HBase and ...

Recently uploaded (20)

PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPT
Teaching material agriculture food technology
PDF
Advanced IT Governance
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Electronic commerce courselecture one. Pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Empathic Computing: Creating Shared Understanding
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
KodekX | Application Modernization Development
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Reach Out and Touch Someone: Haptics and Empathic Computing
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Understanding_Digital_Forensics_Presentation.pptx
Teaching material agriculture food technology
Advanced IT Governance
MYSQL Presentation for SQL database connectivity
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Electronic commerce courselecture one. Pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Empathic Computing: Creating Shared Understanding
Mobile App Security Testing_ A Comprehensive Guide.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
KodekX | Application Modernization Development
Chapter 3 Spatial Domain Image Processing.pdf
Diabetes mellitus diagnosis method based random forest with bat algorithm
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf

Big Data Serving with Vespa - Jon Bratseth, Distinguished Architect, Oath

  • 1. Big data ° Real time The open big data serving engine; store, search, rank and organize big data at user serving time.
  • 2. Big data maturity levels Latent Data is produced but not systematically leveraged Examples Credit card transaction data is stored for audit purposes. Movie streaming events are logged. Analysis Data is used to inform decisions made by humans Examples Statistics on credit card fraud are gathered to create policies for flagging fraudulent transactions. Lists of movies popular with various user segments are compiled to inform curated recommendation lists. Learning Data is used to learn automated decisions disconnected from direct action Examples Fraudulent credit card transactions are automatically flagged. Lists of movie recommendations per user segment are automatically generated. Acting Automated data-driven decisions are made in real time Examples Fraudulent credit card transactions are automatically blocked. Personalized movie recommendations are computed when needed by that user.
  • 3. Closer look: Acting Acting Automated data-driven decisions are made in real time Examples Fraudulent credit card transactions are automatically blocked. Personalized movie recommendations are computed when needed by that user. Two types Decisions can be made by considering a single data item: Streaming, or stateless model evaluation Decisions need to consider many data items: Big data serving
  • 4. Big data serving: What is required? Real-time actions: Find data and make inferences in tens of milliseconds. Realtime knowledge: Handle data changes at high continuous rates. Scalable: Handle large requests rates over big data sets. Always available: Recover from hardware failures without human intervention. Online evolvable: Change schemas, logic, models, hardware while online. Integrated: Data feeds from Hadoop, learned models from TensorFlow etc.
  • 5. Introducing Vespa An open source platform for big data serving As Hadoop: Developed at Yahoo for search, now for all big data serving cases Open source: Visit the new site at http://vespa.ai Big data: Makes the Big Data Serving features available for everyone
  • 6. Vespa at Oath / Yahoo Oath:Tumblr, TechCrunch, Huffington Post, Aol, Engadget, Gemini, News, Sports, Finance, Mail, etc. Hundreds of Vespa applications, … serving over a billion users … over 200.000 queries per second … over billions of content items
  • 7. Vespa is A platform for low latency computations over large, evolving data sets • Search and selection over structured and unstructured data • Relevance scoring: NL features, advanced ML models, TensorFlow etc. • Query time organization and aggregation of matching data • Real-time writes at a high sustained rate • Live elastic and auto-recovering stateful content clusters • Processing logic container (Java) • Managed clusters: One to hundreds of nodes Typical use cases: text search, personalization / recommendation / targeting, real-time data display
  • 8. Case study: Zedge The primary motivations for Zedge to use Vespa are 1) simplify search and recommender systems for Zedge Android and iOS apps, both for serving (reduce amount of custom code to maintain) and for processing/indexing (reduce need for big data jobs by calculating more on the fly with tensors in Vespa) 2) accelerate innovation for content discovery, e.g. easier to improve ranking with machine learning using Vespa in combination with Tensorflow than with e.g. our custom code recommender systems. An added bonus so far has been that more people understand both search and recommender systems due to the overall reduction in complexity of search and recommender systems - Zedge VP of Data, Amund Tveit 2017 Worldwide Download Leaders
  • 9. Comparisons Vespa: Focus on big data serving: Large scale, efficient, ML models ElasticSearch: Focus on analytics: Log ingestion, visualization etc. Solr: Focused on enterprise search: Handling document formats etc. Relational databases: Transactions, hard to scale, no IR, no relevance NoSQl stores: Easier to scale, no transactions, no IR, no relevance Hadoop/Cloudera/ Hortonworks: Big Data, but not for serving
  • 10. Text search, relevance, grouping and aggregation Analytics Vespa Elastic Search Big data serving Vespa and Elastic Search use cases
  • 11. Analytics vs big data serving Analytics Big data serving Response time in low seconds Response time in low milliseconds Low query rate High query rate Time series, append only Random writes Down time, data loss acceptable HA, no data loss, online redistribution Massive data sets (trillions of docs) are cheap Massive data sets are more expensive Analytics GUI integration Machine learning integration VS
  • 13. Container node Query Application Package Admin & Config Content node Deploy - Configuration - Components - ML models Scatter-gather Core sharding models models models 1) Parallelization 2) Move execution to data nodes 3) Prepared data structures (indexes etc.) Scalable low latency execution: How to bound latency
  • 14. Amdahl’s law: speedup = 1 / (s + p / N)
  • 15. SLA Latency: 100ms @ 95% Throughput: 500 qps Utilizing increased resources to potentially increase quality of returned results.
  • 16. Inference in Vespa Tensor data model: Multidimensional collections of numbers in queries, documents, models Tensor math express all common machine-learned models with join, map, reduce TensorFlow and ONNX integration: Deploy TensorFlow and ONNX (SciKit, Caffe2, PyTorch etc.) directly on Vespa Vespa execution engine optimized for repeated execution of models over many data items, and running many inferences in parallel
  • 17. <application package>/models/ search music { rank-profile song inherits default { first-phase { expression { 0.7 * nativeRank(artist,album,track) + 0.1 * tensorflow(tf-model-dir) + 0.1 * onnx(onnx-model-file, output) + 0.1 * xgboost(xgboost-model-file) } } } }
  • 18. map( join( reduce( join( Placeholder, Weights_1, f(x,y)(x * y) ), sum, d1 ), Weights_2, f(x,y)(x + y) ), f(x)(max(0,x)) )Placeholder Weights_1 matmul Weights_2 add relu
  • 19. Vespa Recap Making the best use of big data often implies making decisions in real time Vespa is the only open source platform optimized for such big data serving Available on https://vespa.ai Quick start: Run a complete application (on a laptop or AWS) in 10 minutes http://docs.vespa.ai/documentation/vespa-quick-start.html Tutorial: Make a scalable blog search and recommendation engine from scratch http://docs.vespa.ai/documentation/tutorials/blog-search.html