Big Data Serving with Vespa - Jon Bratseth, Distinguished Architect, Oath

Big data ° Real time
The open big data serving engine; store, search,
rank and organize big data at user serving time.

Big data maturity levels
Latent Data is produced but not systematically leveraged
Examples Credit card transaction data is stored for audit purposes.
Movie streaming events are logged.
Analysis Data is used to inform decisions made by humans
Examples Statistics on credit card fraud are gathered to create policies for flagging fraudulent transactions.
Lists of movies popular with various user segments are compiled to inform curated recommendation lists.
Learning Data is used to learn automated decisions disconnected from direct action
Examples Fraudulent credit card transactions are automatically flagged.
Lists of movie recommendations per user segment are automatically generated.
Acting Automated data-driven decisions are made in real time
Examples Fraudulent credit card transactions are automatically blocked.
Personalized movie recommendations are computed when needed by that user.

Closer look: Acting
Acting Automated data-driven decisions are made in real time
Examples Fraudulent credit card transactions are automatically blocked.
Personalized movie recommendations are computed when needed by that user.
Two types
Decisions can be made by considering a single data item:
Streaming, or stateless model evaluation
Decisions need to consider many data items:
Big data serving

Big data serving: What is required?
Real-time actions: Find data and make inferences in tens of milliseconds.
Realtime knowledge: Handle data changes at high continuous rates.
Scalable: Handle large requests rates over big data sets.
Always available: Recover from hardware failures without human intervention.
Online evolvable: Change schemas, logic, models, hardware while online.
Integrated: Data feeds from Hadoop, learned models from TensorFlow etc.

Introducing Vespa
An open source platform for big data serving
As Hadoop: Developed at Yahoo for search, now for all big data serving cases
Open source: Visit the new site at http://vespa.ai
Big data: Makes the Big Data Serving features available for everyone

Vespa at Oath / Yahoo
Oath:Tumblr, TechCrunch, Huffington Post, Aol, Engadget, Gemini, News, Sports, Finance, Mail, etc.
Hundreds of Vespa applications,
… serving over a billion users
… over 200.000 queries per second
… over billions of content items

Vespa is
A platform for low latency computations over large, evolving data sets
• Search and selection over structured and unstructured data
• Relevance scoring: NL features, advanced ML models, TensorFlow etc.
• Query time organization and aggregation of matching data
• Real-time writes at a high sustained rate
• Live elastic and auto-recovering stateful content clusters
• Processing logic container (Java)
• Managed clusters: One to hundreds of nodes
Typical use cases: text search, personalization / recommendation / targeting, real-time data display

Case study: Zedge
The primary motivations for Zedge to use Vespa are
1) simplify search and recommender systems for Zedge Android and
iOS apps, both for serving (reduce amount of custom code to maintain) and
for processing/indexing (reduce need for big data jobs by calculating more
on the fly with tensors in Vespa)
2) accelerate innovation for content discovery, e.g. easier to improve
ranking with machine learning using Vespa in combination with Tensorflow
than with e.g. our custom code recommender systems. An added bonus so
far has been that more people understand both search and recommender
systems due to the overall reduction in complexity of search and
recommender systems
- Zedge VP of Data, Amund Tveit
2017 Worldwide
Download Leaders

Comparisons
Vespa: Focus on big data serving: Large scale, efficient, ML models
ElasticSearch: Focus on analytics: Log ingestion, visualization etc.
Solr: Focused on enterprise search: Handling document formats etc.
Relational databases: Transactions, hard to scale, no IR, no relevance
NoSQl stores: Easier to scale, no transactions, no IR, no relevance
Hadoop/Cloudera/
Hortonworks:
Big Data, but not for serving

Text search, relevance,
grouping and aggregation
Analytics
Vespa Elastic Search
Big data serving
Vespa and Elastic Search use cases

Analytics vs big data serving
Analytics Big data serving
Response time in low seconds Response time in low milliseconds
Low query rate High query rate
Time series, append only Random writes
Down time, data loss acceptable HA, no data loss, online redistribution
Massive data sets (trillions of docs) are cheap Massive data sets are more expensive
Analytics GUI integration Machine learning integration
VS

Container node
Query
Application
Package
Admin &
Config
Content node
Deploy
- Configuration
- Components
- ML models
Scatter-gather
Core
sharding
models models models
1) Parallelization
2) Move execution to data nodes
3) Prepared data structures (indexes etc.)
Scalable low latency execution:
How to bound latency

Amdahl’s law:
speedup = 1 / (s + p / N)

SLA
Latency: 100ms @ 95%
Throughput: 500 qps
Utilizing increased resources to
potentially increase quality of
returned results.

Inference in Vespa
Tensor data model: Multidimensional collections of
numbers in queries, documents, models
Tensor math express all common machine-learned
models with join, map, reduce
TensorFlow and ONNX integration: Deploy
TensorFlow and ONNX (SciKit, Caffe2, PyTorch
etc.) directly on Vespa
Vespa execution engine optimized for repeated
execution of models over many data items, and
running many inferences in parallel

<application package>/models/
search music {
rank-profile song inherits default {
first-phase {
expression {
0.7 * nativeRank(artist,album,track) +
0.1 * tensorflow(tf-model-dir) +
0.1 * onnx(onnx-model-file, output) +
0.1 * xgboost(xgboost-model-file)
}
}
}
}

map(
join(
reduce(
join(
Placeholder,
Weights_1,
f(x,y)(x * y)
),
sum,
d1
),
Weights_2,
f(x,y)(x + y)
),
f(x)(max(0,x))
)Placeholder Weights_1
matmul Weights_2
add
relu

Vespa Recap
Making the best use of big data often implies making decisions in real time
Vespa is the only open source platform optimized for such big data serving
Available on https://vespa.ai
Quick start: Run a complete application (on a laptop or AWS) in 10 minutes
http://docs.vespa.ai/documentation/vespa-quick-start.html
Tutorial: Make a scalable blog search and recommendation engine from scratch
http://docs.vespa.ai/documentation/tutorials/blog-search.html

Big Data Serving with Vespa - Jon Bratseth, Distinguished Architect, Oath

More Related Content

What's hot (20)

Similar to Big Data Serving with Vespa - Jon Bratseth, Distinguished Architect, Oath (20)

More from Yahoo Developer Network (20)

Recently uploaded (20)

Big Data Serving with Vespa - Jon Bratseth, Distinguished Architect, Oath