SlideShare a Scribd company logo
Kyle Kingsbury Talks about the
Jepsen Test: !
!
What VoltDB Learned About
Data Accuracy and Consistency!
or!
presents…!
Presenters!
John Hugg, VoltDB Inc.
Founding Engineer at VoltDB
Was involved with both adding bugs
found by Jepsen and fixing them.
Kyle Kingsbury, Jepsen.io (aphyr)
Creator of Jepsen, Riemann, Tesser
Likes to break databases. 

Broke VoltDB, in fact.
Why did VoltDB
pick Jepsen?
What is VoltDB and
what’s it good for?
Learn More
So What?
Jepsen
Central
voltdb.com/jepsen
voltdb.com/jepsen
jepsen.io pages
VoltDB’s Jepsen-specific
content
Learn more about VoltDB
Hacker News comments
What is VoltDB?
•  Scale-out, clustered, SQL Relational database
•  Strong serializable transactions by default, even
at high scale
•  An excellent processing engine with rich import/
export functionality
Use Cases
•  Anything where decisions based on logic are made
in real time for incoming events:
•  Policy Enforcement
•  Fraud Detection
•  Ad Tech
•  Real-time personalization
•  Payments
•  Anything where math / calculations are done:
•  Billing and reporting on live data
•  State tracking
Example: Telco
Mobile phone is
dialed.
Request sent to
VoltDB to decide if it
should be let
through.
Single transaction looks at state
and decides if this call:
is fraudulent?
is permitted under plan?
has prepaid balance to cover?
State
Blacklists
Fraud Rules
Billing Info
Recent Activity for both Numbers
Export
to OLAP
Example: Micro
Personalization
User clicks link on
a website. This
generates a
request to VoltDB.
VoltDB transaction
scans a table of rules
and checks which
apply to this event.
Eventually the
transaction decides
what to show the user
next.
That decision is
exported to HDFS
Spark ML is used to look at
historical data in HDFS and
generate new rules.
These rules are loaded into
VoltDB every few hours.
User sees
personalized
content
StateProcessing
Complex,
Transactional
Business Logic
Scale-Out
Performance
Streaming Events
SQL Relational State
Why Jepsen?
•  We are always hungry for tests!
•  Could build VoltDB-Jepsen harness
ourselves, but…

Wouldn’t be as good and wouldn’t have
Kingsbury’s credibility.
•  Customers have asked about it.
•  Kingsbury has a built-in audience (marketing)
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
VoltDB’s Takeaway
VoltDB’s Takeaway
Serious question:
What’s the worst that
could happen?
VoltDB Takeaway
•  Our policy: Consistency or data loss bugs are
blocking bugs to be prioritized above all else.
•  So if Jepsen finds bugs, we need to fix them
ASAP.
•  The risk is that Jepsen finds bugs that we have to
fix, which might impact our schedule.
•  But that’s dumb. If our product has bugs, not
knowing about them doesn’t make them not there.
VoltDB Takeaway
Marketing & Perception:
•  Passing Jepsen is good. People talking about
VoltDB is good. Showing we care about this
stuff is good.
•  Having bugs is bad, but discussing and fixing
issues openly and seriously can be positive.
Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency
Reproducible!
•  100% reproducible test:
•  Set up Jepsen from
Github
•  Clone Jepen VoltDB
driver from Github
•  Run!
•  Can’t do this with systems
you don’t control.
github.com/jepsen-io/voltdb!
I want to learn more!
chat.voltdb.com
forum.voltdb.com
askanengineer
@voltdb.com
@johnhugg
@voltdb
@aphyr
voltdb.com/jepsen

More Related Content

PDF
Stored Procedure Superpowers: A Developer’s Guide
PDF
Acting on Real-time Behavior: How Peak Games Won Transactions
PDF
Why you really want SQL in a Real-Time Enterprise Environment
PDF
Eat Your Data and Have It Too: Get the Blazing Performance of In-Memory Opera...
PDF
Using a Fast Operational Database to Build Real-time Streaming Aggregations
PDF
Mike Stonebraker on Designing An Architecture For Real-time Event Processing
PDF
Arguments for a Unified IoT Architecture
PDF
Lambda-B-Gone: In-memory Case Study for Faster, Smarter and Simpler Answers
Stored Procedure Superpowers: A Developer’s Guide
Acting on Real-time Behavior: How Peak Games Won Transactions
Why you really want SQL in a Real-Time Enterprise Environment
Eat Your Data and Have It Too: Get the Blazing Performance of In-Memory Opera...
Using a Fast Operational Database to Build Real-time Streaming Aggregations
Mike Stonebraker on Designing An Architecture For Real-time Event Processing
Arguments for a Unified IoT Architecture
Lambda-B-Gone: In-memory Case Study for Faster, Smarter and Simpler Answers

What's hot (20)

PDF
The Expert Guide to Fast Data
PPTX
How to Build Real-Time Streaming Analytics with an In-memory, Scale-out SQL D...
PDF
How to Build Cloud-based Microservice Environments with Docker and VoltDB
PDF
Memory Database Technology is Driving a New Cycle of Business Innovation
PDF
APAC Kafka Summit - Best Of
PDF
Scalable and Reliable Logging at Pinterest
PDF
Streaming Data in the Cloud with Confluent and MongoDB Atlas | Robert Walters...
PDF
Event & Data Mesh as a Service: Industrializing Microservices in the Enterpri...
PDF
Webinar | How Clear Capital Delivers Always-on Appraisals on 122 Million Prop...
PPTX
2011 march cloud computing atlanta
PPTX
How DataStax Enterprise and Azure Make Your Apps Scale from Day 1
PPTX
MongoDB Days UK: Tales from the Field
PPTX
RedisConf18 - The Intelligent Database Proxy
PDF
How to design and implement a data ops architecture with sdc and gcp
PDF
Monitoring MySQL at scale
PDF
Data Science and Enterprise Engineering with Michael Finger and Chris Robison
PPTX
Netflix Data Engineering @ Uber Engineering Meetup
PDF
ASPgems - kappa architecture
PPTX
Don't Drop ACID (July 2021)
PDF
Event Sourcing in less than 20 minutes - With Akka and Java 8
The Expert Guide to Fast Data
How to Build Real-Time Streaming Analytics with an In-memory, Scale-out SQL D...
How to Build Cloud-based Microservice Environments with Docker and VoltDB
Memory Database Technology is Driving a New Cycle of Business Innovation
APAC Kafka Summit - Best Of
Scalable and Reliable Logging at Pinterest
Streaming Data in the Cloud with Confluent and MongoDB Atlas | Robert Walters...
Event & Data Mesh as a Service: Industrializing Microservices in the Enterpri...
Webinar | How Clear Capital Delivers Always-on Appraisals on 122 Million Prop...
2011 march cloud computing atlanta
How DataStax Enterprise and Azure Make Your Apps Scale from Day 1
MongoDB Days UK: Tales from the Field
RedisConf18 - The Intelligent Database Proxy
How to design and implement a data ops architecture with sdc and gcp
Monitoring MySQL at scale
Data Science and Enterprise Engineering with Michael Finger and Chris Robison
Netflix Data Engineering @ Uber Engineering Meetup
ASPgems - kappa architecture
Don't Drop ACID (July 2021)
Event Sourcing in less than 20 minutes - With Akka and Java 8
Ad

Viewers also liked (8)

PDF
VoltDB : A Technical Overview
PDF
Transforming Your Business with Fast Data – Five Use Case Examples
PDF
Understanding the Operational Database Infrastructure for IoT and Fast Data
PDF
Understanding the Top Four Use Cases for IoT
PPTX
Estudo comparativo entr bancos RDBMS, NoSQL e NewSQL
PDF
Moving Beyond Batch: Transactional Databases for Real-time Data
PDF
NewSQL overview, Feb 2015
PDF
VoltDB and HPE Vertica Present: Building an IoT Architecture for Fast + Big Data
VoltDB : A Technical Overview
Transforming Your Business with Fast Data – Five Use Case Examples
Understanding the Operational Database Infrastructure for IoT and Fast Data
Understanding the Top Four Use Cases for IoT
Estudo comparativo entr bancos RDBMS, NoSQL e NewSQL
Moving Beyond Batch: Transactional Databases for Real-time Data
NewSQL overview, Feb 2015
VoltDB and HPE Vertica Present: Building an IoT Architecture for Fast + Big Data
Ad

Similar to Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency (20)

PDF
VoltDB Big Data Camp LA 2014 - Scott Jar
PDF
[2C6]Everyplay_Big_Data
PPTX
Inside Wordnik's Architecture
PDF
Jeremy Engle's slides from Redshift / Big Data meetup on July 13, 2017
PDF
Building a Bank out of Microservices (NDC Sydney, August 2016)
PDF
Webinar: How Banks Manage Reference Data with MongoDB
KEY
MonoRails - GoGaRuCo 2012
PPTX
The Rise of Digital Audio (AdsWizz, DevTalks Bucharest, 2015)
PDF
How we integrate Machine Learning Algorithms into our IT Platform at Outfittery
KEY
Make Life Suck Less (Building Scalable Systems)
PDF
MongoDB .local London 2019: Streaming Data on the Shoulders of Giants
PDF
MongoDB .local London 2019: Streaming Data on the Shoulders of Giants
PDF
Behavior-Driven Development (BDD) Testing with Apache Spark with Aaron Colcor...
PDF
Decision Making based on Machine Learning at Outfittery (W-JAX 2017)
PDF
GOTO Night: Decision Making Based on Machine Learning
PDF
How we integrate Machine Learning Algorithms into our IT Platform at Outfitte...
PDF
Joyent circa 2006 (Scale with Rails)
PDF
AmsterdamJUG September 2019 - Better software, faster: Principles of Continuo...
PDF
Building a data warehouse with Amazon Redshift … and a quick look at Amazon ...
PDF
PlayStation and Lucene - Indexing 1M documents per second: Presented by Alexa...
VoltDB Big Data Camp LA 2014 - Scott Jar
[2C6]Everyplay_Big_Data
Inside Wordnik's Architecture
Jeremy Engle's slides from Redshift / Big Data meetup on July 13, 2017
Building a Bank out of Microservices (NDC Sydney, August 2016)
Webinar: How Banks Manage Reference Data with MongoDB
MonoRails - GoGaRuCo 2012
The Rise of Digital Audio (AdsWizz, DevTalks Bucharest, 2015)
How we integrate Machine Learning Algorithms into our IT Platform at Outfittery
Make Life Suck Less (Building Scalable Systems)
MongoDB .local London 2019: Streaming Data on the Shoulders of Giants
MongoDB .local London 2019: Streaming Data on the Shoulders of Giants
Behavior-Driven Development (BDD) Testing with Apache Spark with Aaron Colcor...
Decision Making based on Machine Learning at Outfittery (W-JAX 2017)
GOTO Night: Decision Making Based on Machine Learning
How we integrate Machine Learning Algorithms into our IT Platform at Outfitte...
Joyent circa 2006 (Scale with Rails)
AmsterdamJUG September 2019 - Better software, faster: Principles of Continuo...
Building a data warehouse with Amazon Redshift … and a quick look at Amazon ...
PlayStation and Lucene - Indexing 1M documents per second: Presented by Alexa...

More from VoltDB (13)

PDF
TripleLift: Preparing for a New Programmatic Ad-Tech World
PDF
Fast Data Choices: 5 Strategies for Evaluating Alternative Business and Techn...
PDF
How First to Value Beats First to Market: Case Studies of Fast Data Success
PPTX
Lessons Learned: The Impact of Fast Data for Personalization
PDF
Fast Data for Competitive Advantage: 4 Steps to Expand your Window of Opportu...
PDF
The Two Generals Problem
PDF
How to Build Fast Data Applications: Evaluating the Top Contenders
PDF
Fast Data – the New Big Data
PDF
Real-time Big Data Analytics in the IBM SoftLayer Cloud with VoltDB
PDF
The 10 MS Rule: Getting to 'Yes' with Fast Data & Hadoop
PDF
The State of Streaming Analytics: The Need for Speed and Scale
PDF
Fast Data: Achieving Real-Time Data Analysis Across the Financial Data Continuum
PDF
VoltDB and Flytxt Present: Building a Single Technology Platform for Real-Tim...
TripleLift: Preparing for a New Programmatic Ad-Tech World
Fast Data Choices: 5 Strategies for Evaluating Alternative Business and Techn...
How First to Value Beats First to Market: Case Studies of Fast Data Success
Lessons Learned: The Impact of Fast Data for Personalization
Fast Data for Competitive Advantage: 4 Steps to Expand your Window of Opportu...
The Two Generals Problem
How to Build Fast Data Applications: Evaluating the Top Contenders
Fast Data – the New Big Data
Real-time Big Data Analytics in the IBM SoftLayer Cloud with VoltDB
The 10 MS Rule: Getting to 'Yes' with Fast Data & Hadoop
The State of Streaming Analytics: The Need for Speed and Scale
Fast Data: Achieving Real-Time Data Analysis Across the Financial Data Continuum
VoltDB and Flytxt Present: Building a Single Technology Platform for Real-Tim...

Recently uploaded (20)

PDF
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
PDF
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
PDF
medical staffing services at VALiNTRY
PDF
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
PDF
Odoo Companies in India – Driving Business Transformation.pdf
PPTX
Essential Infomation Tech presentation.pptx
PPTX
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
PPTX
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
PDF
2025 Textile ERP Trends: SAP, Odoo & Oracle
PPTX
Odoo POS Development Services by CandidRoot Solutions
PDF
Navsoft: AI-Powered Business Solutions & Custom Software Development
PPTX
ai tools demonstartion for schools and inter college
PDF
Wondershare Filmora 15 Crack With Activation Key [2025
PPTX
L1 - Introduction to python Backend.pptx
PDF
wealthsignaloriginal-com-DS-text-... (1).pdf
PDF
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
PDF
Understanding Forklifts - TECH EHS Solution
PDF
Which alternative to Crystal Reports is best for small or large businesses.pdf
PDF
Nekopoi APK 2025 free lastest update
PDF
Softaken Excel to vCard Converter Software.pdf
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
medical staffing services at VALiNTRY
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
Odoo Companies in India – Driving Business Transformation.pdf
Essential Infomation Tech presentation.pptx
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
2025 Textile ERP Trends: SAP, Odoo & Oracle
Odoo POS Development Services by CandidRoot Solutions
Navsoft: AI-Powered Business Solutions & Custom Software Development
ai tools demonstartion for schools and inter college
Wondershare Filmora 15 Crack With Activation Key [2025
L1 - Introduction to python Backend.pptx
wealthsignaloriginal-com-DS-text-... (1).pdf
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
Understanding Forklifts - TECH EHS Solution
Which alternative to Crystal Reports is best for small or large businesses.pdf
Nekopoi APK 2025 free lastest update
Softaken Excel to vCard Converter Software.pdf

Kyle Kingsbury Talks about the Jepsen Test: What VoltDB Learned About Data Accuracy and Consistency

  • 1. Kyle Kingsbury Talks about the Jepsen Test: ! ! What VoltDB Learned About Data Accuracy and Consistency! or! presents…!
  • 2. Presenters! John Hugg, VoltDB Inc. Founding Engineer at VoltDB Was involved with both adding bugs found by Jepsen and fixing them. Kyle Kingsbury, Jepsen.io (aphyr) Creator of Jepsen, Riemann, Tesser Likes to break databases. 
 Broke VoltDB, in fact.
  • 3. Why did VoltDB pick Jepsen? What is VoltDB and what’s it good for? Learn More So What?
  • 6. What is VoltDB? •  Scale-out, clustered, SQL Relational database •  Strong serializable transactions by default, even at high scale •  An excellent processing engine with rich import/ export functionality
  • 7. Use Cases •  Anything where decisions based on logic are made in real time for incoming events: •  Policy Enforcement •  Fraud Detection •  Ad Tech •  Real-time personalization •  Payments •  Anything where math / calculations are done: •  Billing and reporting on live data •  State tracking
  • 8. Example: Telco Mobile phone is dialed. Request sent to VoltDB to decide if it should be let through. Single transaction looks at state and decides if this call: is fraudulent? is permitted under plan? has prepaid balance to cover? State Blacklists Fraud Rules Billing Info Recent Activity for both Numbers Export to OLAP
  • 9. Example: Micro Personalization User clicks link on a website. This generates a request to VoltDB. VoltDB transaction scans a table of rules and checks which apply to this event. Eventually the transaction decides what to show the user next. That decision is exported to HDFS Spark ML is used to look at historical data in HDFS and generate new rules. These rules are loaded into VoltDB every few hours. User sees personalized content
  • 12. Why Jepsen? •  We are always hungry for tests! •  Could build VoltDB-Jepsen harness ourselves, but…
 Wouldn’t be as good and wouldn’t have Kingsbury’s credibility. •  Customers have asked about it. •  Kingsbury has a built-in audience (marketing)
  • 61. VoltDB’s Takeaway Serious question: What’s the worst that could happen?
  • 62. VoltDB Takeaway •  Our policy: Consistency or data loss bugs are blocking bugs to be prioritized above all else. •  So if Jepsen finds bugs, we need to fix them ASAP. •  The risk is that Jepsen finds bugs that we have to fix, which might impact our schedule. •  But that’s dumb. If our product has bugs, not knowing about them doesn’t make them not there.
  • 63. VoltDB Takeaway Marketing & Perception: •  Passing Jepsen is good. People talking about VoltDB is good. Showing we care about this stuff is good. •  Having bugs is bad, but discussing and fixing issues openly and seriously can be positive.
  • 65. Reproducible! •  100% reproducible test: •  Set up Jepsen from Github •  Clone Jepen VoltDB driver from Github •  Run! •  Can’t do this with systems you don’t control. github.com/jepsen-io/voltdb!
  • 66. I want to learn more! chat.voltdb.com forum.voltdb.com askanengineer @voltdb.com @johnhugg @voltdb @aphyr voltdb.com/jepsen