DocumentDB: Blazing fast planet-scale NoSQL
DocumentDB Team
E-mail: askdocdb@microsoft.com
Twitter: @documentdb
The NoSQL landscape
NoSQL Evolution
Volume
Velocity
Variety
• How can my app deal with massive
volumes of data and throughput?
• How do I elastically scale my database?
• How do I write responsive apps?
• How do I make data available where my users are?
• How do I write highly available apps?
• How do I deal with schema changes?
• How do I iterate rapidly?
• What data models work at scale?
NoSQL Evolution
Volume
• Hyper-converged/hyper-scale architectures
• Horizontal partitioning
• Elastic scale
Velocity
• Write optimized database engines
• Global distribution
• Active-active topologies
• Tunable consistency
Variety
• Dynamically typed databases
• Schema-free databases
• Logical index layouts (inverted, columnar, etc.)
2006: BigTable paper
2007: Dynamo paper / AWS SimpleDB
2008: Cassandra
2009: MongoDB / Riak / Neo4J
2010: Project Florence
2012: AWS DynamoDB
2014: DocumentDB Preview
2015: DocumentDB GA
2016
One size does not fit all
Azure PaaS
Azure SQL DB (relational, read optimized)
• Scale-up
• Co-located compute & storage
• Index Mgmt/QP, local persistence, local compute & storage
• A single database up to 1TB (future, 4TB)
Data Lakes: Azure Data Lake/U-SQL, HDInsight/Spark
• Scale-out
• Disaggregated remote storage
• Distributed file system, <1 EB
• Compute runtimes …
Azure DocumentDB (NoSQL, write and read optimized)
• Scale-out
• Co-located compute & storage
• Index Mgmt/QP, local persistence, local compute & storage on each shard
• A single collection 1PB & 100s of millions of req/sec
• Multiple collections in a database
Common scenarios
Retail, CMS, Education
• Product Catalog
• Product Recommendations
• Personalization
• Campaign Management
• Blogs and CMS
Gaming
• Multiplayer Games
• Social Gameplay
• Leaderboards
• Game analytics
IoT, Sensor Data
• Telemetry + Event Store
• Telematics
• Device Registry
Social Analytics, Ad Tech
• User behavior telemetry
• Personalization
• Customer 360 view
Capabilities
Global distribution from the ground up
Regional Availability
As a Ring 0 service, DocumentDB will be available by default in all new Azure regions
Guaranteed Low Latency
“I want my data wherever my users are.”
Reads <10ms @ P99, <1ms @ P50
Writes <15ms @ P99, <6ms @ P50
• Globally distributed with reads and
writes served from local region
• Write optimized, latch-free database
engine designed for SSDs and low
latency access
• Synchronous and automatic indexing
at sustained ingestion rates
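The latency targets above are stated as percentiles (P50 and P99). As a rough sketch of what such a target means, the snippet below computes nearest-rank percentiles over a list of measured latencies and checks them against thresholds. The function names and the sample data are hypothetical and are not part of any DocumentDB API.

```python
import math


def percentile(samples, p):
    """Nearest-rank percentile: the smallest sample with at least p% of samples at or below it."""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    rank = max(1, math.ceil(p * len(ordered) / 100))
    return ordered[rank - 1]


def meets_latency_targets(samples_ms, p50_target_ms, p99_target_ms):
    """True when both the median and the 99th percentile are under their targets."""
    return (percentile(samples_ms, 50) < p50_target_ms
            and percentile(samples_ms, 99) < p99_target_ms)


# Hypothetical read latencies in milliseconds: 98 fast requests and 2 slow ones.
read_latencies_ms = [0.5] * 98 + [8.0, 9.0]
print(percentile(read_latencies_ms, 50), percentile(read_latencies_ms, 99))
print(meets_latency_targets(read_latencies_ms, p50_target_ms=1, p99_target_ms=10))
```

A P99 target tolerates a small tail of slow requests, which is why the single 9 ms outlier in the sample does not affect the result.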
Elastically scalable storage
• System designed to independently scale
storage and throughput
• Transparent server side partition
management and routing
• Automatically indexed SSD storage
• Automatic global distribution of data
across any number of Azure regions
• Optionally evict old data using built-in
support for TTL
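The slides do not describe how server-side partition routing is implemented. As an illustration only, the sketch below assumes a simple hash-modulo scheme: a partition key is hashed and mapped to one of a fixed number of partitions. The function name and the choice of SHA-256 are assumptions made for this example.

```python
import hashlib


def route_to_partition(partition_key: str, partition_count: int) -> int:
    """Map a partition key to a partition index (assumed hash-modulo scheme)."""
    if partition_count < 1:
        raise ValueError("partition_count must be at least 1")
    digest = hashlib.sha256(partition_key.encode("utf-8")).digest()
    # Use the first 8 bytes of the digest as an unsigned integer.
    return int.from_bytes(digest[:8], "big") % partition_count


# Hypothetical device IDs used as partition keys.
for key in ["device-1", "device-2", "device-3"]:
    print(key, "->", route_to_partition(key, partition_count=8))
```

Because the mapping depends only on the key, every request for the same key lands on the same partition, so the client never has to track where its data lives.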
Elastically scalable throughput
• Elastically scale throughput from 100 to
10s of millions of requests/sec across
multiple regions.
• Customers pay by the hour for the
provisioned throughput.
• Transparent server side partition
management and routing.
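One way to picture provisioned throughput is as a token bucket that refills at the provisioned rate and can be resized at any time. The sketch below is a simplified model under that assumption; the class and method names are hypothetical, and the slides do not say how DocumentDB accounts for requests internally.

```python
class ProvisionedThroughput:
    """Token bucket that admits at most `requests_per_second` requests on average."""

    def __init__(self, requests_per_second: float):
        self.rate = requests_per_second
        self.tokens = requests_per_second  # start with a full bucket
        self.last = 0.0                    # timestamp of the previous call, in seconds

    def scale_to(self, requests_per_second: float) -> None:
        """Change the provisioned rate (elastic scale up or down)."""
        self.rate = requests_per_second
        self.tokens = min(self.tokens, self.rate)

    def try_acquire(self, now: float) -> bool:
        """Admit one request at time `now`, or reject it if the budget is spent."""
        elapsed = now - self.last
        self.last = now
        self.tokens = min(self.rate, self.tokens + elapsed * self.rate)
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False


limiter = ProvisionedThroughput(requests_per_second=2)
results = [limiter.try_acquire(now=0.0) for _ in range(3)]
print(results)
```

Time is passed in explicitly so the behaviour is deterministic; a real client would read a monotonic clock instead.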
[Figure: regions with more and less provisioned throughput at 9PM PST and at 11PM PST]
99.99% availability SLA
• Multi-homing APIs - apps don’t need to
be redeployed during regional failover
• Customers can simulate/trigger manual
failover (via portal or APIs)
• Automatic failover (policy driven) in the
event of regional failures
• All clusters configured with 10-20 fault domains (FDs)
• Each partition is protected by a replica set
• Majority quorum based durable,
synchronous commits within a DC
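A majority-quorum commit can be sketched as follows: a write counts as durable once more than half of the replicas in the set have acknowledged it. The replica set sizes used below are hypothetical, since the slides do not state how many replicas protect each partition.

```python
from typing import Sequence


def majority(replica_count: int) -> int:
    """Smallest number of replicas that forms a strict majority."""
    return replica_count // 2 + 1


def is_committed(acks: Sequence[bool]) -> bool:
    """A write commits once a majority of replicas have acknowledged it."""
    return sum(acks) >= majority(len(acks))


# Hypothetical replica sets: one entry per replica, True if it acknowledged the write.
print(is_committed([True, True, False]))         # 2 of 3 acknowledged
print(is_committed([True, False, False, True]))  # 2 of 4 acknowledged
```

With a strict majority, any two quorums overlap in at least one replica, so a committed write survives the loss of any minority of the set.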
High availability
Well-defined consistency models
• Global distribution forces us to
navigate the CAP theorem
• Intuitive programming model for well-
defined, relaxed consistency models
with clear PACELC tradeoffs
• Four well-defined consistency levels
to choose from
• Can be overridden on a per request
basis
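The per-request override can be pictured as a default level that an individual request may replace. The sketch below uses the four level names shown in the deck; the enum, the function, and the rule that any level may be chosen per request are assumptions for illustration, not the DocumentDB SDK.

```python
from enum import Enum
from typing import Optional


class ConsistencyLevel(Enum):
    STRONG = "Strong"
    BOUNDED_STALENESS = "BoundedStaleness"
    SESSION = "Session"
    EVENTUAL = "Eventual"


def effective_consistency(
    account_default: ConsistencyLevel,
    request_override: Optional[ConsistencyLevel] = None,
) -> ConsistencyLevel:
    """Use the request's level when one is given, otherwise fall back to the default."""
    return request_override if request_override is not None else account_default


default_level = ConsistencyLevel.SESSION
print(effective_consistency(default_level))
print(effective_consistency(default_level, ConsistencyLevel.EVENTUAL))
```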
[Figure: consistency spectrum from strong consistency (high latency) to eventual consistency (low latency)]
[Chart: observed distribution of consistency levels (Bounded Staleness, Eventual, Session, Strong); segments of 27%, 3%, 54% and 16%]
Schema agnostic indexing
• At global scale, ALTER TABLE and schema/index management
is a non-starter
• Automatic and synchronous indexing of all ingested content
• No need to define schemas or secondary indices upfront!
• Highly write optimized database engine with latch free and log
structured techniques
• Fully resource governed with back pressure and rate limiting
built into the log structured storage engine
• Online and in-situ index transformations
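Schema-agnostic indexing can be illustrated with a small inverted index keyed on (path, value) pairs: every leaf of every ingested document is indexed as it arrives, with no schema or index definition declared up front. This is a toy in-memory sketch; the class and function names are hypothetical and it does not reflect the engine's actual index layout.

```python
from collections import defaultdict


def flatten(document, prefix=""):
    """Yield (path, value) pairs for every leaf of a nested JSON-like document."""
    if isinstance(document, dict):
        for key, value in document.items():
            yield from flatten(value, f"{prefix}/{key}")
    elif isinstance(document, list):
        for position, value in enumerate(document):
            yield from flatten(value, f"{prefix}/{position}")
    else:
        yield prefix, document


class PathIndex:
    """Inverted index from (path, value) to the IDs of documents containing it."""

    def __init__(self):
        self._postings = defaultdict(set)

    def insert(self, doc_id, document):
        # Indexing happens synchronously as part of the insert.
        for path, value in flatten(document):
            self._postings[(path, value)].add(doc_id)

    def lookup(self, path, value):
        return set(self._postings.get((path, value), set()))


index = PathIndex()
index.insert("a", {"name": "Ada", "tags": ["x", "y"]})
index.insert("b", {"name": "Bob", "address": {"city": "Oslo"}})
print(index.lookup("/address/city", "Oslo"))
```

The two documents have different shapes, yet both become queryable on every field without any change to the index definition.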
No Schema, No Problem
Rich SQL and JavaScript queries
• No impedance mismatch - JavaScript is the
type system of the database engine
• Query using either SQL or JavaScript (or
both)
• Write business logic entirely in JavaScript
with stored procedures and triggers
• JavaScript language integrated multi-item
ACID transactions with snapshot isolation
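The all-or-nothing behaviour of a multi-item transaction can be sketched as follows. DocumentDB stored procedures are written in JavaScript; this example uses Python only to stay consistent with the other sketches here, and the store, helper names, and account data are hypothetical. The operation runs against a private copy of the items, and the copy replaces the originals only if the operation completes without raising.

```python
import copy


def run_transaction(store, operation):
    """Apply `operation` to a private copy of `store`; commit only if it succeeds."""
    working_copy = copy.deepcopy(store)
    operation(working_copy)  # any exception here leaves `store` untouched
    store.clear()
    store.update(working_copy)


def transfer(amount, source, target):
    """Build an operation that moves `amount` between two items."""
    def operation(items):
        items[target]["balance"] += amount
        if items[source]["balance"] < amount:
            raise ValueError("insufficient balance")
        items[source]["balance"] -= amount
    return operation


accounts = {"a": {"balance": 10}, "b": {"balance": 0}}
run_transaction(accounts, transfer(4, "a", "b"))
print(accounts)

try:
    run_transaction(accounts, transfer(100, "a", "b"))
except ValueError:
    pass  # the partial credit to "b" was discarded together with the copy
print(accounts)
```

The second transfer credits the target before it fails, which shows that partial changes never become visible.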
Accessing DocumentDB
[Diagram: native DocumentDB client drivers and native MongoDB client drivers (Java, .NET, Ruby, …) connect over TCP (SSL) or HTTPS to the DocumentDB database engine, whose SQL, JavaScript and MongoDB front ends share a Query IL and the database runtime]
Roadmap
• Unique Index
• Aggregates
• Deeper engine level integration
• Support for more databases
Customer Growth
• Significant customer growth since the initial
launch
• ISVs: Parse, Sitecore and others
• Large MongoDB customers running into
security, scalability, robustness issues with
MongoDB
MongoDB API Compatibility
Core database operations CRUD/Query
• Insert, InsertMany, InsertOne, Update, UpdateMany, UpdateOne, ReplaceOne, DeleteOne,
DeleteMany, Remove
• $inc, $mul, $rename, $set, $unset, $min, $max
• $addToSet, $pullAll, $pull, $pushAll, $slice, $push, $pop, $each, $sort, $position, $all, $size,
$elemMatch
• Bitwise, comparison, logical operators
• $type
• $mod, $regex
• 2dsphere, 2d, polygon, $near, $nearSphere, $geoWithin, $geoIntersects (incl. geometry
support for points, lines, polygons, sphere)
• find, insert, update, delete, getLastError, getMore, findAndModify
• getnonce, logout, authenticate
• createIndex, listIndexes, dropIndexes, connectionStatus, reIndex, listDatabases, collStats,
dbStats
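To make the update operators in the list concrete, the sketch below applies three of them ($set, $inc and $unset) to a plain dictionary following their MongoDB semantics. It is a pure-Python illustration limited to top-level fields; the function name is hypothetical and no MongoDB driver or DocumentDB endpoint is involved.

```python
def apply_update(document, update):
    """Return a copy of `document` with $set, $inc and $unset applied (top-level fields only)."""
    result = dict(document)
    for field, value in update.get("$set", {}).items():
        result[field] = value  # $set creates or overwrites the field
    for field, amount in update.get("$inc", {}).items():
        result[field] = result.get(field, 0) + amount  # a missing field starts from 0
    for field in update.get("$unset", {}):
        result.pop(field, None)  # removing a missing field is a no-op
    return result


post = {"title": "Draft", "views": 1, "draft": True}
updated = apply_update(post, {
    "$set": {"title": "Published"},
    "$inc": {"views": 2, "likes": 1},
    "$unset": {"draft": ""},
})
print(updated)
```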
Turnkey
• Fully managed, fully secure and compliant and backed by SLAs for availability, latency,
consistency and throughput
• Partitioned collections
• Global distribution across any number of regions
Security and Compliance
Security
Compliance
Certification details and compliance status:
• Strong privacy and security commitments (no mining of customer data for advertising; no voluntary disclosure to law enforcement agencies): Achieved
• Contractual commitment to meet US and EU data residency requirements: Achieved
• ISO 27001: Achieved
• ISO 27018: Achieved
• EU Model Clauses (EUMC): Achieved
• HIPAA Business Associate Agreement: Achieved
• PCI: Started (in progress)
• SOC 1 & SOC 2: Started (in progress)
• FedRAMP, IRS 1075, UK Official (IL2): Started (in progress)
• Health Information Trust Alliance (HITRUST): Planned
Recent Updates
DocumentDB Local Emulator
Free, downloadable, and high fidelity version of the cloud service for offline dev/test
Change Feed
• Lambda pattern with significantly lower TCO
• Single scalable database solution for both ingestion and query
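A change feed can be thought of as an ordered log of document changes that consumers read from a saved position, so the same store serves both ingestion and downstream processing. The sketch below is an in-memory model under that assumption; the class, the integer checkpoint, and the sample readings are hypothetical and are not the DocumentDB change feed API.

```python
class ChangeFeed:
    """Append-only log of document changes, read incrementally from a checkpoint."""

    def __init__(self):
        self._log = []

    def append(self, doc_id, document):
        """Record an insert or update in arrival order."""
        self._log.append((doc_id, document))

    def read_from(self, checkpoint):
        """Return the changes after `checkpoint` and the checkpoint to resume from."""
        changes = self._log[checkpoint:]
        return changes, checkpoint + len(changes)


feed = ChangeFeed()
feed.append("sensor-1", {"temp": 20})
feed.append("sensor-2", {"temp": 25})

first_batch, checkpoint = feed.read_from(0)
feed.append("sensor-1", {"temp": 21})
second_batch, checkpoint = feed.read_from(checkpoint)
print(len(first_batch), second_batch)
```

A consumer that persists its checkpoint can stop and resume without reprocessing changes it has already seen.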
Q&A


Azure DocumentDB Overview


Editor's Notes

  • #7:
    Retail: Used by Jet.com, Walmart, and Rakuten (#1 retailer in Japan). Schema flexibility and elastic scale are key (consider what happens on Black Friday). Also popular for content management systems like DNN.
    IoT: Used by Schneider Electric, Itron, and major car companies (Toyota and Ford); also used internally by Azure IoT Hub.
    Gaming: Used by the #1 on iPhone (Next Games), the #1 on PlayStation/Steam (Hello Games), the #1 on Xbox (Halo), and potentially a board game with Wizards.
    Social Analytics + Ad Tech: Used by MSN.com, Skype.
    Telemetry: Used by Windows Error Reporting, Universal Store.