Building near real-time HTAP solutions with Azure Cosmos DB & Azure Synapse Analytics
Sri Chintala
Program Manager, Microsoft
Fast NoSQL database with open APIs for any scale
• Azure Cosmos DB is optimized for operational workloads, with single-digit millisecond read and write latency
• 99.999% high availability, with guaranteed throughput and consistency
• Turnkey global data replication across all Azure regions
[Diagram: Real-time Applications & Services; Azure Cosmos DB]
But what if I want to run analytics in near real-time on my operational data at scale?
Running OLTP and OLAP workloads on the same database
• If you have large amounts of data, analytical queries take a long time to run and are resource intensive.
• Huge performance impact on the OLTP workloads.
[Diagram: Real-time Applications & Services; Azure Cosmos DB; Reporting & Dashboards; Azure Cosmos DB Spark connector]
Separating OLTP & OLAP
Ingest data periodically from Azure Cosmos DB to Data Lake
Manage data formats and storage layer to optimize for analytics
[Diagram: User Applications; Azure Cosmos DB; Extract (Pipelines); Data Lake; Transform; Enrich; Orchestrate; Serve; Apache Spark for Synapse; Synapse SQL; Power BI]
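The cost of periodic ingestion on this slide can be illustrated with a small sketch. It is only a toy model: the `Write` record, the minute-based timestamps, and the assumption that an export runs at every exact multiple of the interval and copies everything written up to that instant are all hypothetical, not part of the talk.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Write:
    """One operational write, stamped in minutes since the start of the day."""
    minute: int
    item_id: str


def visible_to_analytics(writes, export_interval_min, now_min):
    """Return the writes that a periodic export has already copied to the lake.

    Assumption: an export runs at every multiple of export_interval_min
    and copies everything written up to and including that minute.
    """
    last_export = (now_min // export_interval_min) * export_interval_min
    return [w for w in writes if w.minute <= last_export]


def staleness_min(export_interval_min, now_min):
    """Minutes elapsed since the most recent export ran."""
    return now_min % export_interval_min


writes = [Write(5, "order-1"), Write(50, "order-2"), Write(70, "order-3")]
seen = visible_to_analytics(writes, export_interval_min=60, now_min=75)
print([w.item_id for w in seen])   # order-3 was written after the last export
print(staleness_min(60, 75))
```

With an hourly export, a query at minute 75 does not see the write made at minute 70; shortening the interval narrows the gap but adds pipeline runs to operate.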
Azure Synapse Link for Azure Cosmos DB (Preview)
Breaking down the barrier between OLTP & OLAP
Azure Synapse Link: How does it work? (Preview)
Generate near real-time insights on your operational data
• Transactional Store: row store optimized for transactional operations
• Analytical Store: column store optimized for analytical queries
[Diagram: Operational Data; Azure Cosmos DB Container (Transactional Store, Auto-Sync, Analytical Store); Azure Synapse Link (Cloud-Native HTAP); Azure Synapse Analytics (SQL); Machine learning; Big data analytics; BI Dashboards]
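To make the row store versus column store distinction concrete, here is a minimal sketch in plain Python. It is not how Azure Cosmos DB implements either store; the sample documents and the field names (`storeId`, `sku`, `quantity`) are invented for illustration, and every record is assumed to carry the same fields.

```python
from collections import defaultdict

# Hypothetical operational documents, laid out as a row store would hold
# them: one complete record per item.
rows = [
    {"id": "1", "storeId": "s1", "sku": "A", "quantity": 3},
    {"id": "2", "storeId": "s2", "sku": "B", "quantity": 5},
    {"id": "3", "storeId": "s1", "sku": "A", "quantity": 2},
]


def to_columns(rows):
    """Pivot row-oriented records into one list per field (column-oriented).

    Assumes every record has the same set of fields.
    """
    columns = defaultdict(list)
    for row in rows:
        for field, value in row.items():
            columns[field].append(value)
    return dict(columns)


def total_quantity_by_sku(columns):
    """Aggregate while touching only the two columns the query needs."""
    totals = defaultdict(int)
    for sku, qty in zip(columns["sku"], columns["quantity"]):
        totals[sku] += qty
    return dict(totals)


columns = to_columns(rows)
print(columns["quantity"])
print(total_quantity_by_sku(columns))
```

The aggregate reads two columns and never looks at `id` or `storeId`, which is the property that makes a column layout a good fit for large analytical scans; a point lookup of one whole record is the case the row layout serves better.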
DEMO
Scenario
A retailer is looking to build a new-age supply chain management platform on Azure Cosmos DB.
The supply chain management system tracks retail operations across thousands of locations worldwide and tracks inventory across the hundreds of product SKUs sold.
Let us explore how Synapse Link for Cosmos DB enables the following operational analytics scenarios:
- Building an end-to-end sales forecasting pipeline
- Business Intelligence reporting
Azure Synapse Link Common Use Cases
Supply chain analytics, forecasting & reporting
IoT predictive maintenance
Real-time personalization
Questions
Additional Azure Cosmos DB sessions
INT 125 - Building scalable and secure applications with Azure Cosmos DB
@AzureCosmosDB youtube.com/azurecosmosdb


Building near real-time HTAP solutions using Synapse Link for Azure Cosmos DB


Editor's Notes

  • #3: What are the requirements of modern applications? Apps and services generate growing volumes of operational data. Users expect these apps and services to have millisecond response times and to be always on. Businesses have to serve users who are globally distributed. Hence businesses have chosen Azure Cosmos DB as their operational database of choice courtesy ..
  • #4: Generate insights over growing volumes of operational data at scale
  • #5: Data volume => OLTP performance impact
  • #6: To ensure that there is no performance impact on your applications or OLTP workloads, data engineers today have to create pipelines to move data from Cosmos DB to ADLS, operationalize those pipelines, and then monitor and manage them. The same has to be done where the data lands: ensure the data is in the right format (for example, columnar formats for analytics workloads), and create and manage the storage account the data is moved to. Only after all this can you start to use the flexibility of your analytical workloads, for example with Spark to transform or enrich the data and SQL to serve it. Even then, your analytical workloads are not running against the latest state of the operational data, because the data is ingested only periodically.
  • #7: We’re excited to announce Azure Synapse Link for Azure Cosmos DB, a cloud-native hybrid transactional and analytical processing (HTAP) capability that enables you to run analytics over operational data in Azure Cosmos DB with no ETL and no performance impact on your transactional workloads. Azure Synapse Link finally breaks down the barrier that has long existed between OLTP and OLAP systems. You can now generate near real-time analytics over your operational data in Cosmos DB at scale, with a ‘single click’.
  • #8: So what’s the magic powering Synapse Link?
    First animation: So far, your operational data in Cosmos DB has been stored internally in a row-oriented ‘transactional store’. This store is optimized for transactional reads and writes and for operational queries.
    Second animation: Now, we’re excited to bring you a fully managed, native ‘analytical store’ within the Azure Cosmos DB container. The analytical store is a fully isolated, column-oriented store, optimized for typical analytical queries over large volumes of data. Your inserts, updates, and deletes to operational data are automatically synced from the transactional store to the analytical store in near real-time, within minutes. This ‘auto-sync’ capability does not consume the RUs allocated for your operational workloads.
    Third animation: Now, with Synapse Link, you can connect the Cosmos DB analytical store to Synapse Analytics with a ‘single click’. This allows you to query operational data in Cosmos DB interchangeably using the Spark and SQL runtimes, so you can build machine learning pipelines and BI dashboards and run big-data analytics in near real-time on your operational data.
    Talking points (4 mins):
    - Introduce Synapse Link as a cloud-native HTAP capability for running near real-time analytics over operational data, with no ETL and no performance impact on transactional workloads.
    - Synapse Link for Cosmos DB consists of two main components: first, a built-in, isolated column store called the ‘Analytical Store’; second, native integration with the Spark and SQL runtimes of Azure Synapse Analytics.
    - Talk about the Analytical Store:
      - Auto-sync is a native capability: inserts, updates, and deletes to operational data are automatically synced from the transactional store to the analytical store in near real-time.
      - In public preview, the expected latency is 2 minutes, with further optimizations under way to reduce this to tens of seconds.
      - Auto-sync transforms your operational data from the row format of the transactional store into the column format of the analytical store, which is optimized for complex analytical queries and large scans.
      - No RU impact.
      - Support for global distribution, for analytics from a local copy.
    - Talk about Synapse runtime support: Spark, SQL Serverless.
  • #9: DEMO
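
As a companion to note #8, the following is a hedged sketch of how the analytical store might be read from a Synapse Spark notebook. The `cosmos.olap` format and the two option keys follow the Azure Synapse Link documentation as best I know it; the linked service name `CosmosDbLinkedService`, the container `RetailSales`, the columns `storeId` and `quantity`, and both helper functions are hypothetical names for illustration. The load call itself only works inside a Synapse Spark session with a linked Cosmos DB account.

```python
def synapse_link_read_options(linked_service, container):
    """Build the reader options that identify a Cosmos DB analytical store."""
    return {
        "spark.synapse.linkedService": linked_service,
        "spark.cosmos.container": container,
    }


def load_analytical_store(spark, linked_service, container):
    """Load the analytical store as a DataFrame.

    `spark` is expected to be the session a Synapse notebook provides.
    """
    options = synapse_link_read_options(linked_service, container)
    return spark.read.format("cosmos.olap").options(**options).load()


# In a Synapse notebook, where `spark` is predefined, usage might look like:
#   df = load_analytical_store(spark, "CosmosDbLinkedService", "RetailSales")
#   df.groupBy("storeId").sum("quantity").show()
print(synapse_link_read_options("CosmosDbLinkedService", "RetailSales"))
```

Because the read targets the analytical store rather than the transactional store, a scan like the `groupBy` above is served from the column-oriented copy and, per the notes, does not draw on the RUs provisioned for the operational workload.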