A developer's introduction to big data processing with Azure Databricks
“More than any other factor,
customer experiences determine
whether companies thrive and
profit, or struggle and fade.”
– Forrester Research
Speed
79%
won’t return
to a slow website
Personalization
38%
won't call again if they
have to repeat themselves
Consistency
65%
get frustrated with
inconsistent device experiences
Harness the power of
Big Data analytics
apps to exceed
customer needs
Enhance any type of
app with Big Data
analytics
Introducing
Wide World Importers
Online shopping through
company website
E-commerce directly in the
consumer’s hand, anywhere
Wide World Importers
Retail stores
Wide World Importers
Web e-commerce Mobile e-commerce
Wide World Importers seeks to expand
customers through an omni-channel strategy
Retail stores
The solutions needed
to reach more
customers and grow
the business
1. Scale with ease to reach more consumers
2. Unlock business insights from unstructured data
3. Enhance user experience with advanced analytics
4. Apply real-time analytics for instant updates
5. Infuse AI into apps to actively engage with customers
AZURE DATABRICKS
Microsoft Azure
Optimized Databricks Runtime Engine
DATABRICKS I/O SERVERLESS
Collaborative Workspace
Cloud storage
Data warehouses
Hadoop storage
IoT / streaming data
REST APIs
Machine learning models
BI tools
Data exports
Data warehouses
Azure Databricks
Enhance Productivity
Deploy Production Jobs & Workflows
APACHE SPARK
MULTI-STAGE PIPELINES
DATA ENGINEER
JOB SCHEDULER NOTIFICATION & LOGS
DATA SCIENTIST BUSINESS ANALYST
Build on secure & trusted cloud
Scale without limits
Azure Databricks
Logical Architectures
INGEST STORE PREP & TRAIN MODEL & SERVE
Azure Blob Storage
Logs, files and media
(unstructured)
Azure SQL Data
Warehouse
Azure Data Factory
Azure Analysis
Services
Azure Databricks
(Python, Scala, Spark SQL)
PolyBase
Business/custom apps
(Structured)
Power BI
Azure also supports other Big Data services like Azure HDInsight and Azure Data Lake to allow customers to tailor the above architecture to meet their unique needs.
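The four stages on this slide (ingest, store, prep & train, model & serve) can be sketched in miniature. The following is a plain-Python stand-in, not Spark or Databricks code: it only shows the shape of the work, turning unstructured log lines into structured records and then into a small aggregate that a BI tool could read. The log format, the field names (`channel`, `amount`) and the function names `prep` and `serve` are all assumptions made for illustration.

```python
import json
from collections import defaultdict

# Hypothetical raw log lines, standing in for unstructured files
# that an ingest step has landed in storage.
RAW_LOGS = [
    '{"ts": "2018-05-08T10:00:00", "channel": "web", "amount": 20.0}',
    '{"ts": "2018-05-08T10:01:00", "channel": "mobile", "amount": 35.5}',
    'corrupted line',
    '{"ts": "2018-05-08T10:02:00", "channel": "web", "amount": 4.5}',
]


def prep(lines):
    """Prep: parse raw lines into structured records, skipping malformed ones."""
    records = []
    for line in lines:
        try:
            rec = json.loads(line)
        except json.JSONDecodeError:
            continue  # drop lines that are not valid JSON
        if isinstance(rec, dict) and {"channel", "amount"} <= rec.keys():
            records.append(rec)
    return records


def serve(records):
    """Serve: total the amount per channel, the kind of table a report would read."""
    totals = defaultdict(float)
    for rec in records:
        totals[rec["channel"]] += rec["amount"]
    return dict(totals)


revenue_by_channel = serve(prep(RAW_LOGS))
print(revenue_by_channel)
```

In the architecture on the slide, the same two steps would run as distributed jobs over files in Azure Blob Storage rather than over an in-memory list.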
INGEST STORE PREP & TRAIN MODEL & SERVE
Azure Blob Storage
Logs, files and media
(unstructured)
Azure SQL Data
Warehouse
Azure Data Factory
Azure Analysis
Services
PolyBase
Business/custom apps
(Structured)
Power BI
Azure also supports other Big Data services like Azure HDInsight and Azure Data Lake to allow customers to tailor the above architecture to meet their unique needs.
Azure Databricks
(Python, Scala, Spark SQL)
Azure Databricks
(Spark ML, SparkR, sparklyr)
Intelligent Apps
Cosmos DB
INGEST STORE PREP & TRAIN MODEL & SERVE
Logs, files and media
(unstructured)
Sensors and IoT
(unstructured)
HDInsight
(Kafka)
Power BI
Azure Databricks
(Python, Scala, Spark SQL)
Intelligent Apps
Cosmos DB
Event Hub
IoT Hub
Azure Databricks
(Spark ML, SparkR, sparklyr)
Azure Blob Storage
Batch Data
(Apps, logs) Azure Data Factory
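This slide adds a streaming path (sensors and IoT devices feeding Event Hub, IoT Hub or Kafka) alongside the batch path. A typical operation on such a path is a windowed aggregate, for example an average reading per device per minute. The sketch below shows that idea in plain Python over an in-memory list. It is not Spark Structured Streaming code, and the event layout, the 60-second window and the function name `tumbling_window_averages` are assumptions for illustration.

```python
from collections import defaultdict

WINDOW_SECONDS = 60  # assumed window size

# Hypothetical events: (seconds since start, device id, reading).
EVENTS = [
    (0, "sensor-a", 10.0),
    (30, "sensor-a", 20.0),
    (61, "sensor-a", 30.0),
    (65, "sensor-b", 5.0),
    (119, "sensor-b", 7.0),
]


def tumbling_window_averages(events, window_seconds):
    """Average the readings per (window start, device) over fixed, non-overlapping windows."""
    sums = defaultdict(lambda: [0.0, 0])  # key -> [running total, count]
    for ts, device, value in events:
        window_start = (ts // window_seconds) * window_seconds
        acc = sums[(window_start, device)]
        acc[0] += value
        acc[1] += 1
    return {key: total / count for key, (total, count) in sums.items()}


averages = tumbling_window_averages(EVENTS, WINDOW_SECONDS)
for (window_start, device), avg in sorted(averages.items()):
    print(window_start, device, avg)
```

A real streaming engine would also have to handle late and out-of-order events, which this sketch ignores.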
Azure Databricks at //BUILD 2018
Tuesday:
• BRK3320 The Developer Data Scientist – Creating New Analytics
Driven Applications using Apache Spark with Azure Databricks
• WRK2601 Using Databricks to Analyze Telemetry Data Stored in
Azure Blob Storage
Wednesday:
• BRK4102 ETL 2.0 - Data Engineering for developers
• BRK3314 Leveraging Azure Databricks to minimize time to insight
by combining Batch and Stream processing pipelines.
• BRK3708 Machine learning at scale
