SlideShare a Scribd company logo
Azure Big Data Story
@LynnLangit
I love to…Learn & Build
Azure Big Data Story
Azure Big Data by the V’s
Value
Volume Velocity Variety Veracity
Big Data = Business Value ?
60% of Big Data projects
FAIL
to go beyond pilot and will be abandoned
(through 2017) - Gartner
Volume
How big is Big Data?
Variety
Is my Data rectangular?
Persistence Choices
Files
Hadoop
NoSQL
Relational
Azure Big Data Story
Azure Persistence Choices
• Storage
• Store Simple
Files
• Data Lake
• HDInsight
• Cloudera
Hadoop
• MANY…
NoSQL
• SQL Azure
• SQL Azure DW
• SQL Server on Azure
VM
Relational
Drilling In: Relational AND NoSQL
Azure Persistence Choices Detailed
• Storage
• Store Simple
Files
• Data Lake
• HDInsight
• Cloudera
Hadoop • Redis Caching
• DataStax Enterprise
• Document DB
• Mongo Labs
• Graph Engine
NoSQL
• SQL Azure
• SQL Azure DW
• SQL Server on Azure VM
Relational
Velocity
How fast is my Data?
Veracity
How clean is my Data?
Load Choices
Load
Stream
Batch
Azure Load Choices
Load Libraries
StreamEvent Hub
Batch
Stream
Analytics
Data Cleaning Choices
ETL
Client
Machine
Learning
Azure Data Cleaning Choices
ETL
• SQL Server VM
• Data Pipeline
Client
• Power BI
• Power Query
ML
• Azure ML
• Data Marketplace
• SQL Server DQS
Data Pipelines
Azure Data Factory
Azure Big Data Story
Public Cloud
or
Hybrid Cloud
Data
Model
On Premise
SQL Server+
Cloud
Azure+
Key-Value
Queues
None
Windows Queues
Azure Redis Cache
Azure Queues
Wide Sparse
Columns
Columnstore Index
SSAS Tabular Models
Azure Tables
DataStax Enterprise (Cassandra)
Files FileTable, Filestream
XML data type
Azure BLOB Store
StoreSimple
JSON or
Graph
SQL Server 2016
None
Azure DocumentDB / Graph Engine (beta)
Hosted MongoDB or Neo4J
Large
Relational
SQL Server Enterprise
PDW
SQL Analysis Services
SQL Database (basic, standard, premium)
APS
SQL Data Warehouse
Hadoop Hortonworks HDInsight/ Data Lake,
Hosted Cloudera
Other Stream Insight Event Hub, StreamAnalytics, ML
Marketplace
Azure Big Data Story
Value
How useful is my Big Data?
Big Data = Business Value ?
60% of Big Data projects
FAIL
to go beyond pilot and will be abandoned
(through 2017) - Gartner
Azure Big Data Story
Architectural
Patterns
Architecture 1- File Storage / Backup
Architecture 2- Data Warehouse
Architecture 3 – Operational Database
Old
becomes
New
Architecture 4 – Small Big Data
MORE DATA
Architecture 5 – Big Data
www.TeachingKidsProgramming.org
Azure Big Data Story
@LynnLangit

More Related Content

PPTX
Database Choices
PPTX
Big data in Azure
PPTX
Bleeding Edge Databases
PPTX
Using Premium Data - for Business Analysts
PPTX
Big Data on azure
PDF
Cloud Big Data Architectures
PPTX
4Developers 2018: Przetwarzanie Big Data w oparciu o architekturę Lambda na p...
PPTX
Integration Monday - Analysing StackExchange data with Azure Data Lake
Database Choices
Big data in Azure
Bleeding Edge Databases
Using Premium Data - for Business Analysts
Big Data on azure
Cloud Big Data Architectures
4Developers 2018: Przetwarzanie Big Data w oparciu o architekturę Lambda na p...
Integration Monday - Analysing StackExchange data with Azure Data Lake

What's hot (18)

PPTX
Options for Data Prep - A Survey of the Current Market
PPTX
Simplifying And Accelerating Data Access for Python With Dremio and Apache Arrow
PDF
Cortana Analytics Workshop: Azure Data Lake
PDF
DBP-010_Using Azure Data Services for Modern Data Applications
PDF
Azure Databricks—Apache Spark as a Service with Sascha Dittmann
PDF
Modern Data architecture Design
PPTX
Microsoft Machine Learning Smackdown
PPTX
TechDays NL 2016 - Building your scalable secure IoT Solution on Azure
PDF
Building Data Lakes with Apache Airflow
PPTX
Finding new Customers using D&B and Excel Power Query
PPTX
Lecture1
PPTX
Getting to 1.5M Ads/sec: How DataXu manages Big Data
PDF
Big data on AWS
PDF
Bi on Big Data - Strata 2016 in London
PPTX
A developer's introduction to big data processing with Azure Databricks
PPTX
NoSQL for the SQL Server Pro
PPTX
REDSHIFT - Amazon
PPTX
Apache Arrow: In Theory, In Practice
Options for Data Prep - A Survey of the Current Market
Simplifying And Accelerating Data Access for Python With Dremio and Apache Arrow
Cortana Analytics Workshop: Azure Data Lake
DBP-010_Using Azure Data Services for Modern Data Applications
Azure Databricks—Apache Spark as a Service with Sascha Dittmann
Modern Data architecture Design
Microsoft Machine Learning Smackdown
TechDays NL 2016 - Building your scalable secure IoT Solution on Azure
Building Data Lakes with Apache Airflow
Finding new Customers using D&B and Excel Power Query
Lecture1
Getting to 1.5M Ads/sec: How DataXu manages Big Data
Big data on AWS
Bi on Big Data - Strata 2016 in London
A developer's introduction to big data processing with Azure Databricks
NoSQL for the SQL Server Pro
REDSHIFT - Amazon
Apache Arrow: In Theory, In Practice
Ad

Viewers also liked (12)

PPTX
PPTX
Microsoft Azure Big Data Analytics
PPTX
Azure Spark - Big Data - Coresic 2016
PDF
Big data on Azure for Architects
PPTX
Big Data en Azure: Azure Data Lake
PPTX
Intorducing Big Data and Microsoft Azure
PDF
Dive into Spark Streaming
PPTX
Big data architectures and the data lake
PPTX
Getting started with microsoft azure in 30 mins
PPTX
Azure Data Lake Analytics Deep Dive
PPTX
Microsoft Azure vs Amazon Web Services (AWS) Services & Feature Mapping
PPTX
Microsoft Cloud Computing - Windows Azure Platform
Microsoft Azure Big Data Analytics
Azure Spark - Big Data - Coresic 2016
Big data on Azure for Architects
Big Data en Azure: Azure Data Lake
Intorducing Big Data and Microsoft Azure
Dive into Spark Streaming
Big data architectures and the data lake
Getting started with microsoft azure in 30 mins
Azure Data Lake Analytics Deep Dive
Microsoft Azure vs Amazon Web Services (AWS) Services & Feature Mapping
Microsoft Cloud Computing - Windows Azure Platform
Ad

Similar to Azure Big Data Story (20)

PPTX
Big Data Analytics: Finding diamonds in the rough with Azure
PDF
Big Data - Module 1
PPTX
Azure Data Engineer Online Training Course - Azure Data Engineer Training Ame...
PPTX
Microsoft cloud big data strategy
PPTX
Microsoft Azure News - Dec 2016
PPTX
Big Data Platform Landscape by 2017
PDF
Big Data for the Rest of Us - OpenWest 2014 - Matt Asay
PPTX
Azure data platform overview
PPTX
Hadoop in the Cloud: Real World Lessons from Enterprise Customers
PPTX
Big Data on Azure Tutorial
PPTX
Arquitectura de Datos en Azure
PPT
Integrating RDBMS with Big Data V3.0 now with SPARK!
PDF
[WITH THE VISION 2017] IoT/AI時代を生き抜くためのデータ プラットフォーム (Leveraging Azure Data Se...
PPTX
AzureDay - Introduction Big Data Analytics.
PDF
Java/Scala Lab: Anton Vidishchev - Microsoft Azure как облачная платформа для...
PPTX
storage on windows azure
PPTX
Choosing technologies for a big data solution in the cloud
PPTX
Choosing right data store & processing
PPTX
Introduction to Microsoft’s Hadoop solution (HDInsight)
PDF
Prague data management meetup 2018-03-27
Big Data Analytics: Finding diamonds in the rough with Azure
Big Data - Module 1
Azure Data Engineer Online Training Course - Azure Data Engineer Training Ame...
Microsoft cloud big data strategy
Microsoft Azure News - Dec 2016
Big Data Platform Landscape by 2017
Big Data for the Rest of Us - OpenWest 2014 - Matt Asay
Azure data platform overview
Hadoop in the Cloud: Real World Lessons from Enterprise Customers
Big Data on Azure Tutorial
Arquitectura de Datos en Azure
Integrating RDBMS with Big Data V3.0 now with SPARK!
[WITH THE VISION 2017] IoT/AI時代を生き抜くためのデータ プラットフォーム (Leveraging Azure Data Se...
AzureDay - Introduction Big Data Analytics.
Java/Scala Lab: Anton Vidishchev - Microsoft Azure как облачная платформа для...
storage on windows azure
Choosing technologies for a big data solution in the cloud
Choosing right data store & processing
Introduction to Microsoft’s Hadoop solution (HDInsight)
Prague data management meetup 2018-03-27

More from Lynn Langit (20)

PPTX
VariantSpark on AWS
PPTX
Serverless Architectures
PPTX
10+ Years of Teaching Kids Programming
PPTX
Blastn plus jupyter on Docker
PDF
Testing in Ballerina Language
PPTX
Teaching Kids to create Alexa Skills
PPTX
Practical cloud
PPTX
Understanding Jupyter notebooks using bioinformatics examples
PPTX
Genome-scale Big Data Pipelines
PPTX
Teaching Kids Programming
PPTX
Practical Cloud
PPTX
Serverless Reality
PPTX
Genomic Scale Big Data Pipelines
PPTX
VariantSpark - a Spark library for genomics
PPTX
Bioinformatics Data Pipelines built by CSIRO on AWS
PPTX
Serverless Reality
PDF
Beyond Relational
PPTX
New AWS Services for Bioinformatics
PPTX
Google Cloud and Data Pipeline Patterns
PPTX
Scaling Galaxy on Google Cloud Platform
VariantSpark on AWS
Serverless Architectures
10+ Years of Teaching Kids Programming
Blastn plus jupyter on Docker
Testing in Ballerina Language
Teaching Kids to create Alexa Skills
Practical cloud
Understanding Jupyter notebooks using bioinformatics examples
Genome-scale Big Data Pipelines
Teaching Kids Programming
Practical Cloud
Serverless Reality
Genomic Scale Big Data Pipelines
VariantSpark - a Spark library for genomics
Bioinformatics Data Pipelines built by CSIRO on AWS
Serverless Reality
Beyond Relational
New AWS Services for Bioinformatics
Google Cloud and Data Pipeline Patterns
Scaling Galaxy on Google Cloud Platform

Recently uploaded (20)

PDF
Nekopoi APK 2025 free lastest update
PDF
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PPTX
Introduction to Artificial Intelligence
PDF
Understanding Forklifts - TECH EHS Solution
PDF
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
PPTX
CHAPTER 2 - PM Management and IT Context
PDF
AI in Product Development-omnex systems
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PDF
System and Network Administration Chapter 2
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 41
PPTX
Odoo POS Development Services by CandidRoot Solutions
PDF
Wondershare Filmora 15 Crack With Activation Key [2025
PDF
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PDF
medical staffing services at VALiNTRY
PDF
Digital Strategies for Manufacturing Companies
PDF
Flood Susceptibility Mapping Using Image-Based 2D-CNN Deep Learnin. Overview ...
PDF
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
PDF
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
Nekopoi APK 2025 free lastest update
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
Introduction to Artificial Intelligence
Understanding Forklifts - TECH EHS Solution
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
CHAPTER 2 - PM Management and IT Context
AI in Product Development-omnex systems
Design an Analysis of Algorithms II-SECS-1021-03
System and Network Administration Chapter 2
Internet Downloader Manager (IDM) Crack 6.42 Build 41
Odoo POS Development Services by CandidRoot Solutions
Wondershare Filmora 15 Crack With Activation Key [2025
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
medical staffing services at VALiNTRY
Digital Strategies for Manufacturing Companies
Flood Susceptibility Mapping Using Image-Based 2D-CNN Deep Learnin. Overview ...
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf

Azure Big Data Story

Editor's Notes

  • #3: https://about.me/LynnLangit
  • #4: ScottGu Blog - http://weblogs.asp.net/scottgu Azure Big Data -- http://azure.microsoft.com/blog/topics/big-data/ Data Factory Pipeline Sample (Blog) -- http://azure.microsoft.com/blog/2015/04/24/azure-data-factory-update-simplified-sample-deployment/
  • #6: http://www.bain.com/publications/articles/three-promises-and-perils-of-big-data.aspx
  • #12: http://azure.microsoft.com/en-us/documentation/infographics/building-real-world-cloud-apps/
  • #20: Image credit - http://tapoueh.org/images/pipeline.png Azure Data Factory tutorial - http://azure.microsoft.com/en-us/documentation/articles/data-factory-get-started-using-editor/
  • #21: Image credit - http://tapoueh.org/images/pipeline.png Azure Data Factory tutorial - http://azure.microsoft.com/en-us/documentation/articles/data-factory-get-started-using-editor/
  • #22: ScottGu Blog - http://weblogs.asp.net/scottgu Azure Big Data -- http://azure.microsoft.com/blog/topics/big-data/ Data Factory Pipeline Sample (Blog) -- http://azure.microsoft.com/blog/2015/04/24/azure-data-factory-update-simplified-sample-deployment/
  • #25: http://recode.net/2014/04/15/microsofts-big-data-angle-office-as-a-friendly-front-end/
  • #27: http://www.bain.com/publications/articles/three-promises-and-perils-of-big-data.aspx
  • #28: http://www.wsj.com/articles/ubers-new-funding-values-it-at-over-41-billion-1417715938
  • #33: http://www.nytimes.com/newsgraphics/2013/08/18/reshaping-new-york/
  • #35: www.thingful.net