SlideShare a Scribd company logo
Usama wahab Khan
MVP,MCT, CTO @Evolution Technologies
Usama Wahab Khan
Father, data Scientist, Developer/Nerd, Traveler
Twitter : @usamawahabkhan
LinkedIn : Usamawahabkhan
Data abundance
Processes Businesses are tasked to store,
interpret, manage, transform,
process, aggregate and report
on data
Consumers There are a wider range of
consumers using different
types of devices to consume or
generate data
Variety There’s a wider variety of data
types that need to be
processed and stored
Responsibiliti
es
A data engineers role is
responsible for more data types
and technologies
Technologies Microsoft Azure provides a
wide set of tools and
technologies
New skills
for new
platforms
Changing
loading
approaches
From
implementi
ng to
provisionin
g
Data engineering job
responsibilities
CONTROL EASE OF USE
Azure Data Lake
Analytics
Any Hadoop technology,
any distribution
Workload optimized,
managed clusters
Data Engineering in a
Job-as-a-service model
Azure Marketplace
HDP | CDH | MapR
Azure Data Lake
Analytics
Virtual Machines Managed Clusters Big Data as-a-service
Azure HDInsight
Frictionless & Optimized
Spark clusters
Azure Databricks
BIG
DATA
ANALYTICS
Reduced
Administration
B I G D ATA I N M I C R O S O F T A Z U R E
Azure Data Lake Store
Azure Storage
BIG
DATA
STORAGE
What is Azure Databricks?
A fast, easy and collaborative Apache® Spark™ based analytics platform optimized for Azure
Designed in collaboration with the founders of Apache Spark
One-click set up; streamlined workflows
Interactive workspace that enables collaboration between data scientists, data engineers, and business
analysts.
Native integration with Azure services (Power BI, SQL DW, Cosmos DB, Blob Storage)
Enterprise-grade Azure security (Active Directory integration, compliance, enterprise -grade SLAs)
Best of Microsoft
Best of Databricks
Azure databricks by usama whaba khan
Azure Databricks
Enhance Productivity Build on secure & trusted cloud Scale without limits
Azure databricks by usama whaba khan
Reference architecture
Reference architecture Business intelligence
Reference architecture Real-time analytics Big data
Demo
Q & A
Usama Wahab Khan
Twitter : @usamawahabkhan
LinkedIn : Usamawahabkhan
Thank you 

More Related Content

PDF
Modernizing to a Cloud Data Architecture
PPTX
MCT Summit Azure automated Machine Learning
PPTX
Eugene Polonichko "Architecture of modern data warehouse"
PDF
Analytics in a Day Virtual Workshop
 
PDF
2016 Spark Summit East Keynote: Ali Ghodsi and Databricks Community Edition demo
PPTX
Sören Eickhoff, Informatica GmbH, "Informatica Intelligent Data Lake – Self S...
PDF
Modern Data Architecture
PDF
Analytics in a Day Virtual Workshop
 
Modernizing to a Cloud Data Architecture
MCT Summit Azure automated Machine Learning
Eugene Polonichko "Architecture of modern data warehouse"
Analytics in a Day Virtual Workshop
 
2016 Spark Summit East Keynote: Ali Ghodsi and Databricks Community Edition demo
Sören Eickhoff, Informatica GmbH, "Informatica Intelligent Data Lake – Self S...
Modern Data Architecture
Analytics in a Day Virtual Workshop
 

What's hot (20)

PPTX
Azure Databricks - An Introduction (by Kris Bock)
PDF
Azure databricks c sharp corner toronto feb 2019 heather grandy
PDF
Analytics in a Day Ft. Synapse Virtual Workshop
 
PDF
DataOps for the Modern Data Warehouse on Microsoft Azure @ NDCOslo 2020 - Lac...
PDF
Analytics in a Day Ft. Synapse Virtual Workshop
 
PPTX
Ai & Data Analytics 2018 - Azure Databricks for data scientist
PPTX
Big Data Analytics in the Cloud with Microsoft Azure
PDF
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
PPTX
Running cost effective big data workloads with Azure Synapse and ADLS (MS Ign...
PDF
Introducing MLflow for End-to-End Machine Learning on Databricks
PDF
Definitive Guide to Select Right Data Warehouse (2020)
PDF
Building Data Lakes with Apache Airflow
PDF
Einstieg in Machine Learning für Datenbankentwickler
PDF
From hadoop to spark
PPTX
Big Data with Azure
PDF
The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...
PDF
Analytics in a Day Ft. Synapse Virtual Workshop
 
PPTX
Azure cafe marketplace with looker data analytics
PDF
Azure Synapse Analytics Teaser (Microsoft TechX Oslo 2019)
PDF
Building an Enterprise Data Platform with Azure Databricks to Enable Machine ...
Azure Databricks - An Introduction (by Kris Bock)
Azure databricks c sharp corner toronto feb 2019 heather grandy
Analytics in a Day Ft. Synapse Virtual Workshop
 
DataOps for the Modern Data Warehouse on Microsoft Azure @ NDCOslo 2020 - Lac...
Analytics in a Day Ft. Synapse Virtual Workshop
 
Ai & Data Analytics 2018 - Azure Databricks for data scientist
Big Data Analytics in the Cloud with Microsoft Azure
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
Running cost effective big data workloads with Azure Synapse and ADLS (MS Ign...
Introducing MLflow for End-to-End Machine Learning on Databricks
Definitive Guide to Select Right Data Warehouse (2020)
Building Data Lakes with Apache Airflow
Einstieg in Machine Learning für Datenbankentwickler
From hadoop to spark
Big Data with Azure
The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...
Analytics in a Day Ft. Synapse Virtual Workshop
 
Azure cafe marketplace with looker data analytics
Azure Synapse Analytics Teaser (Microsoft TechX Oslo 2019)
Building an Enterprise Data Platform with Azure Databricks to Enable Machine ...
Ad

Similar to Azure databricks by usama whaba khan (20)

PPTX
Azure Certification | Azure Fundamentals to DevOps
PDF
Azure Certification | Azure Fundamentals to DevOps
PPTX
Azure Data Engineer Training | Azure Data Engineer Course in Hyderabad
PPTX
Azure Data Engineer Training In Hyderabad | Microsoft Azure
DOCX
What are the basic key concepts before learning Azure Data Engineer.docx
PPTX
AzureDay - Introduction Big Data Analytics.
PPTX
Azure data engineering PPT.pptxAzure data engineering PPT.pptx
PDF
Trivadis Azure Data Lake
PPTX
Data Engineer Course in Hyderabad - Azure Data Engineer Course Hyderabad.pptx
PPTX
Microsoft Azure Data Engineer Training | Azure Data Engineer Course in Hyderabad
PDF
Big Data Adavnced Analytics on Microsoft Azure
PPTX
Big Data Analytics: Finding diamonds in the rough with Azure
PPTX
Azure Data Engineer Course | Microsoft Azure Data Engineer.pptx
PPTX
Analyzing StackExchange data with Azure Data Lake
PPTX
Power BI for Big Data and the New Look of Big Data Solutions
PDF
Modern Business Intelligence and Advanced Analytics
PPTX
Demystifying data engineering
PPTX
Introduction to Data Engineering
PDF
Big Data Analytics from Azure Cloud to Power BI Mobile
PPTX
Building data pipelines for modern data warehouse with Apache® Spark™ and .NE...
Azure Certification | Azure Fundamentals to DevOps
Azure Certification | Azure Fundamentals to DevOps
Azure Data Engineer Training | Azure Data Engineer Course in Hyderabad
Azure Data Engineer Training In Hyderabad | Microsoft Azure
What are the basic key concepts before learning Azure Data Engineer.docx
AzureDay - Introduction Big Data Analytics.
Azure data engineering PPT.pptxAzure data engineering PPT.pptx
Trivadis Azure Data Lake
Data Engineer Course in Hyderabad - Azure Data Engineer Course Hyderabad.pptx
Microsoft Azure Data Engineer Training | Azure Data Engineer Course in Hyderabad
Big Data Adavnced Analytics on Microsoft Azure
Big Data Analytics: Finding diamonds in the rough with Azure
Azure Data Engineer Course | Microsoft Azure Data Engineer.pptx
Analyzing StackExchange data with Azure Data Lake
Power BI for Big Data and the New Look of Big Data Solutions
Modern Business Intelligence and Advanced Analytics
Demystifying data engineering
Introduction to Data Engineering
Big Data Analytics from Azure Cloud to Power BI Mobile
Building data pipelines for modern data warehouse with Apache® Spark™ and .NE...
Ad

More from Usama Wahab Khan Cloud, Data and AI (15)

PPTX
unleshing the the Power Azure Open AI - MCT Summit middle east 2024 Riyhad.pptx
PPTX
TechDayPakistan-Slides RAG with Cosmos DB.pptx
PPTX
introduction Azure OpenAI by Usama wahab khan
PPTX
ServerLess by usama Azure fuctions.pptx
PPTX
Azure synapse by usama whaba khan
PPTX
Introduction to development using the share point framework mv ps
PPTX
GIS Into to Cloud Microsoft Azure
PPTX
Build with Serverless Applications with azure functions By usama wahab Khan
PPTX
Microsoft PowerApps Introduction by Usama Wahab Khan MVP
PPTX
PPTX
Windows azure overview for SharePoint Pros
PPTX
Developing apps for share point 2013
PPTX
SPS Gulf : SharePoint 2013 Cloud Business App
PPTX
SharePoint 2013 REST and CSOM
unleshing the the Power Azure Open AI - MCT Summit middle east 2024 Riyhad.pptx
TechDayPakistan-Slides RAG with Cosmos DB.pptx
introduction Azure OpenAI by Usama wahab khan
ServerLess by usama Azure fuctions.pptx
Azure synapse by usama whaba khan
Introduction to development using the share point framework mv ps
GIS Into to Cloud Microsoft Azure
Build with Serverless Applications with azure functions By usama wahab Khan
Microsoft PowerApps Introduction by Usama Wahab Khan MVP
Windows azure overview for SharePoint Pros
Developing apps for share point 2013
SPS Gulf : SharePoint 2013 Cloud Business App
SharePoint 2013 REST and CSOM

Recently uploaded (20)

PPT
Teaching material agriculture food technology
PPTX
A Presentation on Artificial Intelligence
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Electronic commerce courselecture one. Pdf
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PDF
cuic standard and advanced reporting.pdf
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
A comparative analysis of optical character recognition models for extracting...
PPTX
MYSQL Presentation for SQL database connectivity
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
Tartificialntelligence_presentation.pptx
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
Machine Learning_overview_presentation.pptx
PDF
Network Security Unit 5.pdf for BCA BBA.
Teaching material agriculture food technology
A Presentation on Artificial Intelligence
Advanced methodologies resolving dimensionality complications for autism neur...
Unlocking AI with Model Context Protocol (MCP)
Mobile App Security Testing_ A Comprehensive Guide.pdf
Electronic commerce courselecture one. Pdf
SOPHOS-XG Firewall Administrator PPT.pptx
cuic standard and advanced reporting.pdf
Assigned Numbers - 2025 - Bluetooth® Document
The Rise and Fall of 3GPP – Time for a Sabbatical?
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
A comparative analysis of optical character recognition models for extracting...
MYSQL Presentation for SQL database connectivity
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Tartificialntelligence_presentation.pptx
Encapsulation_ Review paper, used for researhc scholars
Diabetes mellitus diagnosis method based random forest with bat algorithm
Machine Learning_overview_presentation.pptx
Network Security Unit 5.pdf for BCA BBA.

Azure databricks by usama whaba khan

  • 1. Usama wahab Khan MVP,MCT, CTO @Evolution Technologies
  • 2. Usama Wahab Khan Father, data Scientist, Developer/Nerd, Traveler Twitter : @usamawahabkhan LinkedIn : Usamawahabkhan
  • 3. Data abundance Processes Businesses are tasked to store, interpret, manage, transform, process, aggregate and report on data Consumers There are a wider range of consumers using different types of devices to consume or generate data Variety There’s a wider variety of data types that need to be processed and stored Responsibiliti es A data engineers role is responsible for more data types and technologies Technologies Microsoft Azure provides a wide set of tools and technologies
  • 4. New skills for new platforms Changing loading approaches From implementi ng to provisionin g Data engineering job responsibilities
  • 5. CONTROL EASE OF USE Azure Data Lake Analytics Any Hadoop technology, any distribution Workload optimized, managed clusters Data Engineering in a Job-as-a-service model Azure Marketplace HDP | CDH | MapR Azure Data Lake Analytics Virtual Machines Managed Clusters Big Data as-a-service Azure HDInsight Frictionless & Optimized Spark clusters Azure Databricks BIG DATA ANALYTICS Reduced Administration B I G D ATA I N M I C R O S O F T A Z U R E Azure Data Lake Store Azure Storage BIG DATA STORAGE
  • 6. What is Azure Databricks? A fast, easy and collaborative Apache® Spark™ based analytics platform optimized for Azure Designed in collaboration with the founders of Apache Spark One-click set up; streamlined workflows Interactive workspace that enables collaboration between data scientists, data engineers, and business analysts. Native integration with Azure services (Power BI, SQL DW, Cosmos DB, Blob Storage) Enterprise-grade Azure security (Active Directory integration, compliance, enterprise -grade SLAs) Best of Microsoft Best of Databricks
  • 8. Azure Databricks Enhance Productivity Build on secure & trusted cloud Scale without limits
  • 12. Reference architecture Real-time analytics Big data
  • 13. Demo
  • 14. Q & A Usama Wahab Khan Twitter : @usamawahabkhan LinkedIn : Usamawahabkhan

Editor's Notes

  • #2: Introduce the team (self-introductions). Mention LearnAI – team. 3 day airlift, transition from pure databricks to AML We will use notebooks to introduce tools and techniques, and then return to one use-case We have three kinds of session: (1) presentation style, (2) demos (w/ small exercises), (3) hands-on labs. Last day is a Hackathon (w/ two use cases) Check people’s skills. Experience with Databricks, Jupyter notebooks, VS Code, Deep Learning. Who has heard of AMLCompute? Who has used it? Who has used CI/CD and git version control?
  • #4: Instructor notes It important to stress that at this stage we are just going through a high level overview of how the growth of data has had an impact on a wide range of people, processes and technologies. Is there anyone in the classroom that are having to deal with new data types, processes, or technologies? Has their role evolved? Use the answers to drill down into a short discussion if necessary. The end game is to ensure that the students see relevancy to learning the material you are about to present
  • #5: Instructor notes This slide outlines the three core areas that Data Engineers will be responsible for. First is a responsibility to learn new skills for the new platforms. This may mean understanding new data storage paradigms such as No-SQL solutions, or streaming data solutions. It will also likely involve learning new languages depending on the technologies that are provisioned. Data Engineers will also have to be open to the idea that data loading techniques that were appropriate for on-premises scenarios, may not necessarily work for processing data within the cloud. More details will follow as they go through the course. Students should also understand that with the move from on-premises to the cloud, they will move from a place of physically implementing machines and services, to provisioning them either using the Azure portal, or likely be creating code that can be used to quickly deploy services with minimal errors. Azure Administrators refer to this as infrastructure as code