SlideShare a Scribd company logo
Azure Data Factory: Data Wrangling
Power Query in ADF
Updated Public Preview Q1 CY21
What is Data Wrangling?
 Code-free data exploration and data prep
 Operationalize Power Query as an activity by
translating M script in ADF data flow script
 Execute Power Query as a pipeline activity using the
ADF data flow serverless, scaled-out, ADF-managed
Apache Spark engine
 Essentially acts as a data-first entry point to building
ADF data flows
ADF Data Wrangling Use Cases
 Data Engineer is building an ETL process in ADF uses PQ to explore data using data profiling
 Business Analyst is a PQ desktop user and wishes to operationalize their M query in a data
pipeline that sinks data in the Lake
 Data Engineer needs to prep data for modeling and ETL by using a data-first approach. Creates a
PQ wrangling activity and adds it to pipeline.
 Trimming strings
 Data type conversions
 Rename columns
 Remove columns
 Value prop
 “Data Wrangling in ADF”, not “Power Query lift-and-shift”
ADF Data Wrangling Roadmap – PQ Activity
 Continue to add more M functions to fold into Spark
 Add more native connectors that work in both ADF & Power Query
 Enable V-Net in Power Query Online data wrangling experience in ADF
 Launch PQ activity in Synapse Pipelines
 Enable interactive monitoring similar to Copy and Data Flow
Additional
resources
Documentation
List of tutorial videos
Expression language reference
Performance guide
ADF twitter
ADF tech community blog

More Related Content

PPTX
Azure Data Factory Data Flows Training (Sept 2020 Update)
PPTX
Azure Data Factory Data Flows Training v005
PPTX
Azure Data Factory ETL Patterns in the Cloud
PPTX
ADF Mapping Data Flows Level 300
PPTX
Data Quality Patterns in the Cloud with Azure Data Factory
PPTX
Azure Data Factory for Redmond SQL PASS UG Sept 2018
PPTX
Data quality patterns in the cloud with ADF
PPTX
Azure Data Factory for Azure Data Week
Azure Data Factory Data Flows Training (Sept 2020 Update)
Azure Data Factory Data Flows Training v005
Azure Data Factory ETL Patterns in the Cloud
ADF Mapping Data Flows Level 300
Data Quality Patterns in the Cloud with Azure Data Factory
Azure Data Factory for Redmond SQL PASS UG Sept 2018
Data quality patterns in the cloud with ADF
Azure Data Factory for Azure Data Week

What's hot (20)

PPTX
Azure Data Factory Data Flow
PPTX
Microsoft Azure Data Factory Hands-On Lab Overview Slides
PPTX
SQL Saturday Redmond 2019 ETL Patterns in the Cloud
PPTX
Mapping Data Flows Training April 2021
PPTX
Azure Data Factory Data Flow Limited Preview for January 2019
PPTX
Deep Dive into Azure Data Factory v2
PPTX
ADF Mapping Data Flows Training Slides V1
PPTX
Mapping Data Flows Training deck Q1 CY22
PPTX
Microsoft Azure Data Factory Data Flow Scenarios
PPTX
Microsoft Build 2018 Analytic Solutions with Azure Data Factory and Azure SQL...
PDF
ADF Mapping Data Flow Private Preview Migration
PPTX
Microsoft Data Integration Pipelines: Azure Data Factory and SSIS
PPTX
Azure data factory
PDF
Microsoft Ignite AU 2017 - Orchestrating Big Data Pipelines with Azure Data F...
PPTX
ETL in the Cloud With Microsoft Azure
PPTX
Sql Bits 2020 - Designing Performant and Scalable Data Lakes using Azure Data...
PPTX
Microsoft Azure BI Solutions in the Cloud
PDF
J1 T1 4 - Azure Data Factory vs SSIS - Regis Baccaro
PDF
Pipelines and Packages: Introduction to Azure Data Factory (Techorama NL 2019)
PDF
Building Dynamic Data Pipelines in Azure Data Factory (Microsoft Ignite 2019)
Azure Data Factory Data Flow
Microsoft Azure Data Factory Hands-On Lab Overview Slides
SQL Saturday Redmond 2019 ETL Patterns in the Cloud
Mapping Data Flows Training April 2021
Azure Data Factory Data Flow Limited Preview for January 2019
Deep Dive into Azure Data Factory v2
ADF Mapping Data Flows Training Slides V1
Mapping Data Flows Training deck Q1 CY22
Microsoft Azure Data Factory Data Flow Scenarios
Microsoft Build 2018 Analytic Solutions with Azure Data Factory and Azure SQL...
ADF Mapping Data Flow Private Preview Migration
Microsoft Data Integration Pipelines: Azure Data Factory and SSIS
Azure data factory
Microsoft Ignite AU 2017 - Orchestrating Big Data Pipelines with Azure Data F...
ETL in the Cloud With Microsoft Azure
Sql Bits 2020 - Designing Performant and Scalable Data Lakes using Azure Data...
Microsoft Azure BI Solutions in the Cloud
J1 T1 4 - Azure Data Factory vs SSIS - Regis Baccaro
Pipelines and Packages: Introduction to Azure Data Factory (Techorama NL 2019)
Building Dynamic Data Pipelines in Azure Data Factory (Microsoft Ignite 2019)
Ad

Similar to Azure Data Factory Data Wrangling with Power Query (20)

PPTX
Become a Data-Engineering _ ABMC Group.pptx
PDF
Getting Started: Data Factory in Microsoft Fabric (Microsoft Fabric Community...
DOCX
Data Wrangling for Big Data Challenges andOpportunities.docx
PPTX
Data Wrangling Made Simple: Tools and Tips.pptx
PPTX
Intro to Azure Data Factory v1
PPTX
data wrangling (1).pptx kjhiukjhknjbnkjh
PDF
Data Wrangling with Python_ Cleaning and Preparing Datasets for Analysis.pdf
PPTX
DataWrangler @VGSOM
PDF
Data Integration with Data Factory (Microsoft Fabric Day Oslo 2023)
PDF
Data Integration using Data Factory in Microsoft Fabric (ESPC Microsoft Fabri...
PDF
Data Factory in Microsoft Fabric (MsBIP #82)
PDF
How Data Wrangling Is Reshaping IT Strategies.pdf
PDF
The Battle of the Data Transformation Tools (PASS Data Community Summit 2023)
PPTX
Revolutionizing Data Wrangling with Ask On Data.pptx
PDF
Revolutionizing Data Wrangling with Ask On Data.pdf
DOCX
Revolutionizing Data Wrangling with Ask On Data.docx
PDF
Sql saturday el salvador 2016 - Me, A Data Scientist?
PDF
Azure Data Factory v2
PDF
Next level data operations using Power Automate magic
PPTX
DataDiscoveryWithPowerQuery.pptx
Become a Data-Engineering _ ABMC Group.pptx
Getting Started: Data Factory in Microsoft Fabric (Microsoft Fabric Community...
Data Wrangling for Big Data Challenges andOpportunities.docx
Data Wrangling Made Simple: Tools and Tips.pptx
Intro to Azure Data Factory v1
data wrangling (1).pptx kjhiukjhknjbnkjh
Data Wrangling with Python_ Cleaning and Preparing Datasets for Analysis.pdf
DataWrangler @VGSOM
Data Integration with Data Factory (Microsoft Fabric Day Oslo 2023)
Data Integration using Data Factory in Microsoft Fabric (ESPC Microsoft Fabri...
Data Factory in Microsoft Fabric (MsBIP #82)
How Data Wrangling Is Reshaping IT Strategies.pdf
The Battle of the Data Transformation Tools (PASS Data Community Summit 2023)
Revolutionizing Data Wrangling with Ask On Data.pptx
Revolutionizing Data Wrangling with Ask On Data.pdf
Revolutionizing Data Wrangling with Ask On Data.docx
Sql saturday el salvador 2016 - Me, A Data Scientist?
Azure Data Factory v2
Next level data operations using Power Automate magic
DataDiscoveryWithPowerQuery.pptx
Ad

More from Mark Kromer (10)

PPTX
Fabric Data Factory Pipeline Copy Perf Tips.pptx
PPTX
Build data quality rules and data cleansing into your data pipelines
PPTX
Data cleansing and prep with synapse data flows
PPTX
Data cleansing and data prep with synapse data flows
PPTX
Mapping Data Flows Perf Tuning April 2021
PPTX
Data Lake ETL in the Cloud with ADF
PPTX
Azure Data Factory Data Flow Performance Tuning 101
PPTX
Data Quality Patterns in the Cloud with ADF
PPTX
ADF Mapping Data Flows Training V2
PPTX
Azure Data Factory Data Flow Preview December 2019
Fabric Data Factory Pipeline Copy Perf Tips.pptx
Build data quality rules and data cleansing into your data pipelines
Data cleansing and prep with synapse data flows
Data cleansing and data prep with synapse data flows
Mapping Data Flows Perf Tuning April 2021
Data Lake ETL in the Cloud with ADF
Azure Data Factory Data Flow Performance Tuning 101
Data Quality Patterns in the Cloud with ADF
ADF Mapping Data Flows Training V2
Azure Data Factory Data Flow Preview December 2019

Recently uploaded (20)

PDF
NewMind AI Monthly Chronicles - July 2025
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
Cloud computing and distributed systems.
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
A Presentation on Artificial Intelligence
PDF
Modernizing your data center with Dell and AMD
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Approach and Philosophy of On baking technology
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
KodekX | Application Modernization Development
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
NewMind AI Monthly Chronicles - July 2025
The Rise and Fall of 3GPP – Time for a Sabbatical?
Cloud computing and distributed systems.
Review of recent advances in non-invasive hemoglobin estimation
Encapsulation_ Review paper, used for researhc scholars
A Presentation on Artificial Intelligence
Modernizing your data center with Dell and AMD
Building Integrated photovoltaic BIPV_UPV.pdf
MYSQL Presentation for SQL database connectivity
Approach and Philosophy of On baking technology
Per capita expenditure prediction using model stacking based on satellite ima...
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Dropbox Q2 2025 Financial Results & Investor Presentation
Spectral efficient network and resource selection model in 5G networks
Unlocking AI with Model Context Protocol (MCP)
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Empathic Computing: Creating Shared Understanding
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
KodekX | Application Modernization Development
20250228 LYD VKU AI Blended-Learning.pptx

Azure Data Factory Data Wrangling with Power Query

  • 1. Azure Data Factory: Data Wrangling Power Query in ADF Updated Public Preview Q1 CY21
  • 2. What is Data Wrangling?  Code-free data exploration and data prep  Operationalize Power Query as an activity by translating M script in ADF data flow script  Execute Power Query as a pipeline activity using the ADF data flow serverless, scaled-out, ADF-managed Apache Spark engine  Essentially acts as a data-first entry point to building ADF data flows
  • 3. ADF Data Wrangling Use Cases  Data Engineer is building an ETL process in ADF uses PQ to explore data using data profiling  Business Analyst is a PQ desktop user and wishes to operationalize their M query in a data pipeline that sinks data in the Lake  Data Engineer needs to prep data for modeling and ETL by using a data-first approach. Creates a PQ wrangling activity and adds it to pipeline.  Trimming strings  Data type conversions  Rename columns  Remove columns  Value prop  “Data Wrangling in ADF”, not “Power Query lift-and-shift”
  • 4. ADF Data Wrangling Roadmap – PQ Activity  Continue to add more M functions to fold into Spark  Add more native connectors that work in both ADF & Power Query  Enable V-Net in Power Query Online data wrangling experience in ADF  Launch PQ activity in Synapse Pipelines  Enable interactive monitoring similar to Copy and Data Flow
  • 5. Additional resources Documentation List of tutorial videos Expression language reference Performance guide ADF twitter ADF tech community blog