Data Staging Strategy
By: Milind Zodge
Points to consider
• Define data extraction mode
• Define staging layer design
• Define types of data load
• Define types of data stores in staging
Data Extraction Pull Mode
Data Extraction Push Mode
Staging Layer - Store and Forward
• ELT mode
• Load data into the staging layer first, then transform it
• Data is transformed and the staged records are updated with the transformed values
• Data can then be loaded into the data warehouse or a data mart
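
The store-and-forward steps above can be sketched roughly as follows. This is a minimal illustration rather than the author's implementation: it uses an in-memory SQLite database, and the table names `stg_customer` and `dw_customer`, the columns, and the trim/upper-case transformations are all assumptions made for the example.

```python
import sqlite3

def store_and_forward(source_rows):
    """Load raw rows into staging, transform them in place, then forward them."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical staging and warehouse tables with identical layouts.
    conn.execute("CREATE TABLE stg_customer (id INTEGER PRIMARY KEY, name TEXT, country TEXT)")
    conn.execute("CREATE TABLE dw_customer (id INTEGER PRIMARY KEY, name TEXT, country TEXT)")

    # 1. Load: copy the source rows into staging unchanged.
    conn.executemany("INSERT INTO stg_customer VALUES (?, ?, ?)", source_rows)

    # 2. Transform: update the staged records with the transformed values.
    conn.execute("UPDATE stg_customer SET name = TRIM(name), country = UPPER(country)")

    # 3. Forward: load the transformed rows into the warehouse table.
    conn.execute("INSERT INTO dw_customer SELECT id, name, country FROM stg_customer")
    conn.commit()
    return conn.execute("SELECT id, name, country FROM dw_customer ORDER BY id").fetchall()

rows = store_and_forward([(1, "  Ada ", "uk"), (2, "Grace", "us")])
print(rows)  # [(1, 'Ada', 'UK'), (2, 'Grace', 'US')]
```

Because the transformation runs inside the staging store, the source system is only touched during the initial load.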
Staging Layer - Direct load
• This method bypasses the data staging layer
• Data is extracted and loaded directly into the data warehouse layer
• It can be used where no transformation is needed
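
For contrast with store and forward, a direct load might look like the short sketch below. The table name `dw_event` and its columns are hypothetical, and SQLite stands in for whatever warehouse is actually in use.

```python
import sqlite3

def direct_load(conn, source_rows):
    """Insert extracted rows straight into the warehouse table, with no staging step."""
    conn.executemany("INSERT INTO dw_event (id, payload) VALUES (?, ?)", source_rows)
    conn.commit()

conn = sqlite3.connect(":memory:")
# Hypothetical warehouse table; the rows need no transformation before loading.
conn.execute("CREATE TABLE dw_event (id INTEGER PRIMARY KEY, payload TEXT)")
direct_load(conn, [(1, "login"), (2, "logout")])
loaded = conn.execute("SELECT id, payload FROM dw_event ORDER BY id").fetchall()
print(loaded)  # [(1, 'login'), (2, 'logout')]
```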
Data Load – Full Data
• All the data from the source is loaded into staging tables
• The staged data is then compared against the data warehouse layer and loaded
• When no change data capture (CDC) technique is available on the source, this is usually the method you end up using
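
One way to picture the full load with its compare step is the sketch below. It assumes hypothetical tables `stg_product` and `dw_product` keyed on `id`, treats a differing `price` as a change, and ignores deletions at the source; a real comparison would likely cover more columns.

```python
import sqlite3

def full_load(conn, source_rows):
    """Reload staging with the complete extract, then compare it with the warehouse."""
    # Replace the staging contents with everything from the source.
    conn.execute("DELETE FROM stg_product")
    conn.executemany("INSERT INTO stg_product (id, price) VALUES (?, ?)", source_rows)

    # Compare: update warehouse rows whose price differs from the staged value.
    conn.execute(
        """UPDATE dw_product
           SET price = (SELECT s.price FROM stg_product s WHERE s.id = dw_product.id)
           WHERE EXISTS (SELECT 1 FROM stg_product s
                         WHERE s.id = dw_product.id AND s.price <> dw_product.price)"""
    )
    # Compare: insert staged rows that the warehouse does not contain yet.
    conn.execute(
        """INSERT INTO dw_product (id, price)
           SELECT s.id, s.price FROM stg_product s
           WHERE NOT EXISTS (SELECT 1 FROM dw_product d WHERE d.id = s.id)"""
    )
    conn.commit()

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE stg_product (id INTEGER PRIMARY KEY, price REAL NOT NULL)")
conn.execute("CREATE TABLE dw_product (id INTEGER PRIMARY KEY, price REAL NOT NULL)")

full_load(conn, [(1, 10.0), (2, 20.0)])             # first run: everything is new
full_load(conn, [(1, 10.0), (2, 25.0), (3, 30.0)])  # second run: one change, one new row
result = conn.execute("SELECT id, price FROM dw_product ORDER BY id").fetchall()
print(result)  # [(1, 10.0), (2, 25.0), (3, 30.0)]
```

Every run moves the entire source extract, which is the cost of not having a change-tracking mechanism on the source.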
Data Load – Changes only
• Only newly added or changed records are processed in this method
• This can be used when change data capture is defined, or can be defined, on the source
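
A changes-only extract could be approximated as in the sketch below. It assumes the source table carries an `updated_at` timestamp that is set on every insert and update, which is only one simple way of capturing changes (log-based or trigger-based capture are others); the `orders` table and the watermark handling are hypothetical.

```python
import sqlite3

def extract_changes(source, last_watermark):
    """Return rows modified after last_watermark, plus the new watermark."""
    rows = source.execute(
        "SELECT id, status, updated_at FROM orders "
        "WHERE updated_at > ? ORDER BY updated_at",
        (last_watermark,),
    ).fetchall()
    # Advance the watermark only if something changed.
    new_watermark = rows[-1][2] if rows else last_watermark
    return rows, new_watermark

source = sqlite3.connect(":memory:")
source.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, status TEXT, updated_at TEXT)")
source.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [
        (1, "new", "2024-01-01T10:00:00"),
        (2, "shipped", "2024-01-02T09:00:00"),
        (3, "new", "2024-01-03T08:00:00"),
    ],
)

changes, watermark = extract_changes(source, "2024-01-01T12:00:00")
print([row[0] for row in changes])  # [2, 3]
print(watermark)                    # 2024-01-03T08:00:00
```

ISO 8601 timestamps stored as text compare correctly as strings, which is why a plain `>` works here.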
Data Stores
• There are two ways to store data in the staging layer
◦ File
◦ Table
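
The two kinds of staging store can be compared side by side in a small sketch. The CSV layout, the file name `extract.csv`, and the table `stg_extract` are assumptions for illustration only.

```python
import csv
import sqlite3
import tempfile
from pathlib import Path

def stage_to_file(rows, path):
    """Stage an extract as a CSV file with a header row."""
    with open(path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["id", "name"])
        writer.writerows(rows)

def stage_to_table(rows, conn):
    """Stage the same extract as rows in a database table."""
    conn.execute("CREATE TABLE IF NOT EXISTS stg_extract (id INTEGER, name TEXT)")
    conn.executemany("INSERT INTO stg_extract VALUES (?, ?)", rows)
    conn.commit()

extract = [(1, "Ada"), (2, "Grace")]

with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "extract.csv"
    stage_to_file(extract, path)
    with open(path, newline="") as f:
        file_rows = list(csv.reader(f))

conn = sqlite3.connect(":memory:")
stage_to_table(extract, conn)
table_rows = conn.execute("SELECT id, name FROM stg_extract ORDER BY id").fetchall()

print(file_rows)   # [['id', 'name'], ['1', 'Ada'], ['2', 'Grace']]
print(table_rows)  # [(1, 'Ada'), (2, 'Grace')]
```

Note that the file store returns every value as a string, while the table store keeps column types, so a table is easier to query and transform in place.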
