SlideShare a Scribd company logo
How to prepare your
data for a data
migration

www.etlsolutions.com
Introduction
•

Data migration is a complex undertaking, and
the processes and software used are
continually evolving.

•

Thorough preparation of data and systems
before a migration takes place helps to reduce
the risks involved.

•

The aim of this guide is to show the ways in
which data can be systematically prepared
before a migration to maximise the project’s
chance of success.

•

Don’t hesitate to get in touch with us at
info@etlsolutions.com if you have any
questions.

www.etlsolutions.com

Download our data
migration planning
eGuide for free at:
http://www.etlsolutions.co
m/free-eguide-preparinga-data-migration-plan/
Landscape analysis
•

It is crucial to thoroughly prepare
data and systems before a
migration takes place.

•

Landscape analysis provides an
overview of the source and target
systems, enabling the project team
to understand how each system
works and how the data within
each system is structured.

•

These areas should be reviewed
systematically to ensure that
potential errors are identified in
advance of the migration.

•

Ideally, the team should model the
links and interactions between the
different systems involved, along
with the data structures within each
system.
www.etlsolutions.com
Data assurance
•

Data assurance is another important
component of thorough preparation.

•

This procedure validates the data
discovered in the landscape analysis
and ensures that all data is fit for
purpose.

•

By validating the data, the migration
team are then free to focus solely on
structural manipulation and movement.

•

Data assurance has several phases:
• Data profiling
• Data quality definition
• Data cleansing.

www.etlsolutions.com
Data profiling
•

The aim of the data profiling phase is to
ensure that any historical data due to be
migrated is suitable for the changes that are
taking place in the organisation.

•

Profiling will identify areas of the data which
may not be of sufficient quality.

•

It should include comprehensive checks of
existing model structure, data format and data
conformance.

•

A retirement plan should be used to define the
data no longer required:
•

Any data to be retired should be recorded,
along with a description of what replaces it
or why it can be removed.

• The data that is no longer needed may
have to be archived for tax purposes or to
meet the requirements of an industry’s
governing bodies.
www.etlsolutions.com
Data quality definition
•

Data quality definitions state the
quality that must be attained by
elements, attributes and relationships
in the source system.

•

The definitions or rules should be
used during profiling to identify
whether or not the data is of the
correct quality and format.

•

All data quality rules should be listed
at element level, such as data table
or flat file.

•

All data quality issues and queries
should be tracked and stored.

www.etlsolutions.com
Data cleansing
•

The first stage in data cleansing is to define
which cleansing rules will be carried out
manually and which will be automated.

•

Splitting the rules into two enables the
organisation’s domain experts to
concentrate on the manual process, while
the migration experts design and develop
the automated cleansing.

•

Typically, the manual cleansing will be
carried out before the migration, while the
automated cleansing may be carried out
before the migration or as part of the
migration’s initial phase.

www.etlsolutions.com
Data verification
•

Data verification is the part of
the data cleansing process that
checks that the data is
available, accessible, complete
and in the correct format.

•

Our consultants often continue
to carry out verification once a
migration has begun, ensuring
that the information is optimised
prior to each stage of the
migration.

www.etlsolutions.com
Data impact analysis
•

We find that data impact analysis
is a crucial part of data
cleansing.

•

Because cleansing data adds or
alters values, data impact
analysis ensures that these
changes do not have a knock-on
effect on other elements within
the source and target systems.

•

It also checks the impact of data
cleansing on other systems
which currently use the data, and
on systems which may use the
data once the migration is
complete.

www.etlsolutions.com
Download your free copy of our data migration planning guide
• Download the PDF copy of this
guide for easy reading and
printing. It’s free, and no email
address is required!
• Visit us at:
http://www.etlsolutions.com/free
-eguide-preparing-a-datamigration-plan/ to download
your copy.

About us
At ETL Solutions, we design software to help developers tackle difficult
data transformations. We deliver ready-to-use products and services
based on Transformation Manager, a robust integration toolkit.

Images from Freedigitalphotos.net
Contact information
Karl Glenn, Business Development Director
kg@etlsolutions.com
+44 (0) 1912 894040
www.etlsolutions.com

Raising data
management
standards
www.etlsolutions.com
www.etlsolutions.com

More Related Content

PPTX
Preparing a data migration plan: A practical guide
PPTX
20171019 data migration (rk)
PPTX
DMM9 - Data Migration Testing
PDF
Data Migration Strategies PowerPoint Presentation Slides
PPT
My Sql Data Migration
PPTX
Winning Strategies for Converting and Migrating Master Data to SAP BusinessOb...
PPTX
7 Mistakes People Make During Data Center Migrations (And How to Avoid Them)
PDF
A Roadmap to Data Migration Success
Preparing a data migration plan: A practical guide
20171019 data migration (rk)
DMM9 - Data Migration Testing
Data Migration Strategies PowerPoint Presentation Slides
My Sql Data Migration
Winning Strategies for Converting and Migrating Master Data to SAP BusinessOb...
7 Mistakes People Make During Data Center Migrations (And How to Avoid Them)
A Roadmap to Data Migration Success

What's hot (20)

PDF
Get started with data migration
PPT
Data migration
PPTX
Migrating data: How to reduce risk
PDF
Data migration methodology for sap v2
PPTX
Home summary
PPS
ERP Data Migration Methodologies
PPTX
Systems Migration
PDF
Data Center Migration Essentials - Adam Saint-Prix Tim Wong
PPTX
DATPROF Test data Management (data privacy & data subsetting) - English
PDF
Ray Scott - Agile Solutions – Leading with Test Data Management - EuroSTAR 2012
PPTX
Test data management a case study Presented at SiGIST
PDF
Test data management
ODP
PPTX
How to Test Big Data Systems | QualiTest Group
PPTX
Big Data Testing : Automate theTesting of Hadoop, NoSQL & DWH without Writing...
PDF
Transforming Business Intelligence Testing
PPTX
Data Warehousing in Pharma: How to Find Bad Data while Meeting Regulatory Req...
PPTX
Implementing Azure DevOps with your Testing Project
PDF
QuerySurge - the automated Data Testing solution
PDF
Creating a Data validation and Testing Strategy
Get started with data migration
Data migration
Migrating data: How to reduce risk
Data migration methodology for sap v2
Home summary
ERP Data Migration Methodologies
Systems Migration
Data Center Migration Essentials - Adam Saint-Prix Tim Wong
DATPROF Test data Management (data privacy & data subsetting) - English
Ray Scott - Agile Solutions – Leading with Test Data Management - EuroSTAR 2012
Test data management a case study Presented at SiGIST
Test data management
How to Test Big Data Systems | QualiTest Group
Big Data Testing : Automate theTesting of Hadoop, NoSQL & DWH without Writing...
Transforming Business Intelligence Testing
Data Warehousing in Pharma: How to Find Bad Data while Meeting Regulatory Req...
Implementing Azure DevOps with your Testing Project
QuerySurge - the automated Data Testing solution
Creating a Data validation and Testing Strategy
Ad

Viewers also liked (17)

PPTX
Indicators of Advancing Access to Justice in Situations of Gender-based-violence
PDF
Tanzania SBCC Landscape Analysis 2012
PDF
DSD-INT 2014 - Delft-FEWS Users Meeting - Hydrological forecasting system in ...
PPTX
Data Strategy in 2016
PPTX
Usama Fayyad talk at IIT Madras on March 27, 2015: BigData, AllData, Old Dat...
PDF
Data Mining- Big Data landscape
PDF
Finnish ITS and MaaS business: a landscape analysis
PDF
15 Tips on Salesforce Data Migration - Naveen Gabrani & Jonathan Osgood
PPTX
Big data landscape version 2.0
PDF
Top 5 ETL Tools for Salesforce Data Migration
PDF
Big data landscape map collection by aibdp
PDF
Data strategy in a Big Data world
PPTX
Developing a Data Strategy -- A Guide For Business Leaders
 
PDF
TechConnectr's Big Data Connection. Digital Marketing KPIs, Targeting, Analy...
PDF
Data Strategy
PDF
8 Steps to Creating a Data Strategy
PDF
Data Analytics Strategy
Indicators of Advancing Access to Justice in Situations of Gender-based-violence
Tanzania SBCC Landscape Analysis 2012
DSD-INT 2014 - Delft-FEWS Users Meeting - Hydrological forecasting system in ...
Data Strategy in 2016
Usama Fayyad talk at IIT Madras on March 27, 2015: BigData, AllData, Old Dat...
Data Mining- Big Data landscape
Finnish ITS and MaaS business: a landscape analysis
15 Tips on Salesforce Data Migration - Naveen Gabrani & Jonathan Osgood
Big data landscape version 2.0
Top 5 ETL Tools for Salesforce Data Migration
Big data landscape map collection by aibdp
Data strategy in a Big Data world
Developing a Data Strategy -- A Guide For Business Leaders
 
TechConnectr's Big Data Connection. Digital Marketing KPIs, Targeting, Analy...
Data Strategy
8 Steps to Creating a Data Strategy
Data Analytics Strategy
Ad

Similar to How to prepare data before a data migration (20)

PPTX
CISA_WK_4.pptx
PPTX
PPTX
Data Governance Overview - Doreen Christian
PPTX
AIS PPt.pptx
PPTX
management system development and planning
PDF
Migration Strategy and Best practices .
DOCX
The Ultimate Guide to Data Migration Strategies, Tools, and Techniques.docx
PPTX
PD 2 - Data Integration Architecture.pptx
PDF
Asset Finance Systems Implementation
PDF
Asset finance systems implementation
PDF
Asset finance systems implementation
PPTX
2. INFORMATION GATHERING.pptx Computer Applications in Pharmacy
PDF
Making the Most of Your Data A Comprehensive Guide to Successful Data Migrati...
PPTX
software engineering slide based on kfueit slides
PDF
Data migration patterns special
PPTX
MS Lecture 9 information technology
PPTX
How important is IT auditing
PPTX
PPT
Lecture 2 - Security Requirments.ppt
PPTX
System engineering analysis and design
CISA_WK_4.pptx
Data Governance Overview - Doreen Christian
AIS PPt.pptx
management system development and planning
Migration Strategy and Best practices .
The Ultimate Guide to Data Migration Strategies, Tools, and Techniques.docx
PD 2 - Data Integration Architecture.pptx
Asset Finance Systems Implementation
Asset finance systems implementation
Asset finance systems implementation
2. INFORMATION GATHERING.pptx Computer Applications in Pharmacy
Making the Most of Your Data A Comprehensive Guide to Successful Data Migrati...
software engineering slide based on kfueit slides
Data migration patterns special
MS Lecture 9 information technology
How important is IT auditing
Lecture 2 - Security Requirments.ppt
System engineering analysis and design

More from ETLSolutions (9)

PPTX
How to create a successful proof of concept
PPTX
DMS data integration: 6 ways to get it right
PPTX
WITSML to PPDM mapping project
PPTX
E&P data management: Implementing data standards
PPTX
An example of a successful proof of concept
PPTX
Data integration case study: Oil & Gas industry
PPTX
Data integration case study: Automotive industry
PPTX
A 5-step methodology for complex E&P data management
PPTX
Automotive data integration: An example of a successful project structure
How to create a successful proof of concept
DMS data integration: 6 ways to get it right
WITSML to PPDM mapping project
E&P data management: Implementing data standards
An example of a successful proof of concept
Data integration case study: Oil & Gas industry
Data integration case study: Automotive industry
A 5-step methodology for complex E&P data management
Automotive data integration: An example of a successful project structure

Recently uploaded (20)

PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Encapsulation theory and applications.pdf
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
Electronic commerce courselecture one. Pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PPTX
Big Data Technologies - Introduction.pptx
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Review of recent advances in non-invasive hemoglobin estimation
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Chapter 3 Spatial Domain Image Processing.pdf
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Unlocking AI with Model Context Protocol (MCP)
Advanced methodologies resolving dimensionality complications for autism neur...
Dropbox Q2 2025 Financial Results & Investor Presentation
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Spectral efficient network and resource selection model in 5G networks
Network Security Unit 5.pdf for BCA BBA.
Encapsulation theory and applications.pdf
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Empathic Computing: Creating Shared Understanding
Electronic commerce courselecture one. Pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Big Data Technologies - Introduction.pptx
NewMind AI Weekly Chronicles - August'25 Week I
Review of recent advances in non-invasive hemoglobin estimation
The AUB Centre for AI in Media Proposal.docx
Chapter 3 Spatial Domain Image Processing.pdf

How to prepare data before a data migration

  • 1. How to prepare your data for a data migration www.etlsolutions.com
  • 2. Introduction • Data migration is a complex undertaking, and the processes and software used are continually evolving. • Thorough preparation of data and systems before a migration takes place helps to reduce the risks involved. • The aim of this guide is to show the ways in which data can be systematically prepared before a migration to maximise the project’s chance of success. • Don’t hesitate to get in touch with us at info@etlsolutions.com if you have any questions. www.etlsolutions.com Download our data migration planning eGuide for free at: http://www.etlsolutions.co m/free-eguide-preparinga-data-migration-plan/
  • 3. Landscape analysis • It is crucial to thoroughly prepare data and systems before a migration takes place. • Landscape analysis provides an overview of the source and target systems, enabling the project team to understand how each system works and how the data within each system is structured. • These areas should be reviewed systematically to ensure that potential errors are identified in advance of the migration. • Ideally, the team should model the links and interactions between the different systems involved, along with the data structures within each system. www.etlsolutions.com
  • 4. Data assurance • Data assurance is another important component of thorough preparation. • This procedure validates the data discovered in the landscape analysis and ensures that all data is fit for purpose. • By validating the data, the migration team are then free to focus solely on structural manipulation and movement. • Data assurance has several phases: • Data profiling • Data quality definition • Data cleansing. www.etlsolutions.com
  • 5. Data profiling • The aim of the data profiling phase is to ensure that any historical data due to be migrated is suitable for the changes that are taking place in the organisation. • Profiling will identify areas of the data which may not be of sufficient quality. • It should include comprehensive checks of existing model structure, data format and data conformance. • A retirement plan should be used to define the data no longer required: • Any data to be retired should be recorded, along with a description of what replaces it or why it can be removed. • The data that is no longer needed may have to be archived for tax purposes or to meet the requirements of an industry’s governing bodies. www.etlsolutions.com
  • 6. Data quality definition • Data quality definitions state the quality that must be attained by elements, attributes and relationships in the source system. • The definitions or rules should be used during profiling to identify whether or not the data is of the correct quality and format. • All data quality rules should be listed at element level, such as data table or flat file. • All data quality issues and queries should be tracked and stored. www.etlsolutions.com
  • 7. Data cleansing • The first stage in data cleansing is to define which cleansing rules will be carried out manually and which will be automated. • Splitting the rules into two enables the organisation’s domain experts to concentrate on the manual process, while the migration experts design and develop the automated cleansing. • Typically, the manual cleansing will be carried out before the migration, while the automated cleansing may be carried out before the migration or as part of the migration’s initial phase. www.etlsolutions.com
  • 8. Data verification • Data verification is the part of the data cleansing process that checks that the data is available, accessible, complete and in the correct format. • Our consultants often continue to carry out verification once a migration has begun, ensuring that the information is optimised prior to each stage of the migration. www.etlsolutions.com
  • 9. Data impact analysis • We find that data impact analysis is a crucial part of data cleansing. • Because cleansing data adds or alters values, data impact analysis ensures that these changes do not have a knock-on effect on other elements within the source and target systems. • It also checks the impact of data cleansing on other systems which currently use the data, and on systems which may use the data once the migration is complete. www.etlsolutions.com
  • 10. Download your free copy of our data migration planning guide • Download the PDF copy of this guide for easy reading and printing. It’s free, and no email address is required! • Visit us at: http://www.etlsolutions.com/free -eguide-preparing-a-datamigration-plan/ to download your copy. About us At ETL Solutions, we design software to help developers tackle difficult data transformations. We deliver ready-to-use products and services based on Transformation Manager, a robust integration toolkit. Images from Freedigitalphotos.net
  • 11. Contact information Karl Glenn, Business Development Director kg@etlsolutions.com +44 (0) 1912 894040 www.etlsolutions.com Raising data management standards www.etlsolutions.com www.etlsolutions.com

Editor's Notes

  • #4: To keep things simple when I’m talking, we’ll discuss loading data into PPDM, but a lot of this applies to generic data loading – moving data out of PPDM, or not involving PPDM at all.Data transformation is mudane from a business perspective, but very important to get right. The less time and trouble it causes, the more time you can spend doing more interesting things directly benefiting your business.Badly loaded data by definition affects the quality of the data in your MDM store.
  • #5: To keep things simple when I’m talking, we’ll discuss loading data into PPDM, but a lot of this applies to generic data loading – moving data out of PPDM, or not involving PPDM at all.Data transformation is mudane from a business perspective, but very important to get right. The less time and trouble it causes, the more time you can spend doing more interesting things directly benefiting your business.Badly loaded data by definition affects the quality of the data in your MDM store.
  • #6: To keep things simple when I’m talking, we’ll discuss loading data into PPDM, but a lot of this applies to generic data loading – moving data out of PPDM, or not involving PPDM at all.Data transformation is mudane from a business perspective, but very important to get right. The less time and trouble it causes, the more time you can spend doing more interesting things directly benefiting your business.Badly loaded data by definition affects the quality of the data in your MDM store.
  • #7: To keep things simple when I’m talking, we’ll discuss loading data into PPDM, but a lot of this applies to generic data loading – moving data out of PPDM, or not involving PPDM at all.Data transformation is mudane from a business perspective, but very important to get right. The less time and trouble it causes, the more time you can spend doing more interesting things directly benefiting your business.Badly loaded data by definition affects the quality of the data in your MDM store.
  • #8: To keep things simple when I’m talking, we’ll discuss loading data into PPDM, but a lot of this applies to generic data loading – moving data out of PPDM, or not involving PPDM at all.Data transformation is mudane from a business perspective, but very important to get right. The less time and trouble it causes, the more time you can spend doing more interesting things directly benefiting your business.Badly loaded data by definition affects the quality of the data in your MDM store.
  • #9: To keep things simple when I’m talking, we’ll discuss loading data into PPDM, but a lot of this applies to generic data loading – moving data out of PPDM, or not involving PPDM at all.Data transformation is mudane from a business perspective, but very important to get right. The less time and trouble it causes, the more time you can spend doing more interesting things directly benefiting your business.Badly loaded data by definition affects the quality of the data in your MDM store.
  • #10: To keep things simple when I’m talking, we’ll discuss loading data into PPDM, but a lot of this applies to generic data loading – moving data out of PPDM, or not involving PPDM at all.Data transformation is mudane from a business perspective, but very important to get right. The less time and trouble it causes, the more time you can spend doing more interesting things directly benefiting your business.Badly loaded data by definition affects the quality of the data in your MDM store.