SlideShare a Scribd company logo
How to effectively migrate
data from legacy systems
www.cloverdx.com
Migration from legacy system
Legacy system New system
Has not been modernized in a long
time, runs on outdated platform
In-house or heavily customized
Holds business critical data – CRM,
ERP, billing, …
Required for correct functioning of the
organization
Modern, more capable system
Often cloud-based
Provides business benefit like better
reporting, data quality, visibility, …
Migrate
Telco Migration, The Need
In-house
system
CLF nCRM
Validate, cleanse, map Validate, cleanse, map
Data flow
100k customers 6m customers
Planning and scope changes
Gaps in knowledge (in source and in target)
Data quality
Execution and reporting
Challenges
Planning & Scope
Optimists planning a migration
Analysis
and
discovery
Extraction
Validation
Cleansing
Loading
Reconciliation
Done
In reality…
…many activities
happen in parallel
Documentation of what needs to be done and of deadlines.
Can take long time to complete
Work with business and review as early as possible to prevent issues
Figuringout the scope
Business specification
In-depth look at each system involved to determine implementation plan.
Requires cooperation from each party
Knowledge gaps can prolong the process quite a bit.
Figuringout the scope
Technical analysis
Hardest part, especially for complex migrations with many business rules.
What is a cost of a business rule implementation?
Big bang migration or trickle migration?
Figuringout the scope
Effort estimates and planning
Analysis Implementation & testing Run
Analysis Implementation & testing Run Run Run Run
Big bang
Trickle
Not all data has the same complexity
Which data to process first?
Simple Complex
Data complexity
Number of records
Not all data has the same complexity
More complex data brings more problems
Which data to process first?
Simple Complex
Data complexity
Number of records
Problems
Not all data has the same complexity
More complex data brings more problems
Migrate complex records first, maybe manually
Which data to process first?
Simple Complex
Data complexity
Number of records
Problems
Do this first
Oh no! The scope changed!
Factor “known unknowns” into your estimates – possibility to change after
technical analysis and technology risks.
What if business changed their mind? This cannot be avoided. Full and
close cooperation with business helps when this happens.
Telco Migration, The Plan
24 m
Start 12 m
In-house to CLF
CLF to nCRM
Timeline
In-house
system
CLF nCRM
Validate, cleanse, map Validate, cleanse, map
Data flow
100k customers 6m customers
Data Quality
Single biggest issue in most legacy migration projects
Data frequently comes directly from users
o Typed in an application (or even Excel document)
o People come and go
o Processes and ideas about data change
Garbage in, garbage out
o Migration is your best chance to fix your data, business already expects
complications and issues with the data
Data Quality
Make this part of your technical analysis
Measure on every step of the process
o On input, but also after transformations
o Can help you catch logic errors
Provide frequent reports to business
o KPIs can help you make decisions: data is too bad → redesign solution
Measuring data quality
Fix as early as possible, ideally in the source system
o Needs cooperation with the source data owner
Fail on error or keep on going?
o Both approaches valid for different usecases
Fixing data quality issues
Data Mapping
How to map entities in old system to the ones in the one?
o Same data domain, different ideas about how to represent everything
Knowledge gaps
o Old system might not be known very well anymore
o New system is not yet known very well
The mapping problem
?
How to make it accessible to business users?
How to keep track of different mapping versions?
Use Data Modeling tools or Data catalog
Mapping specification
Split mappings into multiple “pieces” to make them easier to build and
verify
Document your code
Mapping development
Split mappings into multiple “pieces” to make them easier to build and
verify
Document your code
Validate your data and report to business
Mapping development
Test data can be difficult to obtain
o Manually built test data is often insufficient – does not cover corner cases
Use data as close to production data as possible
o Anonymize to prevent data leaks and comply with regulations
Test with business
Test as early (and as often) as possible
Testing the Mapping
Production Data sample Anonymize Use data in QA
Everything is tested, smooth sailing from here…
Communication is the key: frequent updates to your
business users.
Detailed reports help ensure that production data is ok.
Be ready to support the solution – hypercare.
Execution
Telco Migration, The Plan
24 m
Start 12 m
In-house to CLF
CLF to nCRM
Timeline
In-house
system
CLF nCRM
Validate, cleanse, map Validate, cleanse, map
Data flow
100k customers 6m customers
Telco Migration, The Result
36 m
Start 24 m
In-house to CLF
CLF to nCRM
Timeline
In-house
system
CLF nCRM
Validate, cleanse, map Validate, cleanse, map
Data flow
100k customers 6m customers
Project restart
Additional target systems
Bespoke systems
o Vendor no longer available or not cooperating
Complex scenarios
o Many-to-one or many-to-many scenarios
o Lots of customizations over long periods of time
Messy data
o Needing cleaning up, repurposing, reconstructing or enrichment
Tight deadlines
o When traditional approach can’t deliver on time
Data Management Platform
There is no “I can’t” with CloverDX.
Combining capabilities of coding with the
readability of a visual design, CloverDX lets
developers solve hard problems in code while
empowering less technical colleagues.
While CloverDX is built so that
everything can sit on a single
platform, it is flexible to fill in gaps
in an existing technology stack.
Thanks to its open architecture,
almost every part of CloverDX can
be customized, versioned, parametrized.
If 90% of the work is easy and 10% are
the exceptions, error and outliers, then
CloverDX is built to help you deal with
those 10 percent.
Solve 100% of a problem, not just the easy 90%
With CloverDX, you ultimately think of running everything on autopilot.
Automate Everything
Automate the entirety of
your data workloads,
including the exceptions
and intricacies of real-life
processes.
Schedule jobs for
automatic delivery to data
warehouses, data lakes or
operational databases in
cloud or on premise.
Orchestrate external systems
and tools by CloverDX sitting
at the center of your
architecture – events, API
calls or message queues.
www.cloverdx.com

More Related Content

PPTX
How to benchmark the maturity of your saas solution
PDF
Serhii Kholodniuk: What you need to know, before migrating data platform to G...
PDF
Kubernetes Summit 2021: Multi-Cluster - The Good, the Bad and the Ugly
PDF
Modernizing to a Cloud Data Architecture
PPSX
Cloud Architecture - Multi Cloud, Edge, On-Premise
PDF
Getting Started with Delta Lake on Databricks
PPTX
Data Lakehouse Symposium | Day 4
PDF
Architect’s Open-Source Guide for a Data Mesh Architecture
How to benchmark the maturity of your saas solution
Serhii Kholodniuk: What you need to know, before migrating data platform to G...
Kubernetes Summit 2021: Multi-Cluster - The Good, the Bad and the Ugly
Modernizing to a Cloud Data Architecture
Cloud Architecture - Multi Cloud, Edge, On-Premise
Getting Started with Delta Lake on Databricks
Data Lakehouse Symposium | Day 4
Architect’s Open-Source Guide for a Data Mesh Architecture

What's hot (20)

PDF
Azure Monitoring Overview
PPTX
Azure data platform overview
PPTX
Snowflake: The Good, the Bad, and the Ugly
PPTX
Is the traditional data warehouse dead?
PDF
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
PPTX
Azure Migrate
PDF
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
PDF
What is MLOps
PDF
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...
PPTX
Microsoft Fabric Introduction
PDF
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
PDF
Data Architecture Best Practices for Advanced Analytics
PDF
How to govern and secure a Data Mesh?
PDF
Enabling a Data Mesh Architecture with Data Virtualization
PDF
Modern Data Flow
PPTX
Databricks Fundamentals
PPTX
Modern Data Architecture
PDF
Serverless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
PDF
Time to Talk about Data Mesh
PDF
A Thorough Comparison of Delta Lake, Iceberg and Hudi
Azure Monitoring Overview
Azure data platform overview
Snowflake: The Good, the Bad, and the Ugly
Is the traditional data warehouse dead?
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
Azure Migrate
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
What is MLOps
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...
Microsoft Fabric Introduction
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Data Architecture Best Practices for Advanced Analytics
How to govern and secure a Data Mesh?
Enabling a Data Mesh Architecture with Data Virtualization
Modern Data Flow
Databricks Fundamentals
Modern Data Architecture
Serverless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Time to Talk about Data Mesh
A Thorough Comparison of Delta Lake, Iceberg and Hudi
Ad

Similar to How to Effectively Migrate Data From Legacy Apps (20)

PPTX
Characteristics of modern data architecture that drive innovation
PDF
Digital Transformation: How to Run Best-in-Class IT Operations in a World of ...
PPTX
How to build an automated customer data onboarding pipeline
PPTX
Data Engineer's Lunch #60: Series - Developing Enterprise Consciousness
PDF
How to Migrate Without Downtime
PDF
Cw13 dell cloud computing for telco sp by anis tell
PDF
Cw13 dell cloud computing for telco sp by anis tell
PDF
How to develop a multi cloud strategy to accelerate digital transformation - ...
PPT
Feeling Good About Your Migration: A Field-proven Framework For Performing V...
PDF
Seamless Data Migration to Oracle Fusion Cloud
PPT
What do I know about my customers?
PDF
Data migration patterns special
PDF
Digital transformation slideshare
PPT
IT Modernization For Process Modernization
PDF
Engineering Machine Learning Data Pipelines Series: Big Data Quality - Cleans...
PDF
Whitepaper cloud 2016
PPT
Cloud Data Integration Best Practices
PDF
Security & Compliance in the Cloud [2019]
PPT
Auditing in the Cloud
PPT
Callidus Software On-Premise To On-Demand Migration
Characteristics of modern data architecture that drive innovation
Digital Transformation: How to Run Best-in-Class IT Operations in a World of ...
How to build an automated customer data onboarding pipeline
Data Engineer's Lunch #60: Series - Developing Enterprise Consciousness
How to Migrate Without Downtime
Cw13 dell cloud computing for telco sp by anis tell
Cw13 dell cloud computing for telco sp by anis tell
How to develop a multi cloud strategy to accelerate digital transformation - ...
Feeling Good About Your Migration: A Field-proven Framework For Performing V...
Seamless Data Migration to Oracle Fusion Cloud
What do I know about my customers?
Data migration patterns special
Digital transformation slideshare
IT Modernization For Process Modernization
Engineering Machine Learning Data Pipelines Series: Big Data Quality - Cleans...
Whitepaper cloud 2016
Cloud Data Integration Best Practices
Security & Compliance in the Cloud [2019]
Auditing in the Cloud
Callidus Software On-Premise To On-Demand Migration
Ad

More from CloverDX (12)

PPTX
Data architecture principles to accelerate your data strategy
PPTX
Automating Data Pipelines: Moving away from Scripts and Excel
PPTX
CloverDX 6.2 Release
PDF
Deploying ETL to Cloud
PDF
Moving Legacy Apps to Cloud: How to Avoid Risk
PDF
Starting Your Modern DataOps Journey
PPTX
CloverDX for IBM Infosphere MDM (for 11.4 and later)
PDF
Modern management of data pipelines made easier
PDF
Removing Danger From Data
PDF
Data Anonymization For Better Software Testing
PDF
How to publish data and transformations over APIs with CloverDX Data Services
PPTX
Moving "Something Simple" To The Cloud - What It Really Takes
Data architecture principles to accelerate your data strategy
Automating Data Pipelines: Moving away from Scripts and Excel
CloverDX 6.2 Release
Deploying ETL to Cloud
Moving Legacy Apps to Cloud: How to Avoid Risk
Starting Your Modern DataOps Journey
CloverDX for IBM Infosphere MDM (for 11.4 and later)
Modern management of data pipelines made easier
Removing Danger From Data
Data Anonymization For Better Software Testing
How to publish data and transformations over APIs with CloverDX Data Services
Moving "Something Simple" To The Cloud - What It Really Takes

Recently uploaded (20)

PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
KodekX | Application Modernization Development
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
Spectroscopy.pptx food analysis technology
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Encapsulation theory and applications.pdf
PPTX
Big Data Technologies - Introduction.pptx
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
cuic standard and advanced reporting.pdf
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Empathic Computing: Creating Shared Understanding
PDF
Electronic commerce courselecture one. Pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Mobile App Security Testing_ A Comprehensive Guide.pdf
Unlocking AI with Model Context Protocol (MCP)
Digital-Transformation-Roadmap-for-Companies.pptx
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
KodekX | Application Modernization Development
Building Integrated photovoltaic BIPV_UPV.pdf
Spectroscopy.pptx food analysis technology
MYSQL Presentation for SQL database connectivity
Encapsulation theory and applications.pdf
Big Data Technologies - Introduction.pptx
Spectral efficient network and resource selection model in 5G networks
cuic standard and advanced reporting.pdf
Understanding_Digital_Forensics_Presentation.pptx
Advanced methodologies resolving dimensionality complications for autism neur...
MIND Revenue Release Quarter 2 2025 Press Release
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Empathic Computing: Creating Shared Understanding
Electronic commerce courselecture one. Pdf
Per capita expenditure prediction using model stacking based on satellite ima...
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx

How to Effectively Migrate Data From Legacy Apps

  • 1. How to effectively migrate data from legacy systems www.cloverdx.com
  • 2. Migration from legacy system Legacy system New system Has not been modernized in a long time, runs on outdated platform In-house or heavily customized Holds business critical data – CRM, ERP, billing, … Required for correct functioning of the organization Modern, more capable system Often cloud-based Provides business benefit like better reporting, data quality, visibility, … Migrate
  • 3. Telco Migration, The Need In-house system CLF nCRM Validate, cleanse, map Validate, cleanse, map Data flow 100k customers 6m customers
  • 4. Planning and scope changes Gaps in knowledge (in source and in target) Data quality Execution and reporting Challenges
  • 6. Optimists planning a migration Analysis and discovery Extraction Validation Cleansing Loading Reconciliation Done
  • 8. Documentation of what needs to be done and of deadlines. Can take long time to complete Work with business and review as early as possible to prevent issues Figuringout the scope Business specification
  • 9. In-depth look at each system involved to determine implementation plan. Requires cooperation from each party Knowledge gaps can prolong the process quite a bit. Figuringout the scope Technical analysis
  • 10. Hardest part, especially for complex migrations with many business rules. What is a cost of a business rule implementation? Big bang migration or trickle migration? Figuringout the scope Effort estimates and planning Analysis Implementation & testing Run Analysis Implementation & testing Run Run Run Run Big bang Trickle
  • 11. Not all data has the same complexity Which data to process first? Simple Complex Data complexity Number of records
  • 12. Not all data has the same complexity More complex data brings more problems Which data to process first? Simple Complex Data complexity Number of records Problems
  • 13. Not all data has the same complexity More complex data brings more problems Migrate complex records first, maybe manually Which data to process first? Simple Complex Data complexity Number of records Problems Do this first
  • 14. Oh no! The scope changed! Factor “known unknowns” into your estimates – possibility to change after technical analysis and technology risks. What if business changed their mind? This cannot be avoided. Full and close cooperation with business helps when this happens.
  • 15. Telco Migration, The Plan 24 m Start 12 m In-house to CLF CLF to nCRM Timeline In-house system CLF nCRM Validate, cleanse, map Validate, cleanse, map Data flow 100k customers 6m customers
  • 17. Single biggest issue in most legacy migration projects Data frequently comes directly from users o Typed in an application (or even Excel document) o People come and go o Processes and ideas about data change Garbage in, garbage out o Migration is your best chance to fix your data, business already expects complications and issues with the data Data Quality
  • 18. Make this part of your technical analysis Measure on every step of the process o On input, but also after transformations o Can help you catch logic errors Provide frequent reports to business o KPIs can help you make decisions: data is too bad → redesign solution Measuring data quality
  • 19. Fix as early as possible, ideally in the source system o Needs cooperation with the source data owner Fail on error or keep on going? o Both approaches valid for different usecases Fixing data quality issues
  • 21. How to map entities in old system to the ones in the one? o Same data domain, different ideas about how to represent everything Knowledge gaps o Old system might not be known very well anymore o New system is not yet known very well The mapping problem ?
  • 22. How to make it accessible to business users? How to keep track of different mapping versions? Use Data Modeling tools or Data catalog Mapping specification
  • 23. Split mappings into multiple “pieces” to make them easier to build and verify Document your code Mapping development
  • 24. Split mappings into multiple “pieces” to make them easier to build and verify Document your code Validate your data and report to business Mapping development
  • 25. Test data can be difficult to obtain o Manually built test data is often insufficient – does not cover corner cases Use data as close to production data as possible o Anonymize to prevent data leaks and comply with regulations Test with business Test as early (and as often) as possible Testing the Mapping Production Data sample Anonymize Use data in QA
  • 26. Everything is tested, smooth sailing from here… Communication is the key: frequent updates to your business users. Detailed reports help ensure that production data is ok. Be ready to support the solution – hypercare. Execution
  • 27. Telco Migration, The Plan 24 m Start 12 m In-house to CLF CLF to nCRM Timeline In-house system CLF nCRM Validate, cleanse, map Validate, cleanse, map Data flow 100k customers 6m customers
  • 28. Telco Migration, The Result 36 m Start 24 m In-house to CLF CLF to nCRM Timeline In-house system CLF nCRM Validate, cleanse, map Validate, cleanse, map Data flow 100k customers 6m customers Project restart Additional target systems
  • 29. Bespoke systems o Vendor no longer available or not cooperating Complex scenarios o Many-to-one or many-to-many scenarios o Lots of customizations over long periods of time Messy data o Needing cleaning up, repurposing, reconstructing or enrichment Tight deadlines o When traditional approach can’t deliver on time Data Management Platform
  • 30. There is no “I can’t” with CloverDX. Combining capabilities of coding with the readability of a visual design, CloverDX lets developers solve hard problems in code while empowering less technical colleagues. While CloverDX is built so that everything can sit on a single platform, it is flexible to fill in gaps in an existing technology stack. Thanks to its open architecture, almost every part of CloverDX can be customized, versioned, parametrized. If 90% of the work is easy and 10% are the exceptions, error and outliers, then CloverDX is built to help you deal with those 10 percent. Solve 100% of a problem, not just the easy 90%
  • 31. With CloverDX, you ultimately think of running everything on autopilot. Automate Everything Automate the entirety of your data workloads, including the exceptions and intricacies of real-life processes. Schedule jobs for automatic delivery to data warehouses, data lakes or operational databases in cloud or on premise. Orchestrate external systems and tools by CloverDX sitting at the center of your architecture – events, API calls or message queues. www.cloverdx.com