Drupal Migration
Migrating 100,000 pages of content
From Legacy CMS to Drupal
Rachel Jaro
Solutions Architect at PrometSource
www.prometsource.com
Overview
We'll talk about:
  • Successful migration recipe
  • Common questions you should be asking before you start
  • Top 3 tools to do migration in Drupal
  • Issues
    • Tools to use in URL rewriting
    • File management comparison in D6
  • Testing
  • Deploying solution
Data Migration
“Data migration solutions extract data from a source system, correct errors, reformat, restructure and load the data into a replacement target system.”
It sounds simple, but poorly managed data migration is the most common cause of failure in implementing a replacement system.
-- Gershon Pick, March 2001
Successful Migration Recipe
Planning
Source: http://www.flickr.com/photos/bjornmeansbear/4380595283/
Plan: What to Ask
Node types (content separation, fields)
  • Do you want to separate content into pages, articles, biographies, news, etc.?
  • What fields are needed for each node type?
  • Who can access it?
  • Do you really need that content type, or can taxonomies handle similar content instead?
Plan: What to Ask
Taxonomy (categorization, tags)
  • Do you need to categorize nodes? Would you need different access?
  • What kinds of taxonomy groups or vocabularies would you need?
Permissions (per node) and user roles
  • Who is going to use the site? What, specifically, are their access rights?
Plan: What to Ask
New URL mapping
  • Do you need to make SEO-friendly URLs?
Files, file permissions and file directory
  • Do you need an advanced file management or document management tool?
  • Do you need a simpler solution? How simple?
  • Do you need access rights for each folder?
  • Do you need a browser-type interface to access them?
  • What kinds of files do you need to store? Images, PDFs?
Build
Requirements
  • Use CSV files to import data
  • Divide the migration into groups or sections
  • Map and replace old URLs with SEO-friendly URLs
    • Before: 05-200.htm
Data in CSV Example
December 13, 2005 3:39:54 PM||||||||||December 13, 2005||||||||||Report Spotlights Need for Reform in Jackpot Jurisdictions||||||||||/press/releases/2005/december/||||||||||05-200||||||||||{UUID}|||||||||| Economics^^^^^^^^^^Economy ||||||||||<p>Lorem Ipsum is simply dummy text of the printing and typesetting industry. Lorem Ipsum has been the industry's standard dummy text ever since the 1500s, when an unknown printer took a galley of type and scrambled it to make a type specimen book. </p><p>Lorem Ipsum is simply dummy text of the printing and typesetting industry. Lorem Ipsum has been the industry's standard dummy text ever since the 1500s, when an unknown printer took a galley of type and scrambled it to make a type specimen book. </p>$$$$$$$$$$
  • Separator: ||||||||||
  • End of Row: $$$$$$$$$$
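The export format in the example can be read with a few lines of code. What follows is a minimal Python sketch, not the script used in the talk. The field names (created, published, title, path, slug, uuid, tags, body) are assumptions inferred from the sample row, and treating ^^^^^^^^^^ as a multi-value separator inside the tags field is also an assumption, since the slide only names the field separator and the end-of-row marker.

```python
# Delimiters as they appear in the sample row (runs of ten characters).
FIELD_SEP = "|" * 10
ROW_END = "$" * 10
MULTI_SEP = "^" * 10  # assumption: separates multiple values in one field

# Hypothetical field names, inferred from the order of values in the sample.
FIELD_NAMES = ["created", "published", "title", "path",
               "slug", "uuid", "tags", "body"]


def parse_export(text):
    """Split a legacy export into a list of dicts, one per row."""
    rows = []
    for raw in text.split(ROW_END):
        if not raw.strip():
            continue  # skip the empty chunk after the last end-of-row marker
        values = [value.strip() for value in raw.split(FIELD_SEP)]
        if len(values) != len(FIELD_NAMES):
            raise ValueError(
                "expected %d fields, got %d" % (len(FIELD_NAMES), len(values))
            )
        row = dict(zip(FIELD_NAMES, values))
        # Turn the multi-value tags field into a clean list.
        row["tags"] = [t.strip() for t in row["tags"].split(MULTI_SEP) if t.strip()]
        rows.append(row)
    return rows
```

Rejecting rows with the wrong field count early is a deliberate choice here: a body that happens to contain the separator would otherwise shift every later field silently.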
Content Type Division
  • Example: CNN.com
  • Divide migration sequences into US, World, Politics, Justice, etc.
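One way to divide a migration into sections is to group the source rows by the first segment of their legacy path, so each section can be imported and checked on its own. The sketch below is in Python and assumes each row is a dict with a "path" key; that shape and the "(root)" label for top-level pages are illustrative choices, not something the slides specify.

```python
from collections import defaultdict


def section_of(path):
    """Return the first non-empty path segment, e.g. '/press/releases/' -> 'press'."""
    segments = [segment for segment in path.split("/") if segment]
    return segments[0] if segments else "(root)"


def group_by_section(rows):
    """Bucket rows by section so each bucket can be migrated as one batch."""
    groups = defaultdict(list)
    for row in rows:
        groups[section_of(row["path"])].append(row)
    return dict(groups)
```

Each resulting group can then be run through the import as a separate sequence, which keeps a failure in one section from blocking the others.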
Solutions/Tools
  • TW and Migrate modules combo
  • node_import()
  • Drush + custom script
TW & Migrate Module Combo
  • http://drupal.org/project/tw
    • Supports the Migrate module by running views of source data
  • http://drupal.org/project/migrate
    • A flexible framework for migrating content
Migrate Module
Features:
  • Users browse their legacy data using views
  • Support for creating Drupal nodes, users, and comments is included
  • Hooks permit migration of other types of content
  • Provides a dashboard for running mini migrations
  • Drush support
Why I did not choose Migrate
  • Importing to MySQL was not an option; CSV files were used instead
  • Cannot map old URLs to new URLs
node_import()
http://drupal.org/project/node_import
Features:
  • Easy to learn, point and click
  • Uses CSV to upload content
  • Can easily delete previously imported data
  • Can download errors when an import fails, for easy reference when fixing issues
node_import() Problems
  • I can't define a mapping from old URLs to new URLs
  • No Drush support
  • It doesn't save my previous settings for a CSV file
Drush + Custom Script
  • Flexibility: I can do whatever I want with the data
Create your own migration script
[demo]
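The demo itself is not part of this transcript. As a rough illustration of what a custom migration script does, here is a Python sketch of the transform-and-load loop; the talk used Drush with a custom script, so this is a stand-in and not the author's code. The row keys, the "press_release" node type, the date format, and the rule that the legacy URL is path + slug + ".htm" are all assumptions based on the sample row. The save callable is hypothetical and represents whatever actually writes the node.

```python
from datetime import datetime

# Assumed format of the first field in the sample row,
# e.g. "December 13, 2005 3:39:54 PM".
LEGACY_DATE_FORMAT = "%B %d, %Y %I:%M:%S %p"


def to_node(row, node_type="press_release"):
    """Turn one parsed source row into a node-like dict (hypothetical shape)."""
    created = datetime.strptime(row["created"], LEGACY_DATE_FORMAT)
    title = row["title"].strip()
    if not title:
        raise ValueError("missing title")
    return {
        "type": node_type,
        "title": title,
        "body": row["body"],
        "created": created,
        # Assumption: the old URL is the directory path plus the file name.
        "legacy_path": row["path"] + row["slug"] + ".htm",
    }


def migrate(rows, save):
    """Save every row that converts cleanly; collect errors instead of stopping."""
    saved, errors = 0, []
    for index, row in enumerate(rows):
        try:
            save(to_node(row))
            saved += 1
        except (KeyError, ValueError) as exc:
            errors.append((index, str(exc)))
    return saved, errors
```

Collecting errors per row, instead of aborting on the first bad record, gives a list that can be fixed in the source data and re-run, similar in spirit to the error download that node_import offers.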
Issues
  • File Management
  • URL Rewriting
File Management
Client requirements:
  • Intuitive
  • Has WYSIWYG support
  • Access control: upload, edit, delete, revise files by different roles
  • Revision control: optional but good to have
  • Limited time!
File Management Modules
* DbFm was not included due to problems encountered during tests in D6
URL Rewriting
Source: http://www.flickr.com/photos/randomfactor/483264915/
URL Rewriting Solution
Not recommended:
  • .htaccess
    • Too many URLs to handle; too much server load
Recommended:
  • pathauto + path_redirect modules
    • Automated alias settings
    • 301 redirect set
  • Global redirect
Additional reference:
http://acquia.com/blog/migrating-drupal-way-part-ii-saving-those-old-urls
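Whichever modules store the aliases and redirects, the migration still has to produce the old-to-new URL pairs. The Python sketch below builds such a map from the source rows. It is illustrative only: the new alias pattern (legacy directory plus a slug of the title), the numeric suffix used to break ties, and the row keys are assumptions, not the pattern configured in the talk.

```python
import re


def slugify(title):
    """Lowercase the title and join runs of letters and digits with hyphens."""
    return re.sub(r"[^a-z0-9]+", "-", title.lower()).strip("-")


def build_redirects(rows):
    """Map each legacy URL to a unique SEO-friendly alias (301 source -> target)."""
    redirects = {}
    used = set()
    for row in rows:
        old_url = row["path"] + row["slug"] + ".htm"  # assumed legacy URL rule
        directory = row["path"].strip("/")
        slug = slugify(row["title"])
        alias = directory + "/" + slug if directory else slug
        # Keep aliases unique; the -2, -3 suffix scheme is a hypothetical choice.
        candidate, counter = alias, 2
        while candidate in used:
            candidate = "%s-%d" % (alias, counter)
            counter += 1
        used.add(candidate)
        redirects[old_url] = "/" + candidate
    return redirects
```

For the sample row, 05-200.htm under /press/releases/2005/december/ would map to an alias ending in report-spotlights-need-for-reform-in-jackpot-jurisdictions. Keeping the map as plain data makes it easy to load into a redirect table and to reuse later when testing that every old URL still resolves.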
URL Checker
http://drupal.org/project/linkchecker
Access Control Alternative
  • /default/files/PressReleases
  • /default/files/Documents
  • /default/files/International
  • /default/files/International/America
  • /default/files/International/England
  • /default/files/International/Asia
Test, Test and did I say Test?
Source: http://www.flickr.com/photos/paperpariah/2424107350/
Common problems
  • Broken links
  • Misconfigured pages
  • Empty pages
  • Invalid dates
  • File not found or orphan pages
  • Page format
  • Test when CACHE is on
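Several of the problems listed above can be caught by a script before anyone clicks through the site. The Python sketch below checks a list of migrated pages for empty bodies, unparseable dates, and internal links that point to no known page. The page dict shape ("url", "date", "body") and the date format are assumptions for illustration; it does not replace a real link checker run against the live site with caching on.

```python
from datetime import datetime
from html.parser import HTMLParser


class _LinkCollector(HTMLParser):
    """Collect the href of every <a> tag fed to the parser."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def internal_links(html):
    """Return site-relative links (starting with a single slash) found in html."""
    collector = _LinkCollector()
    collector.feed(html)
    return [link for link in collector.links
            if link.startswith("/") and not link.startswith("//")]


def check_pages(pages, date_format="%B %d, %Y"):
    """Return (url, problem) tuples for empty pages, bad dates and broken links."""
    known_urls = {page["url"] for page in pages}
    problems = []
    for page in pages:
        if not page["body"].strip():
            problems.append((page["url"], "empty page"))
        try:
            datetime.strptime(page["date"], date_format)
        except ValueError:
            problems.append((page["url"], "invalid date"))
        for link in internal_links(page["body"]):
            if link not in known_urls:
                problems.append((page["url"], "broken link: " + link))
    return problems
```

Running a check like this after each section is imported keeps the problem list short enough to fix before the next batch.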
Deployment
Deployment
2 ways to deploy your data to the live environment:
  • All at once
  • Divide and conquer
Deployment: Divide and Conquer
  • Example: CNN division
Deployment Mockup
* shadow box is your migrated data's production box
* old CMS is still active at this time
Deployment
  • Coordination between the old CMS and Drupal
URL Testing
Deployment Mockup
* shadow box is your migrated data's production box
* replacing old CMS with Drupal
Deployment
Pros:
  • Less risk, less stress
  • Editors can continue daily data entry
Cons:
  • URL rewriting can be tricky
  • Updating the production box with new content can be an arduous task
Deployment: Updating Production
Automation:
  • SVN
  • Drush scripts to migrate content from the tester's box to the shadow box
  • Deploy: http://drupal.org/project/deploy
Manual:
  • Document configuration changes
  • Document database changes
Recap
  • SDLC + Agile
  • Common questions you should be asking before you start
  • Top 3 tools to do migration in Drupal
    • TW & Migrate, node_import(), Drush
  • Issues
    • File management comparison in D6
    • Tools to use in URL rewriting
  • Testing
  • Deployment solution
Questions?
Resources
  • http://groups.drupal.org/content-migration-import-and-export
  • http://drupal.org/handbook/migrating

Migration from Legacy CMS to Drupal

Editor's Notes

  • #5: Todo – make comparison of normal sdlc to migration of sdlc
  • #14: http://www.flickr.com/photos/14804582@N08/2111269218/