Drupal Migration
Migrating 100,000 pages of content
From Legacy CMS to Drupal
Rachel Jaro
Solutions Architect at PrometSource
www.prometsource.com
Overview
We’ll talk about:
- Successful migration recipe
- Common questions you should be asking before you start
- Top 3 tools to do migration in Drupal
- Issues
  - Tools to use in URL rewriting
  - File management comparison in D6
- Testing
- Deploying solution
Data Migration
“Data migration solutions extract data from a source system, correct errors, reformat, restructure and load the data into a replacement target system.”
It sounds simple, but poorly managed data migration is the most common cause of failure in implementing a replacement system.
-- Gershon Pick, March 2001
Successful Migration Recipe
Planning
Source: http://www.flickr.com/photos/bjornmeansbear/4380595283/
Plan: What to Ask
Node types (content separation, fields)
- Do you want to separate content into pages, articles, biographies, news, etc.?
- What fields are needed for each node type?
- Who can access it?
- Do you really need that content type, or can taxonomies handle similar content instead?
Plan: What to Ask
Taxonomy (categorization, tags)
- Do you need to categorize nodes? Would you need different access?
- What taxonomy groups or vocabularies would you need?
Permissions (per node) and user roles
- Who is going to use the site? What access rights does each group need?
Plan: What to Ask
New URL mapping
- Do you need SEO-friendly URLs?
Files, file permissions and file directory
- Do you need an advanced file management or document management tool?
- Or do you need a simpler solution? How simple?
- Do you need access rights for each folder?
- Do you need a browser-type interface to access files?
- What kinds of files do you need to store? Images, PDFs?
Build
Requirements
- Use CSV files to import data
- Divide the migration into groups or sections
- Map and replace old URLs with SEO-friendly URLs
  Before: 05-200.htm
Data in CSV Example
December 13, 2005 3:39:54 PM||||||||||December 13, 2005||||||||||Report Spotlights Need for Reform in Jackpot Jurisdictions||||||||||/press/releases/2005/december/||||||||||05-200||||||||||{UUID}|||||||||| Economics^^^^^^^^^^Economy ||||||||||<p>Lorem Ipsum is simply dummy text of the printing and typesetting industry. Lorem Ipsum has been the industry's standard dummy text ever since the 1500s, when an unknown printer took a galley of type and scrambled it to make a type specimen book. </p><p>Lorem Ipsum is simply dummy text of the printing and typesetting industry. Lorem Ipsum has been the industry's standard dummy text ever since the 1500s, when an unknown printer took a galley of type and scrambled it to make a type specimen book. </p>$$$$$$$$$$
Separator: ||||||||||
End of Row: $$$$$$$$$$
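
A minimal Python sketch of how an export in this format could be split into records. The column names are assumptions inferred from the sample row (the slide does not name them), and treating `^^^^^^^^^^` as a multi-value separator inside the category field is likewise an inference from the sample, not something the slide states.

```python
FIELD_SEP = "|" * 10   # field separator shown on the slide
ROW_END = "$" * 10     # end-of-row marker shown on the slide
MULTI_SEP = "^" * 10   # assumed: separates multiple values inside one field

# Hypothetical column names, inferred from the order of the sample row.
COLUMNS = ["created", "published", "title", "old_dir",
           "old_name", "uuid", "terms", "body"]


def parse_export(text):
    """Split a raw export into a list of dicts, one per row."""
    records = []
    for raw_row in text.split(ROW_END):
        if not raw_row.strip():
            continue  # skip the empty chunk after the last row marker
        values = [value.strip() for value in raw_row.split(FIELD_SEP)]
        if len(values) != len(COLUMNS):
            raise ValueError(
                "expected %d fields, got %d" % (len(COLUMNS), len(values))
            )
        record = dict(zip(COLUMNS, values))
        # Turn the multi-value field into a clean list of terms.
        record["terms"] = [
            term.strip() for term in record["terms"].split(MULTI_SEP)
            if term.strip()
        ]
        records.append(record)
    return records
```

Long, unusual separators like these are presumably chosen because body HTML can contain commas, quotes and newlines; failing loudly on a wrong field count catches rows where the separator was damaged.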
Content Type Division
Example: CNN.com
- Divide migration sequences into US, World, Politics, Justice, etc.
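
One way to split a migration into sections is to batch records by the first segment of their legacy path. The sketch below is an illustration only: the `old_dir` key and the choice of the first path segment as the section name are assumptions, not part of the original talk.

```python
from collections import defaultdict


def group_by_section(records, path_key="old_dir"):
    """Group records into migration batches keyed by top-level section.

    `path_key` names the (hypothetical) field holding the legacy
    directory, e.g. "/press/releases/2005/december/".
    """
    batches = defaultdict(list)
    for record in records:
        segments = [part for part in record[path_key].split("/") if part]
        section = segments[0] if segments else "(root)"
        batches[section].append(record)
    return dict(batches)
```

Each batch can then be imported, tested and deployed on its own, which is what makes a section-by-section rollout possible later.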
Solutions/Tools
- TW and Migrate modules combo
- node_import()
- Drush + custom script
TW & Migrate Module Combo
- http://drupal.org/project/tw
  Supports the Migrate module by providing views of the source data
- http://drupal.org/project/migrate
  A flexible framework for migrating content
Migrate Module
Features:
- Users browse their legacy data using Views
- Support for creating Drupal nodes, users, and comments is included
- Hooks permit migration of other types of content
- Provides a dashboard for running mini migrations
- Drush support
Why I did not choose Migrate
- Importing to MySQL was not an option; CSV files were used instead
- Cannot map old URLs to new URLs
node_import()
http://drupal.org/project/node_import
Features:
- Easy to learn, point and click
- Uses CSV to upload content
- Can easily delete previously imported data
- Can download the errors when an import fails, for easy reference when fixing issues
node_import() Problems
- I can’t define a mapping from old URLs to new URLs
- No Drush support
- It doesn’t save my previous settings for a CSV
Drush + Custom script
Flexibility
- I can do whatever I want with the data
Create your own migration script
[demo]
Issues
- File Management
- URL Rewriting
File Management
Client requirements
- Intuitive
- Has WYSIWYG support
- Access control – upload, edit, delete, revise files by different roles
- Revision control – optional but good to have
- Limited time!
File Management Modules
* DbFm was not included due to problems encountered during tests in D6
URL Rewriting
Source: http://www.flickr.com/photos/randomfactor/483264915/
URL Rewriting Solution
Not recommended
- .htaccess
- Too many URLs to handle; too much server load
Recommended
- pathauto + path_redirect modules
- automated alias settings
- 301 redirect set
- global redirect
Additional reference: http://acquia.com/blog/migrating-drupal-way-part-ii-saving-those-old-urls
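
The old-to-new URL mapping behind a 301 redirect can be sketched in Python as below. This only illustrates the idea and is not the pathauto or path_redirect API: the `slugify` rule, the `.htm` extension default and the title-based alias pattern are all assumptions made for the example.

```python
import re


def slugify(title):
    """Lowercase a title and collapse non-alphanumeric runs into hyphens."""
    slug = re.sub(r"[^a-z0-9]+", "-", title.lower())
    return slug.strip("-")


def redirect_pair(old_dir, old_name, title, ext=".htm"):
    """Return (old_path, new_path) for one migrated page.

    old_dir and old_name describe the legacy location, e.g.
    "/press/releases/2005/december/" and "05-200".
    """
    prefix = old_dir.strip("/")
    old_path = "/".join(part for part in (prefix, old_name + ext) if part)
    new_path = "/".join(part for part in (prefix, slugify(title)) if part)
    return old_path, new_path


def build_redirects(pages):
    """Map every old path to its new alias, refusing duplicate aliases."""
    redirects = {}
    seen_aliases = set()
    for old_dir, old_name, title in pages:
        old_path, new_path = redirect_pair(old_dir, old_name, title)
        if new_path in seen_aliases:
            raise ValueError("alias collision: " + new_path)
        seen_aliases.add(new_path)
        redirects[old_path] = new_path
    return redirects
```

Keeping the mapping as data, rather than as thousands of rewrite rules in .htaccess, is what lets it be loaded into a redirect table during the import itself.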
URL Checker
http://drupal.org/project/linkchecker
Access control Alternative
/default/files/PressReleases
/default/files/Documents
/default/files/International
/default/files/International/America
/default/files/International/England
/default/files/International/Asia
Test, Test and did I say Test?
Source: http://www.flickr.com/photos/paperpariah/2424107350/
Common problems
- Broken links
- Misconfigured page
- Empty pages
- Invalid date
- File not found or orphan pages
- Page format
- Test when CACHE is on
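
Some of these problems can be caught before import by checking each source record. The following Python sketch assumes records are dicts with hypothetical `published`, `title` and `body` keys, and that dates look like "December 13, 2005"; it covers only the invalid-date and empty-page cases.

```python
from datetime import datetime

# Assumed date layout, e.g. "December 13, 2005" (English month names).
DATE_FORMAT = "%B %d, %Y"


def check_record(record):
    """Return a list of problems found in one source record."""
    problems = []
    try:
        datetime.strptime(record.get("published", ""), DATE_FORMAT)
    except ValueError:
        problems.append("invalid date")
    if not record.get("body", "").strip():
        problems.append("empty page")
    if not record.get("title", "").strip():
        problems.append("missing title")
    return problems


def report(records):
    """Map the index of each bad record to its list of problems."""
    findings = {}
    for index, record in enumerate(records):
        problems = check_record(record)
        if problems:
            findings[index] = problems
    return findings
```

Broken links, orphan pages and cache-related issues still have to be tested against the running site, for example with a link checker.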
Deployment
Deployment
2 ways to deploy your data to the live environment
- All at once
- Divide and conquer
Deployment: Divide and Conquer
Example: CNN Division
Deployment Mockup
* shadow box is your migrated data’s production box
* old CMS is still active at this time
Deployment
Coordination between the old CMS and Drupal
URL Testing
Deployment Mockup
* shadow box is your migrated data’s production box
* replacing old CMS with Drupal
Deployment
Pros
- Less risk, less stress
- Editors can continue daily data entry
Cons
- URL rewriting can be tricky
- Updating the production box with new content can be an arduous task
Deployment: Updating Production
Automation
- SVN
- Drush scripts to migrate content from the tester’s box to the shadow box
- Deploy – http://drupal.org/project/deploy
Manual
- Document configuration changes
- Document database changes
Recap
- SDLC + Agile
- Common questions you should be asking before you start
- Top 3 tools to do migration in Drupal
  - TW & Migrate, node_import(), Drush
- Issues
  - File management comparison in D6
  - Tools to use in URL rewriting
- Testing
- Deployment solution
Questions?
Resources
- http://groups.drupal.org/content-migration-import-and-export
- http://drupal.org/handbook/migrating

Editor's Notes

  • #5: Todo – make comparison of normal sdlc to migration of sdlc
  • #14: http://www.flickr.com/photos/14804582@N08/2111269218/