SlideShare a Scribd company logo
Confidential & Proprietarywww.dclab.comwww.dclab.com
Converting and Integrating Content When
Moving to a New CMS
Greg Fagan,
Sales Director,
Data Conversion Laboratory (DCL)
Confidential & Proprietarywww.dclab.com 2
Valuable Content Transformed
• Document Digitization
• XML and HTML Conversion
• eBook Production
• Hosted Solutions
• Big Data Automation
• Conversion Management
• Editorial Services
• Harmonizer
Confidential & Proprietarywww.dclab.com 3
Experience the DCL Difference
DCL blends years of conversion experience with cutting-edge technology and the
infrastructure to make the process easy and efficient.
• World-Class Services
• Leading-Edge Technology
• Unparalleled Infrastructure
• US-Based Management
• Complex-Content Expertise
• 24/7 Online Project Tracking
• Automated Quality Control
• Global Capabilities
Confidential & Proprietarywww.dclab.com
We Serve a Very Broad Client Base . . .
4
Confidential & Proprietarywww.dclab.com 5
. . . Spanning All Industries
• Aerospace
• Associations
• Defense
• Distribution
• Education
• Financial
• Government
• Libraries
• Life Sciences
• Manufacturing
• Medical
• Museums
• Periodicals
• Professional
• Publishing
• Reference
• Research
• Societies
• Software
• STM
• Technology
• Telecommunications
• Universities
• Utilities
Confidential & Proprietarywww.dclab.com 6
So You’re Implementing a New CMS – Great. Now
What?
Confidential & Proprietarywww.dclab.com
Key Considerations
• Content structure
• More than likely, that structure will be XML
• Which XML schema is appropriate – DITA, DocBook, XHTML?
• Is one schema better than the others?
• What’s the plan for legacy content?
7
Confidential & Proprietarywww.dclab.com
• Multichannel publishing
• Content repurposing
• Content reuse
• Easier updating
• Avoiding multiple conversions
• Some/all of the above
8
What Are Your Business Drivers?
Confidential & Proprietarywww.dclab.com
• Can’t just be moved from one format to another
• Non-XML sources embed formatting – not applicable to other
outputs
• Tool-specific formats make your content dependent on
functionality of that tool
9
Content Reuse is Hard!
Confidential & Proprietarywww.dclab.com
So You Need XML, but Which Schema?
10
Confidential & Proprietarywww.dclab.com
So Why DITA?
• Works across all outputs
• Can be customized to different content types (educational,
financial, legal, etc.)
• Can produce both HTML5 and EPUB from DITA with open-
source tools
• Can do everything DocBook can, but reverse not true
• XHTML not a true schema
11
Confidential & Proprietarywww.dclab.com
DITA with Your CMS
• Your CMS should support several different output targets
• DITA provides the consistent structure and flexibility to do
that
• New content will be authored in DITA
• But what about…
12
Confidential & Proprietarywww.dclab.com
Your Legacy Content?!
13
Confidential & Proprietarywww.dclab.com
• Not as scary as it seems
• Prioritize and convert in stages
• Consider conversion before selecting a CMS
• Consider a pilot program before committing fully
14
Converting and Integrating Your Legacy Content
Confidential & Proprietarywww.dclab.comwww.dclab.com
Lessons from 12 DITA
Implementations
DCL Survey of 12 Companies
Confidential & Proprietarywww.dclab.com
Why DITA? Top 3 Answers
• Reduce need for composition
• Content reuse
• Reduce translation costs
16
Confidential & Proprietarywww.dclab.com
What Were the Business Drivers?
• Top answer was multi-purposing
– Ability for various teams to use content to suit their
particular needs
– Deploying chunks of content for multiple purposes
dramatically reduced costs, and improved overall reliability
17
Confidential & Proprietarywww.dclab.com
How Long Did It Take?
• Average implementation took three years
• Some took only two years; others five
• Some respondents believed implementation to be an ongoing
process (never completed)
• Across the board, however, it took far longer than planned
18
Confidential & Proprietarywww.dclab.com
When Did They Choose CMS?
• Half of respondents selected CMS at beginning of process
• Other half after running pilot programs
• Companies that implemented CMS later converted content
first then selected CMS based on data requirements
• Two companies switched to different CMS during testing
phase
19
Confidential & Proprietarywww.dclab.com
How Was Success Measured?
• Multi-purposing was top criterion, with these notable
benefits:
– Publishing content in multiple formats such as PDF and print
– Developing training and help systems
– Customizing marketing and sales collateral
– Changing styling, layout, and design while maintaining the copy
– Producing HTML and eBooks since content was standardized
20
Confidential & Proprietarywww.dclab.com
How Are You Maximizing Benefits of Content Reuse?
• Only two of the 12 companies actively reusing content
• Built extensive rewriting phase into plan
• Extended implementation time but critical to overall success
• Other companies cited size of project and drastic change to
authoring process as reasons for not implementing reuse plan
up front
21
Confidential & Proprietarywww.dclab.com
Do You Often Need Translation?
• Four of 12 companies doing heavy translation
• All reported significant saving from data standardization, even
without content reuse
• Rest of companies viewed translation as essential to future
plans
22
Confidential & Proprietarywww.dclab.com
How Did Conversion Go?
• All 12 companies felt it went smoothly
• Many didn’t do conversion at initial stages and opted for
extensive rewriting, which they regretted
• Same companies held off on legacy conversion until after
implementation, which like rewriting, wasn’t efficient
23
Confidential & Proprietarywww.dclab.com
Did DITA Work Out of the Box?
• All 12 companies reported that it did
• Only 3 reported using DITA specialization, mostly for minor
items
• All were working with technical documents, so specialization
wasn’t an issue in these cases
24
Confidential & Proprietarywww.dclab.com
Lessons Learned
• Consensus for more data clean-up before conversion
• Small pilot programs are useful
• Underestimated adapting to DITA authoring and training
needs
• Support from management crucial
25
Confidential & Proprietarywww.dclab.com 26
Confidential & Proprietarywww.dclab.com 27
Benefits and Drivers for
Content Conversion
Confidential & Proprietarywww.dclab.com 28
The Value of Structured Content
Increase Revenues
 Improve customer service
 Decrease time to market
 Expand into new markets
 Create data versatility
 Enhance discoverability
Decrease Expenses
 Increase authoring productivity
 Reduce publishing costs
 Increase information reuse
 Reduce translation costs
 Future-proof data
Successful business strategies are driven by content!
Confidential & Proprietarywww.dclab.com 29
Can your content keep up with changing technology?
 Data drives every aspect of a business from engineering and development
to maintenance, repair and operations, sales, customer service, marketing,
and more
 Documents are often converted in order to comply with law, industry
standards, or to support distribution partners and meet consumers'
expectations
 Data conversion is most desirable for its potential to lower costs by making
data easier to manage, update, reproduce, and syndicate
 Structured formatting enables content to be delivered any where at any
time on any device imaginable
Confidential & Proprietarywww.dclab.com 30
Re-purposing
Searching
Component Reuse
Enforce Data Standards
Interchange with Vendors, Customers, & World
 Creating new versions of data suitable for derivative uses
(e.g. the web, diagnostic equipment, hand-held devices,
voice devices)
 Ability to find information through text searches and
through more advanced searches that depend on context
and “understanding”
 Ability to reuse portions of data for different products and
different documentation sets
 Ability to assure that the information produced is
produced consistently and meets corporate standards
 Ability for others to use your information for
communications with others and to incorporate into
products belonging to other organizations
Various Uses for Structured Content
Confidential & Proprietarywww.dclab.com
• Plan… plan… plan
• Prepare your teams and manage attitudes and expectations
accordingly
• Phase your project for increased manageability
• Establish multiple checkpoints and test often
• DON’T GO IT ALONE!
31
Key Takeaways
Confidential & Proprietarywww.dclab.com 32
Q&A
Greg Fagan
Sales Director,
Data Conversion Laboratory
(908) 723-1884
gfagan@dclab.com
@dclaboratory

More Related Content

PPTX
Converting and Integrating Legacy Data and Documents When Implementing a New CMS
PPTX
What are the Strengths and Weaknesses of DITA Adoption?
PPTX
Converting and Transforming Technical Graphics
PPTX
Developing and Implementing a QA Plan During Your Legacy Data to S1000D
PPT
Is Your Enterprise “fire-fighting” translation issues? Optimize the process w...
PPTX
Content Development: Measuring the Trends
PPTX
Managing Deliverable-Specific Link Anchors: New Suggested Best Practice for Keys
PPTX
Minimalism Revisited — Let’s Stop Developing Content that No One Wants
Converting and Integrating Legacy Data and Documents When Implementing a New CMS
What are the Strengths and Weaknesses of DITA Adoption?
Converting and Transforming Technical Graphics
Developing and Implementing a QA Plan During Your Legacy Data to S1000D
Is Your Enterprise “fire-fighting” translation issues? Optimize the process w...
Content Development: Measuring the Trends
Managing Deliverable-Specific Link Anchors: New Suggested Best Practice for Keys
Minimalism Revisited — Let’s Stop Developing Content that No One Wants

What's hot (20)

PDF
DITA: From “Do I?” to “Done It!”: An Automotive Case Study that can apply to ...
PDF
From Zero to DITA in about 60 Minutes
PDF
Lessons Learned... Migration to DITA During Corporate Acquisitions
PDF
10 Ways DITA Can Help Drive a Unified Strategy
PDF
451 Research + NuoDB: What It Means to be a Container-Native SQL Database
PDF
Box Use Case Matrix - FINAL (external)
PDF
Lower Cost and Complexity with Azure and StorSimple Hybrid Cloud Solutions
PPT
IBM InfoSphere MDM v11 Overview - Aomar BARIZ
PDF
Best Practices for a Successful SharePoint Migration or Upgrade to the Cloud
PDF
MDM for product data with Talend
PPT
Data Warehouse Methodology
PDF
Mcgraw Hill Construction - Print Production Case Study
PDF
Technip Multidomain MDM Journey
PPT
[EN] Trends in Records, Document and Enterprise Content Management | Ulrich K...
PDF
Google for Work Applications: Enterprise-Class Collaboration and Search Integ...
PPTX
DITA as Interchange Format for Crowdsourcing and Acquisitions
PDF
Plan a Successful Information Management Solution Implementation
PDF
Agile NoSQL With XRX
PPTX
DITA for Small Teams Workshop (Tekom 2017)
PDF
The technology of the business data lake
DITA: From “Do I?” to “Done It!”: An Automotive Case Study that can apply to ...
From Zero to DITA in about 60 Minutes
Lessons Learned... Migration to DITA During Corporate Acquisitions
10 Ways DITA Can Help Drive a Unified Strategy
451 Research + NuoDB: What It Means to be a Container-Native SQL Database
Box Use Case Matrix - FINAL (external)
Lower Cost and Complexity with Azure and StorSimple Hybrid Cloud Solutions
IBM InfoSphere MDM v11 Overview - Aomar BARIZ
Best Practices for a Successful SharePoint Migration or Upgrade to the Cloud
MDM for product data with Talend
Data Warehouse Methodology
Mcgraw Hill Construction - Print Production Case Study
Technip Multidomain MDM Journey
[EN] Trends in Records, Document and Enterprise Content Management | Ulrich K...
Google for Work Applications: Enterprise-Class Collaboration and Search Integ...
DITA as Interchange Format for Crowdsourcing and Acquisitions
Plan a Successful Information Management Solution Implementation
Agile NoSQL With XRX
DITA for Small Teams Workshop (Tekom 2017)
The technology of the business data lake
Ad

Viewers also liked (20)

PPTX
Making the Most of the New Math Specializations in DITA 1.3
PPTX
Data-Driven User Experience
PPTX
Optimizing the DITA Authoring Experience
PPTX
10 Mistakes When Moving to Topic-Based Authoring
PPTX
Content Conversion Done Right Saves More Than Money
PPTX
Using HTML5 to Deliver and Monetize Your Mobile Content
PPTX
Precision Content™ Tools, Techniques, and Technology
PPT
When Conversion Makes Sense
PPTX
There's Gold in Them Thar Data
PPTX
Content Engineering and The Internet of “Smart” Things
PPTX
DITA's New Thang: Going Mapless!
PPTX
New Directions 2015 – Changes in Content Best Practices
PPTX
Metadata Matters
PPTX
Anticipating Lightweight DITA
PPTX
DITA for Small Teams: An Open Source Approach to DITA Content Management
PPTX
Demystifying SPL for Medical Devices
PPTX
DITA, EPUB, and HTML5: An Update for 2015
PPTX
Coming Up to Speed with XML Authoring in Adobe FrameMaker
PPTX
Out of the Silos and Into the Farm
PPTX
Finding Role Clarity in UX Chaos
Making the Most of the New Math Specializations in DITA 1.3
Data-Driven User Experience
Optimizing the DITA Authoring Experience
10 Mistakes When Moving to Topic-Based Authoring
Content Conversion Done Right Saves More Than Money
Using HTML5 to Deliver and Monetize Your Mobile Content
Precision Content™ Tools, Techniques, and Technology
When Conversion Makes Sense
There's Gold in Them Thar Data
Content Engineering and The Internet of “Smart” Things
DITA's New Thang: Going Mapless!
New Directions 2015 – Changes in Content Best Practices
Metadata Matters
Anticipating Lightweight DITA
DITA for Small Teams: An Open Source Approach to DITA Content Management
Demystifying SPL for Medical Devices
DITA, EPUB, and HTML5: An Update for 2015
Coming Up to Speed with XML Authoring in Adobe FrameMaker
Out of the Silos and Into the Farm
Finding Role Clarity in UX Chaos
Ad

Similar to Converting and Integrating Content When Implementing a New CMS (20)

PDF
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
PPTX
Why Business is Better in the Cloud
PPTX
Building a Modern Analytic Database with Cloudera 5.8
PDF
Neo4j PartnerDay Amsterdam 2017
PPT
Agile Data Architecture
PPTX
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
PDF
Webinar: The 5 Most Critical Things to Understand About Modern Data Integration
PPTX
Partner Recruitment Webinar: "Join the Most Productive Ecosystem in Big Data ...
PPTX
Emea partners recruitment webinar
PDF
ASTC Conference 2024 - Tools, Trends, Technologies
PDF
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced Analytics
PPTX
The Future of IT Infrastructure is Hybrid and on Demand
PPTX
Webinar: How Partners Can Benefit from our New Program (EMEA)
PPTX
Data integration case study: Oil & Gas industry
PPTX
10 Million Dita Topics Can't Be Wrong
PPTX
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with Hadoop
PDF
Modernize Your Content Publishing Process with Smart Content
PPT
Choosing Public vs. Private vs. Hybrid Cloud Computing
PPTX
8 Things to Consider as SharePoint Moves to the Cloud
PDF
OPEN'17_4_Postgres: The Centerpiece for Modernising IT Infrastructures
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Why Business is Better in the Cloud
Building a Modern Analytic Database with Cloudera 5.8
Neo4j PartnerDay Amsterdam 2017
Agile Data Architecture
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
Webinar: The 5 Most Critical Things to Understand About Modern Data Integration
Partner Recruitment Webinar: "Join the Most Productive Ecosystem in Big Data ...
Emea partners recruitment webinar
ASTC Conference 2024 - Tools, Trends, Technologies
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced Analytics
The Future of IT Infrastructure is Hybrid and on Demand
Webinar: How Partners Can Benefit from our New Program (EMEA)
Data integration case study: Oil & Gas industry
10 Million Dita Topics Can't Be Wrong
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with Hadoop
Modernize Your Content Publishing Process with Smart Content
Choosing Public vs. Private vs. Hybrid Cloud Computing
8 Things to Consider as SharePoint Moves to the Cloud
OPEN'17_4_Postgres: The Centerpiece for Modernising IT Infrastructures

More from dclsocialmedia (6)

PPTX
Preparing Your Legacy Data for Automation in S1000D
PPTX
Introduction to Structured Authoring
PPTX
Automating Complex High-Volume Technical Paper and Journal Article Page Compo...
PPTX
Converting Your Legacy Data to S1000D
PPTX
Marketing and Strategy and Bears... oh my!
PPTX
Managing Documentation Projects in Nearly Any Environment
Preparing Your Legacy Data for Automation in S1000D
Introduction to Structured Authoring
Automating Complex High-Volume Technical Paper and Journal Article Page Compo...
Converting Your Legacy Data to S1000D
Marketing and Strategy and Bears... oh my!
Managing Documentation Projects in Nearly Any Environment

Recently uploaded (20)

PPT
E commerce busin and some important issues
PPTX
Introduction to Essence of Indian traditional knowledge.pptx
PPTX
Who’s winning the race to be the world’s first trillionaire.pptx
PPTX
What is next for the Fractional CFO - August 2025
PPTX
kyc aml guideline a detailed pt onthat.pptx
PDF
Copia de Minimal 3D Technology Consulting Presentation.pdf
PDF
Corporate Finance Fundamentals - Course Presentation.pdf
PDF
Predicting Customer Bankruptcy Using Machine Learning Algorithm research pape...
PDF
financing insitute rbi nabard adb imf world bank insurance and credit gurantee
PDF
Spending, Allocation Choices, and Aging THROUGH Retirement. Are all of these ...
PDF
ECONOMICS AND ENTREPRENEURS LESSONSS AND
PDF
ABriefOverviewComparisonUCP600_ISP8_URDG_758.pdf
PDF
illuminati Uganda brotherhood agent in Kampala call 0756664682,0782561496
PDF
Dialnet-DynamicHedgingOfPricesOfNaturalGasInMexico-8788871.pdf
PPTX
How best to drive Metrics, Ratios, and Key Performance Indicators
PPTX
Session 11-13. Working Capital Management and Cash Budget.pptx
PDF
discourse-2025-02-building-a-trillion-dollar-dream.pdf
PPTX
4.5.1 Financial Governance_Appropriation & Finance.pptx
PPTX
Session 3. Time Value of Money.pptx_finance
PPTX
Globalization-of-Religion. Contemporary World
E commerce busin and some important issues
Introduction to Essence of Indian traditional knowledge.pptx
Who’s winning the race to be the world’s first trillionaire.pptx
What is next for the Fractional CFO - August 2025
kyc aml guideline a detailed pt onthat.pptx
Copia de Minimal 3D Technology Consulting Presentation.pdf
Corporate Finance Fundamentals - Course Presentation.pdf
Predicting Customer Bankruptcy Using Machine Learning Algorithm research pape...
financing insitute rbi nabard adb imf world bank insurance and credit gurantee
Spending, Allocation Choices, and Aging THROUGH Retirement. Are all of these ...
ECONOMICS AND ENTREPRENEURS LESSONSS AND
ABriefOverviewComparisonUCP600_ISP8_URDG_758.pdf
illuminati Uganda brotherhood agent in Kampala call 0756664682,0782561496
Dialnet-DynamicHedgingOfPricesOfNaturalGasInMexico-8788871.pdf
How best to drive Metrics, Ratios, and Key Performance Indicators
Session 11-13. Working Capital Management and Cash Budget.pptx
discourse-2025-02-building-a-trillion-dollar-dream.pdf
4.5.1 Financial Governance_Appropriation & Finance.pptx
Session 3. Time Value of Money.pptx_finance
Globalization-of-Religion. Contemporary World

Converting and Integrating Content When Implementing a New CMS

  • 1. Confidential & Proprietarywww.dclab.comwww.dclab.com Converting and Integrating Content When Moving to a New CMS Greg Fagan, Sales Director, Data Conversion Laboratory (DCL)
  • 2. Confidential & Proprietarywww.dclab.com 2 Valuable Content Transformed • Document Digitization • XML and HTML Conversion • eBook Production • Hosted Solutions • Big Data Automation • Conversion Management • Editorial Services • Harmonizer
  • 3. Confidential & Proprietarywww.dclab.com 3 Experience the DCL Difference DCL blends years of conversion experience with cutting-edge technology and the infrastructure to make the process easy and efficient. • World-Class Services • Leading-Edge Technology • Unparalleled Infrastructure • US-Based Management • Complex-Content Expertise • 24/7 Online Project Tracking • Automated Quality Control • Global Capabilities
  • 4. Confidential & Proprietarywww.dclab.com We Serve a Very Broad Client Base . . . 4
  • 5. Confidential & Proprietarywww.dclab.com 5 . . . Spanning All Industries • Aerospace • Associations • Defense • Distribution • Education • Financial • Government • Libraries • Life Sciences • Manufacturing • Medical • Museums • Periodicals • Professional • Publishing • Reference • Research • Societies • Software • STM • Technology • Telecommunications • Universities • Utilities
  • 6. Confidential & Proprietarywww.dclab.com 6 So You’re Implementing a New CMS – Great. Now What?
  • 7. Confidential & Proprietarywww.dclab.com Key Considerations • Content structure • More than likely, that structure will be XML • Which XML schema is appropriate – DITA, DocBook, XHTML? • Is one schema better than the others? • What’s the plan for legacy content? 7
  • 8. Confidential & Proprietarywww.dclab.com • Multichannel publishing • Content repurposing • Content reuse • Easier updating • Avoiding multiple conversions • Some/all of the above 8 What Are Your Business Drivers?
  • 9. Confidential & Proprietarywww.dclab.com • Can’t just be moved from one format to another • Non-XML sources embed formatting – not applicable to other outputs • Tool-specific formats make your content dependent on functionality of that tool 9 Content Reuse is Hard!
  • 10. Confidential & Proprietarywww.dclab.com So You Need XML, but Which Schema? 10
  • 11. Confidential & Proprietarywww.dclab.com So Why DITA? • Works across all outputs • Can be customized to different content types (educational, financial, legal, etc.) • Can produce both HTML5 and EPUB from DITA with open- source tools • Can do everything DocBook can, but reverse not true • XHTML not a true schema 11
  • 12. Confidential & Proprietarywww.dclab.com DITA with Your CMS • Your CMS should support several different output targets • DITA provides the consistent structure and flexibility to do that • New content will be authored in DITA • But what about… 12
  • 14. Confidential & Proprietarywww.dclab.com • Not as scary as it seems • Prioritize and convert in stages • Consider conversion before selecting a CMS • Consider a pilot program before committing fully 14 Converting and Integrating Your Legacy Content
  • 15. Confidential & Proprietarywww.dclab.comwww.dclab.com Lessons from 12 DITA Implementations DCL Survey of 12 Companies
  • 16. Confidential & Proprietarywww.dclab.com Why DITA? Top 3 Answers • Reduce need for composition • Content reuse • Reduce translation costs 16
  • 17. Confidential & Proprietarywww.dclab.com What Were the Business Drivers? • Top answer was multi-purposing – Ability for various teams to use content to suit their particular needs – Deploying chunks of content for multiple purposes dramatically reduced costs, and improved overall reliability 17
  • 18. Confidential & Proprietarywww.dclab.com How Long Did It Take? • Average implementation took three years • Some took only two years; others five • Some respondents believed implementation to be an ongoing process (never completed) • Across the board, however, it took far longer than planned 18
  • 19. Confidential & Proprietarywww.dclab.com When Did They Choose CMS? • Half of respondents selected CMS at beginning of process • Other half after running pilot programs • Companies that implemented CMS later converted content first then selected CMS based on data requirements • Two companies switched to different CMS during testing phase 19
  • 20. Confidential & Proprietarywww.dclab.com How Was Success Measured? • Multi-purposing was top criterion, with these notable benefits: – Publishing content in multiple formats such as PDF and print – Developing training and help systems – Customizing marketing and sales collateral – Changing styling, layout, and design while maintaining the copy – Producing HTML and eBooks since content was standardized 20
  • 21. Confidential & Proprietarywww.dclab.com How Are You Maximizing Benefits of Content Reuse? • Only two of the 12 companies actively reusing content • Built extensive rewriting phase into plan • Extended implementation time but critical to overall success • Other companies cited size of project and drastic change to authoring process as reasons for not implementing reuse plan up front 21
  • 22. Confidential & Proprietarywww.dclab.com Do You Often Need Translation? • Four of 12 companies doing heavy translation • All reported significant saving from data standardization, even without content reuse • Rest of companies viewed translation as essential to future plans 22
  • 23. Confidential & Proprietarywww.dclab.com How Did Conversion Go? • All 12 companies felt it went smoothly • Many didn’t do conversion at initial stages and opted for extensive rewriting, which they regretted • Same companies held off on legacy conversion until after implementation, which like rewriting, wasn’t efficient 23
  • 24. Confidential & Proprietarywww.dclab.com Did DITA Work Out of the Box? • All 12 companies reported that it did • Only 3 reported using DITA specialization, mostly for minor items • All were working with technical documents, so specialization wasn’t an issue in these cases 24
  • 25. Confidential & Proprietarywww.dclab.com Lessons Learned • Consensus for more data clean-up before conversion • Small pilot programs are useful • Underestimated adapting to DITA authoring and training needs • Support from management crucial 25
  • 27. Confidential & Proprietarywww.dclab.com 27 Benefits and Drivers for Content Conversion
  • 28. Confidential & Proprietarywww.dclab.com 28 The Value of Structured Content Increase Revenues  Improve customer service  Decrease time to market  Expand into new markets  Create data versatility  Enhance discoverability Decrease Expenses  Increase authoring productivity  Reduce publishing costs  Increase information reuse  Reduce translation costs  Future-proof data Successful business strategies are driven by content!
  • 29. Confidential & Proprietarywww.dclab.com 29 Can your content keep up with changing technology?  Data drives every aspect of a business from engineering and development to maintenance, repair and operations, sales, customer service, marketing, and more  Documents are often converted in order to comply with law, industry standards, or to support distribution partners and meet consumers' expectations  Data conversion is most desirable for its potential to lower costs by making data easier to manage, update, reproduce, and syndicate  Structured formatting enables content to be delivered any where at any time on any device imaginable
  • 30. Confidential & Proprietarywww.dclab.com 30 Re-purposing Searching Component Reuse Enforce Data Standards Interchange with Vendors, Customers, & World  Creating new versions of data suitable for derivative uses (e.g. the web, diagnostic equipment, hand-held devices, voice devices)  Ability to find information through text searches and through more advanced searches that depend on context and “understanding”  Ability to reuse portions of data for different products and different documentation sets  Ability to assure that the information produced is produced consistently and meets corporate standards  Ability for others to use your information for communications with others and to incorporate into products belonging to other organizations Various Uses for Structured Content
  • 31. Confidential & Proprietarywww.dclab.com • Plan… plan… plan • Prepare your teams and manage attitudes and expectations accordingly • Phase your project for increased manageability • Establish multiple checkpoints and test often • DON’T GO IT ALONE! 31 Key Takeaways
  • 32. Confidential & Proprietarywww.dclab.com 32 Q&A Greg Fagan Sales Director, Data Conversion Laboratory (908) 723-1884 gfagan@dclab.com @dclaboratory

Editor's Notes

  • #2: Good afternoon, everyone! Thanks for joining us for this webinar. Today we’re going to discuss the best formats and practices for content conversion when you’re migrating to a new content management system. I’m Greg Fagan, and I’m the Sales Director for the publishing and financial industries at DCL. Because you’re all busy people, I’ve tried to keep this presentation as concise as possible. I’ll talk for about 15-20 minutes and then open the floor to your questions.
  • #3: Just some quick background information on DCL. We’re content conversion experts. We take content in any format you might have it and convert it to reusable formats for digital output such as XML, SGML, HTML5, DITA, and EPUB. We not only convert your content, but we can enrich it to make it more discoverable, usable, and deliverable to any output format or device. Aside from conversion, we offer a suite of services, including hosting, editorial services, and project management.
  • #4: Our deep experience, sophisticated infrastructure, and ferocious commitment to quality are what set us apart from the pack.
  • #5: We serve a broad range of clients. Myriad large, global companies from many different sectors entrust their content to us.
  • #6: And our clients span a wide array of industries, which speaks to our familiarity and fluency with many different XML schemas. Publishers, societies, pharmaceutical companies, defense contractors, and government agencies are just a few of the types of clients and industries we serve.
  • #7: So you’re implementing a new content management system. Or maybe you’re upgrading an existing one. This means that you’re serious about organizing your content to make it more searchable and retrievable, and that you’re probably keen to reuse and repurpose your content in multiple ways and to multiple outputs. That’s good business practice, and it’s something every organization that provides content should do. Delivering content to your users in the way that they want it is critical to your overall success. So now that you’ve decided to move forward with this new CMS, what’s next?
  • #8: Any content management system requires content to be in some kind of structured format. In most industries, from publishing to financial services to aerospace, just to list a few examples, that structure will likely be some flavor of XML. But which XML schema should you use? Is one better or more appropriate than the others? Sure, you have a plan for the new content that you’ll be entering into your CMS, and you very likely have an authoring tool designed to work with that system. But what about your legacy content? How many years’ worth of content do you have? How is it currently stored – mostly paper, PDFs, bound books – and what’s your plan for integrating it into your CMS? Do you need to convert all of your content now, or can you prioritize and do it in stages? These are all important considerations, and hopefully you’ve thought about them before you decided to implement a new CMS.
  • #9: Think about your business drivers for developing a CMS. Is the goal to publish to multiple channels – print, Web, mobile apps, streaming audio/video? Is it to reuse your content across your enterprise from a single source so that you can streamline content creation and avoid redundancies? Do you want to make updating your content easier? Or maybe you’ve seen the inefficiency of converting your content multiple times for different outputs. More than likely, your business drivers involve some combination or even all of these reasons. After all, the whole point of implementing a CMS is to get your content into a structure that provides greater control and flexibility.
  • #10: In addition to the financial challenges of converting content from one source to another multiple times, the content cannot simply be “moved” from one design to another. Content for books is written to be read from beginning to end. This approach creates dependencies that make it difficult to use the same content in a different order or for a different purpose, such as a mobile app. For example, wording such as “in the previous chapter” is not appropriate in a non-book experience. Non-XML sources embed the formatting into the content. When an author applies a format in a source file, such as an InDesign file, the styling is embedded into the content. Because this styling is not usually applicable in another deliverable, the formatting must be updated for each deliverable type. Tool-specific files lock your content into a dependency on the functionality, including output generation, for that tool. All of these factors contribute to limiting the delivery possibilities for your content.
  • #11: For the most flexibility across all content types, my recommendation would be DITA, which, if you’re not familiar with it, stands for Darwin Information Typing Architecture. DITA is an open standard for creating, managing, and publishing modular content, which is what will be stored in your content management system. It supports the definition of new content types within a comprehensive content ecosystem, and it has been increasingly adopted across a wide range of content disciplines and industries.
  • #12: A few years ago, the common wisdom was that if you were developing narrative content, you should use DocBook, and if you were developing modular or topic-based content, you should use DITA. That was true to an extent but was always somewhat misleading, in my view. Books can be written with DITA and modular content can be authored with DocBook. DITA has really advanced in the last couple of years, to the point where I think it’s superior to DocBook, especially when implementing a CMS. DITA can be published to all outputs, and it can be easily customized (or specialized, to use the preferred terminology of DITA advocates) to many different content types, such as educational, financial, and legal, just to name a few. That’s important for the development of mobile content, apps, and enhanced ebooks. You can produce both HTML5 and EPUB with readily available open-source tools from DITA. And while both have their strong points, DITA is the more flexible schema of the two: it can do everything DocBook can, but the reverse isn’t always true. For example, DITA is better-suited to granular storage of content that you see in a most content management systems. What about XHTML? Although it’s often thought of as one, XHTML is not a true schema; it’s really a document styling format and thus not structured enough for a CMS.
  • #13: One of the essential functions of any content management system is that it should support most if not all output targets, and DITA provides both a consistent XML structure and the flexibility of specialization to do that. How many of us foresaw the advent of all the current deliverable types five years ago? Can you predict the quantity or variation of the deliverables your company will need to create to meet changing user needs in the next five years? If you separate your content from its delivery now, which is what a good CMS does, then you don’t have to try to predict the future; instead, you can future-proof your content so that it’s ready to be transformed into whatever outputs your customers need. Now it’s one thing to do that with new content. Piece of cake, right? You simply set up templates and tools that integrate with your CMS. But what about…
  • #14: That’s a different challenge. So let’s talk about that.
  • #15: Integrating your legacy content with a new CMS is no easy task, but with a logical, well-planned approach, it’s not as daunting as it might seem. A phased approach makes a lot of sense, as it helps you to avoid costly mistakes, like realizing you’ve implemented a system that doesn’t work before you get too far down the road. Prioritizing and converting your content in stages, doing some conversion and learning more about your content requirements before choosing a CMS, and running small pilot programs before getting locked into a CMS, at a cost of hundreds of thousands of dollars, are all good examples of a phased approach.
  • #16: With that in mind, let’s discuss some relevant DITA and CMS lessons in detail. DCL recently conducted a series of interviews with DITA implementers at twelve companies. The intent of the study was to better understand the reality of live implementations vs. the perceptions that exist in the industry. We promised anonymity so we could ensure the results would be representative of the group’s actual findings.
  • #17: The three most popular answers were: Reduced need for composition Content reuse and reduced translation costs. All three resulted in cost savings, decreased time to market and improved internal efficiencies. This isn’t surprising. We know from our own years of experience that having content in a structured format in a content management system has many benefits, with these three among the most cited.
  • #18: The top answer to this question was the ability for various teams to multi-purpose content to suit their particular requirements. Utilizing chunks of content for multiple purposes dramatically reduced costs and improved overall reliability. I referred to multi-purposing in an earlier slide, and it’s highly likely that it’s at or near the top of any organization’s list for moving to a structured content format within a CMS.
  • #19: [Read bullets.] This one came as a surprise to us. But there are ways to speed the process. After all… time is money!
  • #20: Half of the respondents selected their CMS at the beginning of the process. The other half followed after running various pilot programs. The companies that selected a CMS later started doing conversion and getting comfortable with the data first, then selected a management system when they had a better understanding of their own data requirements. Two of the companies had switched from their initial selection to another CMS during the testing phase, which highlights the value and wisdom of running small pilot programs before full implementation. Absent that testing, they might very well have continued down their respective paths with content management systems that weren’t meeting their needs. That would have meant large sums of money spent for poorly implemented solutions. And it also would have resulted in walking papers for those decision makers.
  • #21: Once again, the ability to multi-purpose content was the number one criteria for measuring success and return on investment. Some of the notable savings came from the improved ease and efficiency of the following: [Read bullets 2-6.]
  • #22: Only two of the twelve companies we interviewed were actively taking advantage of content reuse. Yes… we were surprised by this as well. Those two companies had decided upfront to build an extensive re-writing phase into their implementation plan. While this additional phase extended the implementation time, the upfront planning was critical to the success of their overall project. The most common reasons for not implementing a reuse plan up front included projects being too large for anyone to manage or requiring too much rewriting. Notably, many also mentioned the drastic change that would be required for their writers to move to a more modular writing mode and to work more collaboratively and with more guidelines than typically they were accustomed to.
  • #23: Four of the twelve companies were actively doing a lot of translation. All four reported major savings. Even without content reuse, the savings of standardized data in terms of translation were vast and long-lasting. The eight companies who did not translate their data stated that it was a likely future endeavor but that right now, even with globalization, they were able to get away with English alone. All of the respondents said that translation was definitely a future requirement.
  • #24: All twelve companies felt that the conversion went smoothly. However, many didn’t do much conversion in their initial stages. Several had decided, to their later regret, to rewrite most materials from scratch, which simply took way too much time. These organizations ultimately left most of their legacy data unconverted until after the CMS implementation. Two companies initially thought that having the writers do it themselves would be good training, but noted that, in retrospect, this wasn’t a good idea.
  • #25: A major reason for attraction to DITA was that it works out of the box, at least for most. Others can expand its benefits by applying specializations when necessary. The companies we interviewed all agreed that for their materials DITA pretty much worked out of the box, and that standard composition software was for the most part suitable for their needs. Only three of the companies reported using specialization, and those were for minor items like customized document covers. Of course, these organizations were all working with technical documents of one kind or another, which are the type of documents DITA was originally designed for. Other kinds of documents would likely need more specialization, although there are a number of emerging standardized “specializations” for different document types.
  • #26: Let’s talk about lessons learned. When asked what they would do differently, the most common response was “more cleanup of data before conversion.” Many wanted smaller—and simpler—pilots, as well as more time to experiment. They felt they had focused too much on the complex outliers in their pilot, and jumped into production too quickly without enough time to adjust for lessons learned in the pilot. Underestimating the human factor was a common note. Allowing more time for people to adapt to the new system and the philosophy of DITA, as well as earlier training, were also prominent suggestions in hindsight. Finally, buy-in and support from upper management was viewed as critical by all respondents.
  • #27: Here is a table of common content pain points and how DITA implementation solves them.
  • #28: So what are the benefits and drivers for content conversion, specifically when converting to a new CMS?
  • #29: Well-structured content has many benefits, with the most important being that it can increase revenue by decreasing time to market and enabling new product development. It also decreases expenses, such as publishing and translation costs, over time, which makes it a smart investment. Often legacy content is more complex and difficult to manage than new content. In many cases, it was designed for one specific output and not much thought was given to proper storage, retrieval, or reusability. There are also different document types, formats, and levels of complexity, like heavy math and tabular material that was never meant for digital output. This is where the help of a trusted partner can be invaluable in helping you identify, categorize, and convert your content to a well-structured format. Your content should drive your business strategy.
  • #30: But you can’t structure your content and think your work is done. It’s an ongoing process to keep up with industry standards, compliance, and constantly evolving outputs. Once the major work is done, however, the changes are much easier to manage, and your content is ready for delivery to any output. Content drives every aspect of your business, so make sure yours is ready to take you in the right direction.
  • #31: Structured content has many uses, with reuse and repurposing the most important in my mind. Why? Because they generate revenue. The others are important, too. Different industries have differing degrees of importance, but money talks in all of them. When your content is structured at a granular level, you can assemble the different components into new products and new revenue sources.
  • #32: So the key considerations for conversion when implementing a CMS are as follows: You must plan thoroughly and then be prepared to adjust once theory turns into practice. To quote General Dwight Eissenhower, “No battle was ever won according to plan, but no battle was ever one without one.” Prepare your teams and manage expectations. Try to anticipate problems before they occur. That’s easier said than done sometimes, but it’s the key to good project management. Implement your conversion and your CMS in phases. Pilot projects are a great way to discover and head off potential problems before you head too far down the wrong path. Establish multiple checkpoints and milestones and test your system often with real users and real content. The people who will have to use the CMS every day are the people who will provide the most valuable feedback. And finally, we give this advice often, but it’s always worth repeating: Don’t go it alone! For a project of this scope, you’ll need outside expertise. Bringing in the right expertise is almost always more cost-effective than trying to manage every aspect of a large-scale project inhouse.
  • #33: I’d like to thank you for tuning in today. Feel free to contact me directly anytime; my contact information is there on the screen. Now I’m happy to take your questions.