SlideShare a Scribd company logo
Confidential & Proprietarywww.dclab.comwww.dclab.com
Developing and Implementing a QA Plan
When Converting Your Legacy Data
Naveh Greenberg,
Director, U.S. Defense Development,
Data Conversion Laboratory
Confidential & Proprietarywww.dclab.com 2
Valuable Content Transformed
• Document Digitization
• XML and HTML Conversion
• eBook Production
• Hosted Solutions
• Big Data Automation
• Conversion Management
• Editorial Services
• Harmonizer
Confidential & Proprietarywww.dclab.com 3
Experience the DCL Difference
DCL blends years of conversion experience with cutting-edge technology and the
infrastructure to make the process easy and efficient.
• World-Class Services
• Leading-Edge Technology
• Unparalleled Infrastructure
• US-Based Management
• Complex-Content Expertise
• 24/7 Online Project Tracking
• Automated Quality Control
• Global Capabilities
Confidential & Proprietarywww.dclab.com
We Serve a Very Broad Client Base . . .
4
Confidential & Proprietarywww.dclab.com 5
. . . Spanning All Industries
• Aerospace
• Associations
• Defense
• Distribution
• Education
• Financial
• Government
• Libraries
• Life Sciences
• Manufacturing
• Medical
• Museums
• Periodicals
• Professional
• Publishing
• Reference
• Research
• Societies
• Software
• STM
• Technology
• Telecommunications
• Universities
• Utilities
Confidential & Proprietarywww.dclab.com 6
Agenda
• What makes conversion difficult?
• Planning for a good conversion experience
• Implementing your plan
• Examples
• Q&A
Confidential & Proprietarywww.dclab.com 7
What Makes Conversion Difficult
• The usual conversion issues
– Accuracy of the transferred text
– Tables
– Math & Special Characters
– How to determine correct hierarchy.
– Pages & most formatting are not in the XML/SGML
– Irrelevant Cross-References
– Identifying reusable content
– Writer Creativity in Source Material
• And the people issues
– Getting used to a new “document” paradigm
– Agreeing on conversion rules
– Involving all stakeholders
Confidential & Proprietarywww.dclab.com 8
Most Importantly – Plan!!!
• Ask the important initial questions
• Who are the stakeholders. Who is the final client/user?
• What is the estimated volume and deadline?
• What is the standard ?
• What CMS or rendering tools will be used?
• What are we starting with? Not all source data are created equal.
• Budget?
• Learn from others.
• Join discussion groups.
• Case studies & Lesson Learn.
• Prepare for the next step
• Get your hands on the source data, schemas, sample of converted data.
• Build a solid team.
• Develop a solid process.
“If I had eight hours to chop down a tree, I'd spend six sharpening my ax.”
Confidential & Proprietarywww.dclab.com
“If I had eight hours to chop
down a tree, I'd spend six
sharpening my ax.”
- Abraham Lincoln
DCL’s Project Start-up Methodology
Confidential & Proprietarywww.dclab.com
Inventory & Assessment
• Log the batches received into a production control system.
• By logging and tracking each unit you can gather information
that can be used to:
– Project delivery schedules
– Confirm that processes are working properly
– Track each unit and show you in what step of the production
process it’s in.
Confidential & Proprietarywww.dclab.com 11
Inventory & Assessment: What to Convert, and in What Order
• Categorizing
– Active documents in good shape
– Active documents that need a lot of work
– Somewhat inactive document that will likely be retired
– Archival materials
• Prioritizing
– Documents that are most used
– Documents that are customer favorites
– Documents with longest product life
– Start with most recent documents and go back
• Identifying the process
– Can be converted as is
– Can be converted with some work
– Needs to be rewritten
– Don’t convert – just keep archival copies
Confidential & Proprietarywww.dclab.com
Why Is Reuse Analysis Important?
• Increased consistency
• Reduced development time
• Lower maintenance costs
• Rapid reconfiguration
• Find Typos or Applicability
• Divide and conquer
Confidential & Proprietarywww.dclab.com 13
Content Reuse Analysis Reports
• Finding exact or similar text will help you when mapping to Data Modules
• It will also help to detect applicability and inconsistencies
Confidential & Proprietarywww.dclab.com 14
Document Analysis – Text extraction
Confidential & Proprietarywww.dclab.com 15
Document Analysis – Text extraction
Confidential & Proprietarywww.dclab.com 16
Document Analysis – Text extraction
Confidential & Proprietarywww.dclab.com
The Conversion Specification
17
Confidential & Proprietarywww.dclab.com
The Conversion Specification
18
Confidential & Proprietarywww.dclab.com 19
Normalizing Your Data
<para>1. Clean the Engine.</para>
<step1><para>Clean the Engine.</para></step1>
<seqlist><item>Clean the Engine.</item></seqlist>
<entry>1.</entry><entry>Clean the Engine. </entry>
Confidential & Proprietarywww.dclab.com
Normalizing Your Data
20
Confidential & Proprietarywww.dclab.com
Normalizing Your Data
21
Confidential & Proprietarywww.dclab.com
Normalizing Your Data
22
Confidential & Proprietarywww.dclab.com
Normalizing Your Data
23
Confidential & Proprietarywww.dclab.com
Normalizing Your Data
24
Confidential & Proprietarywww.dclab.com
Normalizing Your Data
25
Confidential & Proprietarywww.dclab.com
Normalizing Your Data
26
Confidential & Proprietarywww.dclab.com
Normalizing Your Data
27
Confidential & Proprietarywww.dclab.com
Normalizing Your Data
28
Confidential & Proprietarywww.dclab.com
Normalizing Your Data
Confidential & Proprietarywww.dclab.com
Normalizing Your Data
30
Confidential & Proprietarywww.dclab.com
Normalizing Your Data
31
Confidential & Proprietarywww.dclab.com
Viewing Your Converted Data while QC
32
Confidential & Proprietarywww.dclab.com 33
Q&A
Naveh Greenberg
Director, U.S. Defense Development,
Data Conversion Laboratory
(718) 307-5758
ngreenberg@dclab.com
@dclaboratory

More Related Content

PPTX
Content Development: Measuring the Trends
PPTX
Converting and Integrating Legacy Data and Documents When Implementing a New CMS
PPTX
Preparing Your Legacy Data for Automation in S1000D
PPTX
What are the Strengths and Weaknesses of DITA Adoption?
PPTX
Data-Driven User Experience
PPTX
Managing Deliverable-Specific Link Anchors: New Suggested Best Practice for Keys
PPT
Is Your Enterprise “fire-fighting” translation issues? Optimize the process w...
PPTX
Content Engineering and The Internet of “Smart” Things
Content Development: Measuring the Trends
Converting and Integrating Legacy Data and Documents When Implementing a New CMS
Preparing Your Legacy Data for Automation in S1000D
What are the Strengths and Weaknesses of DITA Adoption?
Data-Driven User Experience
Managing Deliverable-Specific Link Anchors: New Suggested Best Practice for Keys
Is Your Enterprise “fire-fighting” translation issues? Optimize the process w...
Content Engineering and The Internet of “Smart” Things

What's hot (20)

PPTX
Anticipating Lightweight DITA
PPTX
Introduction to Structured Authoring
PPTX
Optimizing the DITA Authoring Experience
PPTX
DITA's New Thang: Going Mapless!
PPTX
New Directions 2015 – Changes in Content Best Practices
PPTX
Improve your Chances for Documentation Success with DITA and a CCMS LavaCon L...
PDF
DataOps - Lean principles and lean practices
PPTX
Managing the Complexities of Conversion to S1000D
PDF
Sprinting to Success: Why Agile and DITA Work So Well Together
PPTX
10 Million Dita Topics Can't Be Wrong
PDF
Stored Procedure Superpowers: A Developer’s Guide
PPTX
4D Pubs - Distributed Dynamic Document Dsplay
PDF
Is DITA Right for You? - STC Summit 2017
PDF
Produce Reliable Content with DITA CMS
PDF
Using Markdown and Lightweight DITA in a Collaborative Environment
PPTX
DITA for Small Teams
PPTX
DITA and Agile Are Made For Each Other
PDF
The Evolution of DITAs
PDF
The right side of speed - learning to shift left
PDF
Eat Your Data and Have It Too: Get the Blazing Performance of In-Memory Opera...
Anticipating Lightweight DITA
Introduction to Structured Authoring
Optimizing the DITA Authoring Experience
DITA's New Thang: Going Mapless!
New Directions 2015 – Changes in Content Best Practices
Improve your Chances for Documentation Success with DITA and a CCMS LavaCon L...
DataOps - Lean principles and lean practices
Managing the Complexities of Conversion to S1000D
Sprinting to Success: Why Agile and DITA Work So Well Together
10 Million Dita Topics Can't Be Wrong
Stored Procedure Superpowers: A Developer’s Guide
4D Pubs - Distributed Dynamic Document Dsplay
Is DITA Right for You? - STC Summit 2017
Produce Reliable Content with DITA CMS
Using Markdown and Lightweight DITA in a Collaborative Environment
DITA for Small Teams
DITA and Agile Are Made For Each Other
The Evolution of DITAs
The right side of speed - learning to shift left
Eat Your Data and Have It Too: Get the Blazing Performance of In-Memory Opera...
Ad

Similar to Developing and Implementing a QA Plan During Your Legacy Data to S1000D (20)

PPTX
Content Conversion Done Right Saves More Than Money
PPTX
There's Gold in Them Thar Data
PPTX
Wheeles Webinar Slides - 9-16
PPTX
Creating a Hybrid Approach to Legacy Conversion
PPTX
Converting and Integrating Content When Implementing a New CMS
PPTX
10 tough decisions donor data migration decisions (Webinar hosted by Bloomera...
PDF
Nonprofit data migration webinar 02.20.2014
PDF
Nonprofit data migration: You can't take it all with you Webinar
PDF
10 Decisions You Will Face With Any Donor Data Migration Project
PPT
Preparing Your Data for ECM
PPT
Grace Currie Ann Jebson First Things First
PPTX
DMM9 - Data Migration Testing
PDF
Ax 2012 R3 Legacy Data Migration
PPTX
Reaping the Rewards of Imaging: Designing & Implementing an Imaging Project
 
PPTX
Electronic Document Management Case Study
PDF
5 Steps To Master Data Management
PPTX
ARMA Denver Implementing An ECM System Final 6_21_2016
PDF
Practice Tips for Successful Discovery Projects
PDF
Effectively Capturing Paper and Digital Documents in your Existing Applicatio...
PPTX
Developing a plan for your imaging project
 
Content Conversion Done Right Saves More Than Money
There's Gold in Them Thar Data
Wheeles Webinar Slides - 9-16
Creating a Hybrid Approach to Legacy Conversion
Converting and Integrating Content When Implementing a New CMS
10 tough decisions donor data migration decisions (Webinar hosted by Bloomera...
Nonprofit data migration webinar 02.20.2014
Nonprofit data migration: You can't take it all with you Webinar
10 Decisions You Will Face With Any Donor Data Migration Project
Preparing Your Data for ECM
Grace Currie Ann Jebson First Things First
DMM9 - Data Migration Testing
Ax 2012 R3 Legacy Data Migration
Reaping the Rewards of Imaging: Designing & Implementing an Imaging Project
 
Electronic Document Management Case Study
5 Steps To Master Data Management
ARMA Denver Implementing An ECM System Final 6_21_2016
Practice Tips for Successful Discovery Projects
Effectively Capturing Paper and Digital Documents in your Existing Applicatio...
Developing a plan for your imaging project
 
Ad

More from dclsocialmedia (14)

PPTX
Minimalism Revisited — Let’s Stop Developing Content that No One Wants
PPTX
Converting and Transforming Technical Graphics
PPTX
DITA for Small Teams: An Open Source Approach to DITA Content Management
PPTX
Metadata Matters
PPTX
Using HTML5 to Deliver and Monetize Your Mobile Content
PPTX
Precision Content™ Tools, Techniques, and Technology
PPT
When Conversion Makes Sense
PPTX
DITA, EPUB, and HTML5: An Update for 2015
PPTX
Automating Complex High-Volume Technical Paper and Journal Article Page Compo...
PPTX
Converting Your Legacy Data to S1000D
PPTX
Marketing and Strategy and Bears... oh my!
PPTX
Finding Role Clarity in UX Chaos
PPTX
Managing Documentation Projects in Nearly Any Environment
PPTX
Coming Up to Speed with XML Authoring in Adobe FrameMaker
Minimalism Revisited — Let’s Stop Developing Content that No One Wants
Converting and Transforming Technical Graphics
DITA for Small Teams: An Open Source Approach to DITA Content Management
Metadata Matters
Using HTML5 to Deliver and Monetize Your Mobile Content
Precision Content™ Tools, Techniques, and Technology
When Conversion Makes Sense
DITA, EPUB, and HTML5: An Update for 2015
Automating Complex High-Volume Technical Paper and Journal Article Page Compo...
Converting Your Legacy Data to S1000D
Marketing and Strategy and Bears... oh my!
Finding Role Clarity in UX Chaos
Managing Documentation Projects in Nearly Any Environment
Coming Up to Speed with XML Authoring in Adobe FrameMaker

Recently uploaded (20)

PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPTX
Cloud computing and distributed systems.
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
Big Data Technologies - Introduction.pptx
PDF
KodekX | Application Modernization Development
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPT
Teaching material agriculture food technology
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Approach and Philosophy of On baking technology
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Spectral efficient network and resource selection model in 5G networks
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Cloud computing and distributed systems.
Network Security Unit 5.pdf for BCA BBA.
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Big Data Technologies - Introduction.pptx
KodekX | Application Modernization Development
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
20250228 LYD VKU AI Blended-Learning.pptx
Diabetes mellitus diagnosis method based random forest with bat algorithm
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Teaching material agriculture food technology
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Chapter 3 Spatial Domain Image Processing.pdf
Approach and Philosophy of On baking technology
Understanding_Digital_Forensics_Presentation.pptx
Digital-Transformation-Roadmap-for-Companies.pptx
Spectral efficient network and resource selection model in 5G networks

Developing and Implementing a QA Plan During Your Legacy Data to S1000D