www.dclab.com
Converting Your Legacy Data to S1000D
Naveh Greenberg,
Director, US Defense Development
Data Conversion Laboratory (DCL)
2
Agenda
• A very quick introduction to S1000D conversions
• What the technical headaches are
• Whether to convert or rewrite
• Planning for a good conversion experience
• What the timeline looks like
• Some tools to help
3
The Story with S1000D
• Provides an international solution to standardize on a common
spec across the services
• Was developed from ATA Spec 100 to place greater emphasis on
granularity and interchangeability of information
• Allows for modularization of content with Data Modules (DM),
and for easy re-use in multiple outputs
• Standard and schemas are publicly available
• Applicability is available at a very fine level
• Standard format for external data exchange (Data Dispatch
Notes)
• Process DM to allow interactive processing structures and
Learning DM for training (4.0)
4
What Makes S1000D Conversion Difficult
• S1000D is a conceptual departure from linear information
– and is difficult for many to get used to
• Turns the traditional book into a collection of DMs
• Requires Data Module Codes (DMCs) and business rules.
• DMs can be thought of as interchangeable parts
– to be reassembled in multiple ways
– to be repurposed for multiple outputs
– to be reused across multiple products
• …but your documents probably weren’t designed to do this.
5
Structuring a Book into Data Modules in S1000D
[Diagram: legacy sources (PDF Book, 38784 Book, ATA Book with Pageblocks, Tasks and Subtasks, and an early S1000D Publication) are broken into typed Data Modules (Descriptive, Procedural, IPD, Wiring, Crew, Process, Maintenance, Fault) held in the S1000D Common Source Database, and then reassembled into Publication 1, Publication 2 and Publication 3.]
6
Further Complications in S1000D Conversion
• There are the usual conversion issues
– Accuracy of the transferred text
– Tables
– Math
– Special Characters
• There are also the structuring issues
– Identifying DMs
– Identifying reusable content
– Identifying Applicability
• And the people issues
– Deciding what needs re-authoring
– Getting used to a new “document” paradigm
– Getting rugged individualists to collaborate more
7
Some Examples of Converted Data
Your Legacy Data ≠ Bike Sample
8
Square Peg 1: Procedure Authored as a Table
Issue:
Tasks are done as tables rather than numbered lists.
Automated conversion keeps the tables as tables, and steps
are not tagged as steps. This is a big problem for Information
Mapping users (standard layout for steps).
Step  Action
1     Check the phase sequence in the mains with the phase sequence indicator.
2     Check the direction of rotation of impeller. If a dry installation is made,
      check the direction of rotation through the inlet elbow access cover.
Note: If done consistently, tables can be deconstructed programmatically.
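As a rough illustration of deconstructing such a table programmatically, the sketch below turns (step, action) rows into step elements. It assumes the rows have already been extracted from the source table, and it uses the element names mainProcedure, proceduralStep and para as found in recent S1000D issues; check them against your target issue and business rules. The function name table_rows_to_steps is hypothetical.

```python
import xml.etree.ElementTree as ET

def table_rows_to_steps(rows):
    """Turn (step_number, action_text) table rows into step elements."""
    procedure = ET.Element("mainProcedure")
    # Sort on the numeric step column so output order does not depend on row order.
    for _, action in sorted(rows, key=lambda row: int(row[0])):
        step = ET.SubElement(procedure, "proceduralStep")
        para = ET.SubElement(step, "para")
        para.text = " ".join(action.split())  # collapse line-wrap whitespace
    return procedure

# Rows as they might come out of the two-column "Step / Action" table above.
rows = [
    ("1", "Check the phase sequence in the mains with\nthe phase sequence indicator."),
    ("2", "Check the direction of rotation of impeller."),
]
xml_text = ET.tostring(table_rows_to_steps(rows), encoding="unicode")
print(xml_text)
```

This only works when the table layout is consistent, which is the point of the note above; tables that mix notes, warnings or sub-steps into the Action column would need extra handling.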
9
Square Peg 2: Multiple Steps in a Single Procedure
Replacing an XYZ Module
Use this procedure to replace an XYZ module
Remove XYZ Module
1. Loosen the screws.
2. Disengage the ejectors
3. Pull the module straight out
Insert Replacement XYZ Module
1. Align the module.
2. Insert the module, pressing in firmly
3. Engage the ejectors
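One possible way to find the separate tasks hiding inside a procedure like this is to group the numbered steps under the nearest preceding heading. The sketch below assumes plain-text input in which steps start with "1.", "2." and so on, and every other line is a heading or introductory sentence; the function name split_into_tasks is hypothetical.

```python
import re

STEP = re.compile(r"^\d+\.\s+(.*)$")

def split_into_tasks(lines):
    """Group numbered steps under the nearest preceding non-step line."""
    tasks = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        match = STEP.match(line)
        if match and tasks:
            tasks[-1]["steps"].append(match.group(1))
        elif not match:
            tasks.append({"title": line, "steps": []})
    # Lines with no steps under them (titles, intro sentences) are not task candidates.
    return [task for task in tasks if task["steps"]]

legacy = """Replacing an XYZ Module
Use this procedure to replace an XYZ module
Remove XYZ Module
1. Loosen the screws.
2. Disengage the ejectors
3. Pull the module straight out
Insert Replacement XYZ Module
1. Align the module.
2. Insert the module, pressing in firmly
3. Engage the ejectors""".splitlines()

tasks = split_into_tasks(legacy)
for task in tasks:
    print(task["title"], len(task["steps"]))
```

Whether each resulting task becomes its own Data Module is a business-rule decision; the sketch only surfaces the candidates.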
10
Square Peg 3: Introductory Material That Applies to
Numerous DMs
Example:
1-1 Introduction
The following paragraphs discuss servicing…
Personnel Recommended:
One 1-2
Two 1-3
WARNING
Make sure XYZ module is not connected…..
1-2 Remove XYZ Module
1. Loosen the screws.
2. Disengage the ejectors
1-3 Insert Replacement XYZ Module
1. Align the module.
2. Insert the module, pressing in firmly
11
Square Peg 4: Irrelevant Cross-References
Conversion to S1000D may make some source cross-references irrelevant. For
example, assuming some chapter headings are dropped, a reference to a
chapter is no longer valid.
See Chapter 1, Introduction on page 2
Would be tagged as:
See <dmRef><dmRefIdent><dmCode modelIdentCode=".." systemDiffCode="A" . . .
infoCodeVariant="A" itemLocationCode="D"/></dmRefIdent></dmRef>
NOTE: Hard-keyed page numbers are typically dropped from the cross-reference
string since they are no longer relevant in S1000D. A project-specific
business rule will need to be made as to how to handle these links.
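A minimal sketch of how such a rule might be applied in code, assuming a lookup from legacy chapter numbers to dmCode attributes has already been built (for example from the DMRL). The CHAPTER_TO_DMC table, its values and the function rewrite_xrefs are hypothetical; the attribute names follow the dmCode element shown above.

```python
import re

# Hypothetical lookup built from the project's DMRL: chapter number -> dmCode attributes.
CHAPTER_TO_DMC = {
    "1": {
        "modelIdentCode": "SAMPLESYSTEM", "systemDiffCode": "A",
        "systemCode": "00", "subSystemCode": "0", "subSubSystemCode": "0",
        "assyCode": "00", "disassyCode": "00", "disassyCodeVariant": "A",
        "infoCode": "018", "infoCodeVariant": "A", "itemLocationCode": "D",
    },
}

# Matches strings such as "Chapter 1, Introduction on page 2".
XREF = re.compile(r"Chapter (\d+), [^.\n]*? on page \d+")

def rewrite_xrefs(text):
    """Replace chapter/page references with dmRef markup where a mapping exists."""
    def to_dm_ref(match):
        attrs = CHAPTER_TO_DMC.get(match.group(1))
        if attrs is None:
            return match.group(0)  # leave unmapped references for manual review
        attr_text = " ".join(f'{name}="{value}"' for name, value in attrs.items())
        return f"<dmRef><dmRefIdent><dmCode {attr_text}/></dmRefIdent></dmRef>"
    return XREF.sub(to_dm_ref, text)

converted = rewrite_xrefs("See Chapter 1, Introduction on page 2")
print(converted)
```

The hard-keyed page number disappears with the rest of the matched string, and references with no mapping are left untouched so that a person can decide what to do with them.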
12
Square Peg 5: Writer Creativity in Source Material
<para>1. Clean the Engine.</para>
<step1><para>Clean the Engine.</para></step1>
<seqlist><item>Clean the Engine.</item></seqlist>
<entry>1.</entry><entry>Clean the Engine. </entry>
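The four variants above all carry the same step. A hedged sketch of a normalization pass, assuming each fragment is well-formed XML and that hand-typed numbering looks like "1." at the start of the text; the function normalize_step is hypothetical, and the target element names should be checked against your S1000D issue.

```python
import re
import xml.etree.ElementTree as ET

# Hand-typed numbering such as "1." directly before the step text.
LEADING_NUMBER = re.compile(r"^\d+\.\s*(?=[A-Za-z])")

def normalize_step(fragment):
    """Reduce one legacy step fragment to a single proceduralStep string."""
    root = ET.fromstring(f"<wrapper>{fragment}</wrapper>")
    # Gather all text regardless of the tags used, then tidy the whitespace.
    text = " ".join("".join(root.itertext()).split())
    text = LEADING_NUMBER.sub("", text)
    step = ET.Element("proceduralStep")
    ET.SubElement(step, "para").text = text
    return ET.tostring(step, encoding="unicode")

variants = [
    "<para>1. Clean the Engine.</para>",
    "<step1><para>Clean the Engine.</para></step1>",
    "<seqlist><item>Clean the Engine.</item></seqlist>",
    "<entry>1.</entry><entry>Clean the Engine. </entry>",
]
results = [normalize_step(fragment) for fragment in variants]
print(set(results))
```

In practice each variant still has to be recognized as a step in the first place, which is where inconsistent source markup makes automated conversion expensive.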
14
So… Maybe You Shouldn’t Bother Converting Your Content?
• It seems like such a pain.
• Too many rules.
• There is always a need for some rewriting - few writers
have the clairvoyance to author content with the intent
that it be converted in the future – so you might as well
rewrite it all.
• My writers aren’t very busy right now anyway.
• It’s more fun and seems like less trouble to author anew.
15
Think Again….
• If you rewrite, you still need to meet the standard’s
requirements.
• Still need some XML knowledge.
• Can take a very long time. For example, a 5,000-page project:
• 15 minutes per page to rewrite.
• 75,000 minutes => 1250 hours
• At 7 hours per day (no breaks) => 179 days
• But worse ...
• Inconsistencies.
• Writer creativity.
• IETM/CMS may not function properly
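The rewrite-time estimate above can be reproduced with a few lines of arithmetic. The variable names are illustrative; the figures are the ones on the slide.

```python
import math

pages = 5000
minutes_per_page = 15
hours_per_day = 7

total_minutes = pages * minutes_per_page               # 75,000 minutes
total_hours = total_minutes / 60                       # 1,250 hours
working_days = math.ceil(total_hours / hours_per_day)  # partial day rounded up
print(total_minutes, total_hours, working_days)
```

1,250 hours at 7 hours per day comes to about 178.6, which rounds up to the 179 days quoted, and that is for a single writer with no breaks or review time.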
16
In Reality… Converting Your Content is Worth the Bother
• Throwing it out and starting over is an expensive option
– Rewriting at $25-$45+/page vs. converting to S1000D at $6-$12/page
– The hidden costs of redoing index entries, links and other features
you’ve built in
– The hidden cost of reviewing and recertifying it all
• It’s usually easier to use what you have as a base, and convert over
– Needs planning
– Needs time
• Planning for a good conversion experience
– Which content will you need?
– Which content is worth converting?
– Which content is suitable for re-use in multiple places?
– What tools are available?
– How to specify the conversion to get it right?
– When do you start all this planning?
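To put the per-page rates above into project terms, here is a small sketch that applies them to a 5,000-page project. The project size is an assumption borrowed from the rewrite-time example, and the "$45+" upper rate is open-ended, so the rewrite figure is a floor for the high end rather than a cap.

```python
pages = 5000  # assumption: same project size as the rewrite-time example

def cost_range(low_per_page, high_per_page):
    """Return the (low, high) project cost in dollars for a per-page rate range."""
    return pages * low_per_page, pages * high_per_page

rewrite = cost_range(25, 45)  # rewriting at $25-$45+/page
convert = cost_range(6, 12)   # converting at $6-$12/page
print(rewrite, convert)
```

On these assumptions rewriting lands at $125,000 to $225,000 or more, against $30,000 to $60,000 for conversion, before the hidden costs listed above.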
17
Most Importantly – Plan!!!
• Ask the important initial questions
• Who are the stakeholders? Who is the final client/user?
• What is the estimated volume and deadline?
• What version of S1000D?
• Do we know what CMS or rendering tools will be used?
• Source format. Not all source data are created equal.
• Budget?
• Ask around or join discussion groups.
• Prepare for the next step
• Get your hands on the source data, business rules, and schemas.
• Begin looking for the right people. You don’t need to be S1000D-savvy, but
you do need, at a minimum, to understand the concept.
“If I had eight hours to chop down a tree, I'd spend six sharpening my ax.”
18
What to Convert, and in What Order
• Categorizing
– Active documents in good shape
– Active documents that need a lot of work
– Somewhat inactive documents that will likely be retired
– Archival materials
• Prioritizing
– Documents that are most used
– Documents that are customer favorites
– Documents with longest product life
– Start with most recent documents and go back
• Identifying the process
– Can be converted as is
– Can be converted with some work
– Needs to be rewritten
– Don’t convert – just keep archival copies
19
The Classic Conversion Process
• Conversion of legacy data is usually an afterthought – a lot of
scampering after the system is up and running
• 3 years has been typical for DITA implementations, with another 1-2
years to really get it going; S1000D is longer
[Timeline diagram with the stages: Start, Pilot, Select CSDB, Implementation, Training, Conversion, Post-Conversion Cleanup, Live]
20
The Classic Conversion Process
Doing Cleanup and Implementation in Parallel Tracks:
[Timeline diagram showing two versions of the process. One shows Start, Pilot, Select CSDB, Implementation, Training, Conversion, Post-Conversion Cleanup, Live. The other adds Initial Training, Pre-Conversion Cleanup and Refresher Training, with conversion work running in parallel with Pilot, Select CSDB and Implementation.]
21
Some Tools for a Good Conversion Experience
• Conversion specifications – a formal process to specify the
conversion so that all stakeholders can agree on a proper process
• DMRL (Data Module Requirements List) - to verify that Data
Modules have been properly identified
• Content reuse identification – software to locate potentially
reusable content
• Best practice scanners – to scan code to verify best practices have
been followed
• Test conversions into S1000D – to make sure the process works all
the way through
22
The Conversion Specification
23
Data Module Requirements List (DMRL)
Para No     Original Title                                       DMC                                 Type  Modelic
            FOREWORD                                             SAMPLESYSTEM-A-00-00-00-00A-018A-D  Desc  SAMPLESYSTEM
1           INTRODUCTION.                                        -------
2           MANUAL ORGANIZATION.                                 -------
CHAPTER 3   PREPARATION FOR USE AND RESHIPMENT                   NA
SECTION I   PREPARATION FOR USE                                  SAMPLESYSTEM-A-00-00-00-00A-006A-A  Desc  SAMPLESYSTEM
3.1         INTRODUCTION.                                        -------
3.1.1       Nomenclature Setup and Initialization Instructions.  SAMPLESYSTEM-A-00-00-00-00A-007A-A  proc  SAMPLESYSTEM
SECTION II  PREPARATION FOR RESHIPMENT                           SAMPLESYSTEM-A-00-00-00-00A-530A-A  Desc  SAMPLESYSTEM
3.2         INTRODUCTION.                                        -------
3.3         CONDITIONS AND METHODS OF RESHIPMENT.                -------
CHAPTER 8   ILLUSTRATED PARTS BREAKDOWN                          SAMPLESYSTEM-A-00-00-00-00A-018B-D  Desc  SAMPLESYSTEM
SECTION I   INTRODUCTION                                         NA
8.1         INTRODUCTION.                                        -------
8.2         MAINTENANCE PARTS LIST.                              -------
SECTION II  MAINTENANCE PARTS LIST                               SAMPLESYSTEM-A-00-00-00-00A-075A-D  IPB   SAMPLESYSTEM
SECTION III NUMERICAL INDEX                                      N/A
SECTION IV  REFERENCE DESIGNATOR INDEX                           N/A
            GLOSSARY                                             SAMPLESYSTEM-A-00-00-00-00A-006A-D  Desc  SAMPLESYSTEM
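To show what the DMC column encodes, the sketch below splits a hyphenated DMC string like those in the table into named parts. It assumes the eight-group layout seen in this sample and the dmCode attribute names used in recent S1000D issues; field lengths can vary by project, so treat the slicing as illustrative. The function parse_dmc is hypothetical.

```python
def parse_dmc(dmc):
    """Split a hyphenated Data Module Code into its named components."""
    parts = dmc.split("-")
    if len(parts) != 8:
        raise ValueError(f"expected 8 hyphen-separated groups, got {len(parts)}")
    model, diff, system, sub, assy, disassy, info, location = parts
    return {
        "modelIdentCode": model,
        "systemDiffCode": diff,
        "systemCode": system,
        "subSystemCode": sub[0],        # first character of the group
        "subSubSystemCode": sub[1:],    # remainder of the group
        "assyCode": assy,
        "disassyCode": disassy[:2],
        "disassyCodeVariant": disassy[2:],
        "infoCode": info[:3],
        "infoCodeVariant": info[3:],
        "itemLocationCode": location,
    }

foreword = parse_dmc("SAMPLESYSTEM-A-00-00-00-00A-018A-D")
print(foreword["infoCode"], foreword["infoCodeVariant"], foreword["itemLocationCode"])
```

A check like this can be run over every DMRL row that has a DMC, which helps catch malformed codes before conversion starts; rows marked "-------", NA or N/A carry no code and would be skipped.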
24
Divide and Conquer for Effective Content Reuse
• Content reuse was frequently cited as a reason for moving to S1000D
• However, few had implemented it yet; it was just too daunting a task
• Divide and conquer the identification of reusable content
• Use tools that help to identify reusable content
• Create an iterative process to manage content reuse
25
Content Reuse Analysis Reports
• Finding exact or similar text will help you when mapping to Data Modules
• It will also help to detect applicability and inconsistencies
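A minimal sketch of the kind of comparison behind such a report, using Python's standard difflib to score paragraph pairs. This is a stand-in technique, not the tool shown in the presentation; the paragraph identifiers, the 0.85 threshold and the function find_reuse_candidates are all assumptions.

```python
from difflib import SequenceMatcher
from itertools import combinations

def find_reuse_candidates(paragraphs, threshold=0.85):
    """Return (id_a, id_b, ratio) for paragraph pairs at or above the threshold."""
    def normalize(text):
        # Ignore case and spacing differences so trivial variations still match.
        return " ".join(text.lower().split())

    candidates = []
    for (id_a, text_a), (id_b, text_b) in combinations(paragraphs.items(), 2):
        ratio = SequenceMatcher(None, normalize(text_a), normalize(text_b)).ratio()
        if ratio >= threshold:
            candidates.append((id_a, id_b, round(ratio, 2)))
    return candidates

# Hypothetical paragraph identifiers mapped to their legacy text.
paragraphs = {
    "1-2 step 1": "Loosen the screws.",
    "4-7 step 1": "Loosen the  screws.",
    "1-3 step 1": "Align the module.",
}
candidates = find_reuse_candidates(paragraphs)
print(candidates)
```

Comparing every pair grows quadratically with the number of paragraphs, so a real legacy set would need indexing or hashing first; near-matches below 1.0 are the ones worth a human look, since they may signal applicability differences or plain inconsistency.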
26
Summary
• Conversion of legacy data to S1000D is not trivial
– But it is significantly faster, safer and less expensive than rewriting
– There are special S1000D considerations to be taken into account
– Needs planning
• Starting the conversion process earlier makes life easier
– Shifts the critical path; speeds the process
– More of the work can be done by the content owners
– Eases the training and change acceptance burdens
• Content re-use also needs planning
– Starting early allows time for a more thorough review
– Setting up collaborative teams sets the tone and allows one to
“divide and conquer”
27
Q&A
Naveh Greenberg
Director, US Defense Development
Data Conversion Laboratory (DCL)
(718) 307-5758
ngreenberg@dclab.com
@dclaboratory
