SlideShare a Scribd company logo
The Science of Information




Digital Reformatting
Transforming physical content for
preservation, access and reuse


Infogrid Pacific Pte. Ltd.

           © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners    1
The Science of Information

Production Objectives
–Digitize   once, use forever
–Automate     long term ownership of valuable content
–Control    off-line and on-line access
–Standards     driven not proprietary
–Contribute     to current and future information strategies




              © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners    2
The Science of Information

 The Production Steps
          Input             Establish                                         Physical                                   Define
   Information                Client                                          Material                                  Required
      Client                Objectives                                        Analysis                                  Metadata
    Objectives

 Planning and               Create                                           Define                                      Define
       Testing            Production                                       Production                                    Quality
     Client              Specifications                                    Processes                                     Control
  Objectives


Content Output                                                               Enforce                                     Create
                             Execute
   Production               Production
                                                                             Quality                                     Delivery
                                                                             Control                                    Packages


              © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners               3
The Science of Information

Metadata
                                                    OAI                                               METS (LOC)
                                                                                                      Metadata for Encoding and
  Dublin Core (ISO)                                 Open Access Initiative                            Transmission Standard
  Descriptive Properties
  Title:                                            Search Engine Reg.                                Technical:
  Author:                                            -Google                                          DigiProv:
  Publisher:                                         -Yahoo!                                          Rights:
  Rights:                                            -etc.                                            Source:
  Language:                                         Interchange Protocol                              FileSpec:
                                                                                                      StructMap:
  Etc.


                                                                                                                                 MIX (LOC)
                                                                                                                                 Metadata for Images in XML

                                                                                                                                 Format:
                                                                                                                                 Size:
                                                                                                                                 Bitdepth:
                                                                                                                                 Checksum:
                                                                                                                                 Scanning:


                                Custom                                             TextML (LOC)
                                Examples...                                        Text markup Language

                                Age Level:                                         Encoding:
                                Subject:                                           XML-Lang:
                                Course:                                            Chars:
                                Year:                                              Words:
                                Restrictions:                                      Para:
                                Document Genre:

                       © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners                                4
The Science of Information

XHTML - The Best Encoding
XHTML delivers it all
–   Handle content complexity
–   Support metadata
–   Simple in production, flexible in use
–   Output Online ready
–   Widest support
–   Lowest cost of ownership



             © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners    5
The Science of Information




The Production Deliverables




       © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners    6
Production Output                                                                                  The Science of Information

From this.................                                To this...............................
                                                                               Master Scan Image                               Master Content
                                                                               Set (One per page)                              XHTML and hi-res
                                                                                                                               extracted images




                                                        Print Res Cover Set (Front, Back, Spine)
                                                                                                                                Online Optimized
                                                                                                                                XHTML plus images
                                                                                                                                processed for online




                                                                                Print on Demand                                 Metadata – Descriptive
                                                                                ready PDF (All pages                            provenance and QC
                                                                                in one PDF)




                                                                                                              METS Package for digital preservation
                                                                               Online optimized
                                                                               page image set (one
                                                                               per page)

                 © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners                        7
What does this give?                                                                                          The Science of Information


                 Master Scan Image                                 Master Content
                 Set (One per page)                                XHTML and hi-res
                                                                   extracted images




Print Res Cover Set (Front, Back, Spine)
                                                                    Online Optimized
                                                                    XHTML plus images
                                                                    processed for online




                  Print on Demand                                   Metadata – Descriptive
                  ready PDF (All pages                              provenance and QC
                  in one PDF)




                                           METS Package for digital preservation
                 Online optimized
                 page image set (one
                 per page)

                                © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners    8
Online Publishing Set                                                                                          The Science of Information


                 Master Scan Image
                 Set (One per page)
                                                                 Master Content
                                                                 XHTML and hi-res
                                                                                                •      Online ready paginated or
                                                                 extracted images
                                                                                                       continuous document view
                                                                                                •
                                                                                                       With IGP:InfoViewer
Print Res Cover Set (Front, Back, Spine)
                                                                  Online Optimized
                                                                                                              Instantly create online
                                                                  XHTML plus images
                                                                  processed for online
                                                                                                               catalogue and page views
                                                                                                              Metadata search and full
                                                                                                               text search
                  Print on Demand                                 Metadata – Descriptive
                                                                                                              Customize and standardize
                  ready PDF (All pages
                  in one PDF)
                                                                  provenance and QC
                                                                                                               the look and feel of
                                                                                                               documents


                                         METS Package for digital preservation
                Online optimized
                page image set (one
                per page)

                                 © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners    9
Print on Demand Set                                                                                            The Science of Information


                 Master Scan Image
                 Set (One per page)
                                                                 Master Content
                                                                 XHTML and hi-res
                                                                                                   •     Create new print copies as
                                                                 extracted images
                                                                                                         and when required
                                                                                                   •
                                                                                                         Bitonal, grayscale and
Print Res Cover Set (Front, Back, Spine)                                                                 colour pages all handled in
                                                                  Online Optimized
                                                                  XHTML plus images                      the package
                                                                  processed for online
                                                                                                   •
                                                                                                         Reprinting, trim and
                                                                                                         imposition information
                                                                                                         included
                  Print on Demand                                 Metadata – Descriptive
                  ready PDF (All pages                            provenance and QC
                  in one PDF)




                                         METS Package for digital preservation
                 Online optimized
                 page image set (one
                 per page)

                                 © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners    10
Variable Publishing                                                                                             The Science of Information

    & Reuse
                  Master Scan Image
                  Set (One per page)
                                                                      Master Content
                                                                      XHTML and hi-res
                                                                      extracted images
                                                                                                         Make sure your content
                                                                                                         is ready for new
                                                                                                         information strategies
Print Res Cover Set (Front, Back, Spine)
                                                                                                         when required
                                                                       Online Optimized
                                                                       XHTML plus images
                                                                       processed for online              Variable publishing –
                                                                                                         use selected portions
                                                                                                         Revisions and updates
                  Print on Demand
                  ready PDF (All pages
                  in one PDF)
                                                                       Metadata – Descriptive
                                                                       provenance and QC                 – keep track of source
                                                                                                         and versions
                                                                                                         Partial and split
                      Online
                                               METS Package for digital preservation                     document distribution
                      optimized
                      page image
                      set (one per
                      page)      © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners     11
Digital Preservation                                                                                          The Science of Information


                Master Scan Image
                Set (One per page)
                                                               Master Content
                                                               XHTML and hi-res
                                                                                              •      Create output for interntl.
                                                               extracted images
                                                                                                     archive standards
                                                                                              •
                                                                                                     Ensure the survivability of
                                                                                                     the data
Print Res Cover Set (Front, Back, Spine)
                                                                Online Optimized
                                                                XHTML plus images
                                                                processed for online
                                                                                              •
                                                                                                     With IGP:Repository
                                                                                                            Seamless and auto data
                                                                                                             ingestion from IGP:Content
                Print on Demand                                 Metadata – Descriptive
                                                                                                             Workbench
                ready PDF (All pages                            provenance and QC
                in one PDF)                                                                                 Easy data ingestion from
                                                                                                             other sources
                                                                                                            Integrated distribution with
                                       METS Package for digital preservation                                 IGP:InfoViewer
                Online optimized
                page image set (one
                per page)

                               © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners    12
The Science of Information



And now for something special




       © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners    13
Special Processes                                                                                   The Science of Information
From this.................                                   To this...............................
                                                                                                                               Substitute online/
                                                                                                         Master Archive
Oversize Pages                                                                                           Image (per special
                                                                                                                               book alias page
                                                                                                         process page)




                                                                                                                                            Online PDF
                                                                                                                                            (per special
                                                                                                                                            process
                                                            Print Ready PDF                                                                 page)
                                                            (per special process
                                                            page)

 Foldouts                                                  Master Archive Image (per special process page)




                                                           Print Ready PDF (per special process page)




                                                           Online PDF (per special process page)
                                                                                                                              Substitute online/
                                                                                                                              book alias page



                    © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners                         14
Special Processes                                                                                    The Science of Information
From this.................                                    To this...............................

Overlays                                           The original image                                                    Combined for a new digital
                                                                                                                         representation




                                                                                                                          And all images are available
                                                                                                                          for independent re-use
                                                                          + The original overlay


Illustrated, Text and Art Books




                                                                           Camera Scan                                            High resolution images
                                                                           archive image                                          extracted for further use
                                                                           and cleaned up                                         as independent media
                                                                           presentation                                           items.
                                                                           image                                                                         15
                     © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners
Special Processes                                                                                   The Science of Information
From this.................                                   To this...............................

                                          The preservation archive camera Image
Manuscript Documents
                                                                                                                              Online and search
                                                                                                                              access thumbnail




                                                                  Metadata – Descriptive
                                                                  provenance and QC

                                                                                                 High resolution research
                                                                                                 image




                                         METS Package for digital preservation




                    © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners                   16
The Science of Information

Summary
 •
     Multiple digitization strategies
 •   IGP:Production Solutions does it all
 •   Extract more value from the
     digitization dollar
 •   Seamlessly integrate with:
     –   controlled online distribution (IGP:InfoViewer 2)
     –   standardized digital preservation (IGP:Repository 2)

               © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners    17
The Science of Information



Thank you

For more information visit:
www.infogridpacific.com
or contact us at:
sales@infogridpacific.com


         © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners    18

More Related Content

KEY
Advanced modelling made simple with the Gmodel metalanguage
PDF
Producing, Publishing and Consuming Linked Data Three lessons from the Bio2RD...
PPTX
CMIS and Interoperability - AIIM 2009
PPTX
EDF2013: Invited Talk Bastiaan Deblieck: Who remembers EDP?
PDF
B vb script11
PDF
ACE Logo
PDF
Pal gov.tutorial2.session0.outline
PDF
"Ontology-centric navigation of the scientific literature"
Advanced modelling made simple with the Gmodel metalanguage
Producing, Publishing and Consuming Linked Data Three lessons from the Bio2RD...
CMIS and Interoperability - AIIM 2009
EDF2013: Invited Talk Bastiaan Deblieck: Who remembers EDP?
B vb script11
ACE Logo
Pal gov.tutorial2.session0.outline
"Ontology-centric navigation of the scientific literature"

Viewers also liked (6)

PDF
Azardi:Content Fulfilment Introduction
PDF
The Requirement for ECMS
PPTX
Information storage and retrieval
PPT
Database management system presentation
PDF
Study: The Future of VR, AR and Self-Driving Cars
PDF
Hype vs. Reality: The AI Explainer
Azardi:Content Fulfilment Introduction
The Requirement for ECMS
Information storage and retrieval
Database management system presentation
Study: The Future of VR, AR and Self-Driving Cars
Hype vs. Reality: The AI Explainer
Ad

Similar to IGP Production Systems For Digital Archives (20)

PDF
Cebit-2008: Content Aggregation
PDF
"Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found...
PPSX
SeCold - A Linked Data Platform for Mining Software Repositories
PPT
Computing for Human Experience and Wellness
PPTX
10052012 luc vervenne synergetics van syntax portfolio naar semantische uitwi...
PDF
Technologies For Appraising and Managing Electronic Records
PDF
PBCore: Overview
PDF
MoDisco Eclipse-OMG Symp 2010
PDF
Introduction of file based workflows 111004 vfinal
PDF
CCNxCon2012: Session 5: CCN support for Information-Centric Opportunistic Net...
PDF
Jena based implementation of a iso 11179 meta data registry
PPTX
Keyword Services Platform (KSP) from Microsoft adCenter
PPT
Next Generation Localization
PPTX
From file-based production to real-time co-production
PDF
fiwalk With Me: Building Emergent Pre-Ingest Workflows for Digital Archival R...
PDF
Web 2.0 And The End Of DITA
PDF
Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications
PDF
Measure Data Quality
PDF
Jean-Marc Lazard d'Exalead - Pioneering hypermedia - SEO Campus 2011
PDF
Switch to Alfresco with Seed in Australia and New Zealand
Cebit-2008: Content Aggregation
"Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found...
SeCold - A Linked Data Platform for Mining Software Repositories
Computing for Human Experience and Wellness
10052012 luc vervenne synergetics van syntax portfolio naar semantische uitwi...
Technologies For Appraising and Managing Electronic Records
PBCore: Overview
MoDisco Eclipse-OMG Symp 2010
Introduction of file based workflows 111004 vfinal
CCNxCon2012: Session 5: CCN support for Information-Centric Opportunistic Net...
Jena based implementation of a iso 11179 meta data registry
Keyword Services Platform (KSP) from Microsoft adCenter
Next Generation Localization
From file-based production to real-time co-production
fiwalk With Me: Building Emergent Pre-Ingest Workflows for Digital Archival R...
Web 2.0 And The End Of DITA
Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications
Measure Data Quality
Jean-Marc Lazard d'Exalead - Pioneering hypermedia - SEO Campus 2011
Switch to Alfresco with Seed in Australia and New Zealand
Ad

Recently uploaded (20)

PDF
Hazard Identification & Risk Assessment .pdf
PPTX
A powerpoint presentation on the Revised K-10 Science Shaping Paper
PDF
LDMMIA Reiki Yoga Finals Review Spring Summer
PDF
A systematic review of self-coping strategies used by university students to ...
PDF
Classroom Observation Tools for Teachers
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
PPTX
Cell Types and Its function , kingdom of life
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
Empowerment Technology for Senior High School Guide
PPTX
History, Philosophy and sociology of education (1).pptx
PDF
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
PDF
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
PDF
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
PPTX
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
PPTX
Introduction to Building Materials
PDF
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
PDF
Chinmaya Tiranga quiz Grand Finale.pdf
PPTX
Orientation - ARALprogram of Deped to the Parents.pptx
PDF
Indian roads congress 037 - 2012 Flexible pavement
Hazard Identification & Risk Assessment .pdf
A powerpoint presentation on the Revised K-10 Science Shaping Paper
LDMMIA Reiki Yoga Finals Review Spring Summer
A systematic review of self-coping strategies used by university students to ...
Classroom Observation Tools for Teachers
Final Presentation General Medicine 03-08-2024.pptx
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
Cell Types and Its function , kingdom of life
Supply Chain Operations Speaking Notes -ICLT Program
Empowerment Technology for Senior High School Guide
History, Philosophy and sociology of education (1).pptx
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
Introduction to Building Materials
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
Chinmaya Tiranga quiz Grand Finale.pdf
Orientation - ARALprogram of Deped to the Parents.pptx
Indian roads congress 037 - 2012 Flexible pavement

IGP Production Systems For Digital Archives

  • 1. The Science of Information Digital Reformatting Transforming physical content for preservation, access and reuse Infogrid Pacific Pte. Ltd. © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners 1
  • 2. The Science of Information Production Objectives –Digitize once, use forever –Automate long term ownership of valuable content –Control off-line and on-line access –Standards driven not proprietary –Contribute to current and future information strategies © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners 2
  • 3. The Science of Information The Production Steps Input Establish Physical Define Information Client Material Required Client Objectives Analysis Metadata Objectives Planning and Create Define Define Testing Production Production Quality Client Specifications Processes Control Objectives Content Output Enforce Create Execute Production Production Quality Delivery Control Packages © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners 3
  • 4. The Science of Information Metadata OAI METS (LOC) Metadata for Encoding and Dublin Core (ISO) Open Access Initiative Transmission Standard Descriptive Properties Title: Search Engine Reg. Technical: Author: -Google DigiProv: Publisher: -Yahoo! Rights: Rights: -etc. Source: Language: Interchange Protocol FileSpec: StructMap: Etc. MIX (LOC) Metadata for Images in XML Format: Size: Bitdepth: Checksum: Scanning: Custom TextML (LOC) Examples... Text markup Language Age Level: Encoding: Subject: XML-Lang: Course: Chars: Year: Words: Restrictions: Para: Document Genre: © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners 4
  • 5. The Science of Information XHTML - The Best Encoding XHTML delivers it all – Handle content complexity – Support metadata – Simple in production, flexible in use – Output Online ready – Widest support – Lowest cost of ownership © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners 5
  • 6. The Science of Information The Production Deliverables © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners 6
  • 7. Production Output The Science of Information From this................. To this............................... Master Scan Image Master Content Set (One per page) XHTML and hi-res extracted images Print Res Cover Set (Front, Back, Spine) Online Optimized XHTML plus images processed for online Print on Demand Metadata – Descriptive ready PDF (All pages provenance and QC in one PDF) METS Package for digital preservation Online optimized page image set (one per page) © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners 7
  • 8. What does this give? The Science of Information Master Scan Image Master Content Set (One per page) XHTML and hi-res extracted images Print Res Cover Set (Front, Back, Spine) Online Optimized XHTML plus images processed for online Print on Demand Metadata – Descriptive ready PDF (All pages provenance and QC in one PDF) METS Package for digital preservation Online optimized page image set (one per page) © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners 8
  • 9. Online Publishing Set The Science of Information Master Scan Image Set (One per page) Master Content XHTML and hi-res • Online ready paginated or extracted images continuous document view • With IGP:InfoViewer Print Res Cover Set (Front, Back, Spine) Online Optimized  Instantly create online XHTML plus images processed for online catalogue and page views  Metadata search and full text search Print on Demand Metadata – Descriptive  Customize and standardize ready PDF (All pages in one PDF) provenance and QC the look and feel of documents METS Package for digital preservation Online optimized page image set (one per page) © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners 9
  • 10. Print on Demand Set The Science of Information Master Scan Image Set (One per page) Master Content XHTML and hi-res • Create new print copies as extracted images and when required • Bitonal, grayscale and Print Res Cover Set (Front, Back, Spine) colour pages all handled in Online Optimized XHTML plus images the package processed for online • Reprinting, trim and imposition information included Print on Demand Metadata – Descriptive ready PDF (All pages provenance and QC in one PDF) METS Package for digital preservation Online optimized page image set (one per page) © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners 10
  • 11. Variable Publishing The Science of Information & Reuse Master Scan Image Set (One per page) Master Content XHTML and hi-res extracted images Make sure your content is ready for new information strategies Print Res Cover Set (Front, Back, Spine) when required Online Optimized XHTML plus images processed for online Variable publishing – use selected portions Revisions and updates Print on Demand ready PDF (All pages in one PDF) Metadata – Descriptive provenance and QC – keep track of source and versions Partial and split Online METS Package for digital preservation document distribution optimized page image set (one per page) © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners 11
  • 12. Digital Preservation The Science of Information Master Scan Image Set (One per page) Master Content XHTML and hi-res • Create output for interntl. extracted images archive standards • Ensure the survivability of the data Print Res Cover Set (Front, Back, Spine) Online Optimized XHTML plus images processed for online • With IGP:Repository  Seamless and auto data ingestion from IGP:Content Print on Demand Metadata – Descriptive Workbench ready PDF (All pages provenance and QC in one PDF)  Easy data ingestion from other sources  Integrated distribution with METS Package for digital preservation IGP:InfoViewer Online optimized page image set (one per page) © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners 12
  • 13. The Science of Information And now for something special © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners 13
  • 14. Special Processes The Science of Information From this................. To this............................... Substitute online/ Master Archive Oversize Pages Image (per special book alias page process page) Online PDF (per special process Print Ready PDF page) (per special process page) Foldouts Master Archive Image (per special process page) Print Ready PDF (per special process page) Online PDF (per special process page) Substitute online/ book alias page © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners 14
  • 15. Special Processes The Science of Information From this................. To this............................... Overlays The original image Combined for a new digital representation And all images are available for independent re-use + The original overlay Illustrated, Text and Art Books Camera Scan High resolution images archive image extracted for further use and cleaned up as independent media presentation items. image 15 © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners
  • 16. Special Processes The Science of Information From this................. To this............................... The preservation archive camera Image Manuscript Documents Online and search access thumbnail Metadata – Descriptive provenance and QC High resolution research image METS Package for digital preservation © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners 16
  • 17. The Science of Information Summary • Multiple digitization strategies • IGP:Production Solutions does it all • Extract more value from the digitization dollar • Seamlessly integrate with: – controlled online distribution (IGP:InfoViewer 2) – standardized digital preservation (IGP:Repository 2) © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners 17
  • 18. The Science of Information Thank you For more information visit: www.infogridpacific.com or contact us at: sales@infogridpacific.com © 2004-9 Infogrid Pacific. All trademarks and copyrights remain the property of their respective owners 18