SlideShare a Scribd company logo
Turning the Page on
Digital Content
David Wilcox (dgi) & Kirsta Stapelfeldt
(Islandora) Open Repositories 2013
Outline
•  Content Models
•  Role of Metadata
•  Preparing Content for Ingest
•  Derivative Creation
•  Display
Content Models
•  Book/Monographs/Journals/Periodicals/
Newspapers
•  Formats: tiff, jpeg, jp2000, pdf(pdf/a)
o  PDF is stored as single continuous object
o  Books and Periodicals stored atomistically
RDF Statements
Book Object
Page Objects
Book Object: Datastreams
RELS-EXT RDF statements connecting book to collection
MODS MODS metadata
DC Dublin Core metadata
TN Display Thumbnail
PDF (Optional) Optional PDF can be generated and stored at the
book level of all pages
Page Object: Datastreams
RELS-EXT RDF statements connecting pages to book and
declaring the order of pages
MODS MODS metadata
DC Dublin Core Metadata
TN Display Thumbnail
OBJ TIFF representing page
JP2 JPEG 2000
JPG Display JPEG (for reader)
OCR Text (generated or uploaded)
HOCR Coordinate data for generated text only
PDF (Optional) PDF for single page can be generated and stored
with the object
Management functions
for book pages
•  Reordering,
deletion,
replacement (of
object or
derivatives)
Approaches to Metadata
•  Default is MODS and DC
•  Ability to add different metadata at Book &
Page level
•  Ability to add encoded text stream (TEI and
HOCR)
o  Syncing issues
o  TEI schema
•  Next: How is content created and managed?
(Interface Tour)
Single Page Ingest
Simple Batch Ingest
Advanced Batch Ingest
Derivative Generation
•  Kakadu > JP2
•  ImageMagick > JPG
•  Ghostscript > PDF
•  Tesseract > OCR/hOCR
Displaying Content: Changes in
Islandora 7
•  Greater generalization
•  Deprecation of the google reader viewer and
IIV
•  Viewers packaged as separate modules
Displaying Content: Changes in
Islandora 7
Turning the Page on Digital Content
Turning the Page on Digital Content
Sample Projects (discoverygarden)
University of Manitoba
http://digitalcollections.lib.umanitoba.ca
CalTech
http://caltech.discoverygarden.ca
Williams College
http://unbound.williams.edu
Sample Projects (UPEI)
The Island Magazine
http://vre2.upei.ca/islandmag
PEI Legislative Documents Online
http://peildo.ca/
Prince Edward Island Magazine
http://vre2.upei.ca/peimagazine/
The Charlottetown Guardian
http://newspapers.vre.upei.ca/
Contact Us
David Wilcox
david@discoverygarden.ca
Kirsta Stapelfeldt
kstapelfeldt@upei.ca

More Related Content

PPTX
Improvement of no sql technology for relational databases v2
PDF
Linked data tooling XML
PPTX
Linked data-tooling-xml
PDF
Indoctrinatr – Open Source PDF generation service
ODP
Open source Java office, day 16: Dataobject
PDF
Building an editable, versionized LOD service for library data
PPTX
Web topic 4 style in html
Improvement of no sql technology for relational databases v2
Linked data tooling XML
Linked data-tooling-xml
Indoctrinatr – Open Source PDF generation service
Open source Java office, day 16: Dataobject
Building an editable, versionized LOD service for library data
Web topic 4 style in html

What's hot (19)

PDF
Hap clojure berlin 2015
PDF
HyperGraphQL
PPTX
Clustering in Data Mining
PPTX
RDFa: an introduction
PPTX
Need for css,introduction to css & basic syntax wt
PPTX
The document object
PDF
Data Integration & Disintegration: Managing SN SciGraph with SHACL and OWL
KEY
The Kasabi Information Marketplace
PPT
Semantic HTML
PPT
NISO Bibliographic Roadmap Meeting Proposal
PDF
Schema Design
PDF
Indexing, searching, and aggregation with redi search and .net
PPTX
Data Science Capstone - Global Economics
PDF
JSON-LD
ODP
Semantic Web introduction
PPT
Documentation With Open Source Tools·(ასლი)
PPT
Documentation With Open Source Tools
PPTX
CHAOS Platform presentation, The Royal Library in Copenhagen.
Hap clojure berlin 2015
HyperGraphQL
Clustering in Data Mining
RDFa: an introduction
Need for css,introduction to css & basic syntax wt
The document object
Data Integration & Disintegration: Managing SN SciGraph with SHACL and OWL
The Kasabi Information Marketplace
Semantic HTML
NISO Bibliographic Roadmap Meeting Proposal
Schema Design
Indexing, searching, and aggregation with redi search and .net
Data Science Capstone - Global Economics
JSON-LD
Semantic Web introduction
Documentation With Open Source Tools·(ასლი)
Documentation With Open Source Tools
CHAOS Platform presentation, The Royal Library in Copenhagen.
Ad

Recently uploaded (20)

PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
A Presentation on Artificial Intelligence
PDF
Getting Started with Data Integration: FME Form 101
PDF
Machine learning based COVID-19 study performance prediction
PDF
Electronic commerce courselecture one. Pdf
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Approach and Philosophy of On baking technology
PPTX
Big Data Technologies - Introduction.pptx
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
cuic standard and advanced reporting.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Encapsulation_ Review paper, used for researhc scholars
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
A Presentation on Artificial Intelligence
Getting Started with Data Integration: FME Form 101
Machine learning based COVID-19 study performance prediction
Electronic commerce courselecture one. Pdf
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Mobile App Security Testing_ A Comprehensive Guide.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Dropbox Q2 2025 Financial Results & Investor Presentation
Digital-Transformation-Roadmap-for-Companies.pptx
Group 1 Presentation -Planning and Decision Making .pptx
Spectral efficient network and resource selection model in 5G networks
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Reach Out and Touch Someone: Haptics and Empathic Computing
Approach and Philosophy of On baking technology
Big Data Technologies - Introduction.pptx
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
cuic standard and advanced reporting.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Ad

Turning the Page on Digital Content