SlideShare a Scribd company logo
Using Schematron for Analyzing
Conformance to Best Practices
for EAD, TEI, and MODS
(and some other thoughts on
workflow tools)
Jenn Riley
Metadata Librarian
Indiana University Digital Library Program
Consistency is a challenge
Document-centric XML (TEI, EAD) is
very difficult to create consistently
 Some common tools to help:


◦
◦
◦
◦
◦
◦


Schema/DTD validation
Tag libraries
XML templates
Example documents
Keyboard macros
Detailed encoding guidelines

These are not enough!
7/11/09

ALA 2009 - ALCTS NRMIG

2
Another possible tool layer
Machine validation of a file against
local encoding guidelines
 Can only go so far, but that far is
extremely helpful
 Indiana University implemented using:


◦ Schematron (http://www.schematron.com)
◦ <oXygen />plugin architecture

7/11/09

ALA 2009 - ALCTS NRMIG

3
Inspiration: RLG EAD Report
Card

7/11/09

ALA 2009 - ALCTS NRMIG

4
7/11/09

ALA 2009 - ALCTS NRMIG

5
7/11/09

ALA 2009 - ALCTS NRMIG

6
7/11/09

ALA 2009 - ALCTS NRMIG

7
7/11/09

ALA 2009 - ALCTS NRMIG

8
7/11/09

ALA 2009 - ALCTS NRMIG

9
7/11/09

ALA 2009 - ALCTS NRMIG

10
7/11/09

ALA 2009 - ALCTS NRMIG

11
More info on Schematron
ISO/IEC 19757 - Document Schema
Definition Languages (DSDL) - Part 3:
Rule-based validation – Schematron.
 Be careful! http://www.schematron.com has
the primary specs; http://schematron.com is
for a particular company’s tool using them.
(Weird.)
 This is the page you want:


7/11/09

ALA 2009 - ALCTS NRMIG

12
Using a Schematron file


Schematron home page provides two
distributions:

◦ One for XSLT 1.0 processors and one for 2.0
processors
◦ Each includes a set of three stylesheets to be
used in turn on the Schematron file
◦ Result of this processing is a stylesheet to be
run on your XML instance document

IU implementation wraps this all up into
an <oXygen />plugin written in Java
 You could also pipe them together with a
shell script, a Windows .bat file, etc……


7/11/09

ALA 2009 - ALCTS NRMIG

13
7/11/09

ALA 2009 - ALCTS NRMIG

14
7/11/09

ALA 2009 - ALCTS NRMIG

15
7/11/09

ALA 2009 - ALCTS NRMIG

16
Let’s step back


How can better tools revolutionize
metadata creation workflows?
◦ Promoting consistency
 This is hard and not something that humans
are generally good at

◦ True interoperability between systems
 Without futzing!


We spend too much valuable human time
doing repetitive and low-value tasks as part
of descriptive workflows
7/11/09

ALA 2009 - ALCTS NRMIG

17
Were do we go from here?


Make better use of available
technologies
◦ Automating
◦ Streamlining
◦ Validating



We can and must do our jobs better
and more efficiently, with the help of
better tools
◦ Providing comparable services with less
◦ Creating a convincing argument for more?
7/11/09

ALA 2009 - ALCTS NRMIG

18
There is no excuse for not
having usable metadata creation
tools.


Smart systems are possible and
necessary
◦ Configurable
◦ Modular
◦ Connected



Make it easy to do it well
◦ Consistent
◦ Complete
◦ Efficient




Make it hard to do it poorly
We must pay attention to user interface
design for cataloging tools
7/11/09

ALA 2009 - ALCTS NRMIG

19
OK, rant over. Thank you!


jenlrile@indiana.edu
◦ (watch out for the invisible “l” in the
middle)



Slides and handout:
◦ On ALA presentations Wiki
<http://presentations.ala.org>
◦ On my home page
<http://www.dlib.indiana.edu/~jenlrile/pres
entations/nrmig2009/>
7/11/09

ALA 2009 - ALCTS NRMIG

20

More Related Content

PDF
Reactive programming
PDF
Reactive programming - Observable
PDF
Intro to React
PDF
Microservices with spring boot
PPTX
ThingMonk 2016 - Concursus Event sourcing for the IOT By Tareq Abedrabbo & Do...
PPTX
Ajax Backbone
PPTX
Mest
PDF
Putting Open Access into Practice
Reactive programming
Reactive programming - Observable
Intro to React
Microservices with spring boot
ThingMonk 2016 - Concursus Event sourcing for the IOT By Tareq Abedrabbo & Do...
Ajax Backbone
Mest
Putting Open Access into Practice

Similar to Using Schematron for Analyzing Conformance to Best Practices for EAD, TEI, and MODS (and some other thoughts on workflow tools) (20)

PPT
Establishing Metadata Practices
PPT
The XML Submission Tool: A System for Managing Text Collections at Indiana Un...
PDF
Webinar slides: Interoperability between resources involved in TDM at the lev...
PDF
2011-Balisage-Poster-Schwarzman
PPT
Sasa Nesic - PhD Dissertation Defense
PDF
2012.11 - ISWC 2012 - DC - 2
PPT
eXtensible Catalog - afternoon session - Tilburg
PPT
Cutting Through the Hype about Metadata
PPT
Getting a Handle on Alphabet Soup: Implementing Diverse Content Standards and...
PDF
Artificial Intelligence for XMLDevelopment
PDF
TEI ODD based development
PPT
OCLC Research @ U of Calgary: New directions for metadata workflows across li...
PPT
Metadata Workshop - Utrecht - November 5, 2008
PPT
A Theoretical Survey of User Interface Description Languages: Preliminary Res...
PDF
Ji cv6n1
PPTX
LIS688_Group1
ODP
Xml Overview
PPTX
2012.11 - ISWC 2012 - DC - 1
DOCX
Course syllabus metadata systems for warsaw
PPT
Metadata practice and direction: a community perspective
Establishing Metadata Practices
The XML Submission Tool: A System for Managing Text Collections at Indiana Un...
Webinar slides: Interoperability between resources involved in TDM at the lev...
2011-Balisage-Poster-Schwarzman
Sasa Nesic - PhD Dissertation Defense
2012.11 - ISWC 2012 - DC - 2
eXtensible Catalog - afternoon session - Tilburg
Cutting Through the Hype about Metadata
Getting a Handle on Alphabet Soup: Implementing Diverse Content Standards and...
Artificial Intelligence for XMLDevelopment
TEI ODD based development
OCLC Research @ U of Calgary: New directions for metadata workflows across li...
Metadata Workshop - Utrecht - November 5, 2008
A Theoretical Survey of User Interface Description Languages: Preliminary Res...
Ji cv6n1
LIS688_Group1
Xml Overview
2012.11 - ISWC 2012 - DC - 1
Course syllabus metadata systems for warsaw
Metadata practice and direction: a community perspective
Ad

More from Jenn Riley (20)

PPTX
Understanding Metadata: Looking Forward
PPTX
The future of cataloguing? Future cataloguers!
PPTX
Discovery elsewhere
PPTX
Designing the Garden: Getting Grounded in Linked Data
PPTX
Launching metaware.buzz
PPTX
Getting Comfortable with Metadata Reuse
PDF
Handout for Digital Imaging of Photographs
PPT
Digital Imaging of Photographs
PPT
The Open Archives Initiative and the Sheet Music Consortium
PPT
Cushman Exposed! Exploiting Controlled Vocabularies to Enhance Browsing and S...
PDF
Handout for FRBR; or, How I learned to stop worrying and love the model
PPT
Metadata for Brittle Books Page Turner
PPT
Digitizing and Delivering Audio and Video
PPT
Variations2
PDF
Handout for Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
PPT
Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
PDF
Handout for Merging Metadata from Multiple Traditions: IN Harmony Sheet Music...
PPT
Merging Metadata from Multiple Traditions: IN Harmony Sheet Music from Librar...
PPT
Challenges in the Nursery: Linking a Finding Aid with Online Content
PPT
Making Interoperability Easier: Creating Shareable Metadata
Understanding Metadata: Looking Forward
The future of cataloguing? Future cataloguers!
Discovery elsewhere
Designing the Garden: Getting Grounded in Linked Data
Launching metaware.buzz
Getting Comfortable with Metadata Reuse
Handout for Digital Imaging of Photographs
Digital Imaging of Photographs
The Open Archives Initiative and the Sheet Music Consortium
Cushman Exposed! Exploiting Controlled Vocabularies to Enhance Browsing and S...
Handout for FRBR; or, How I learned to stop worrying and love the model
Metadata for Brittle Books Page Turner
Digitizing and Delivering Audio and Video
Variations2
Handout for Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
Handout for Merging Metadata from Multiple Traditions: IN Harmony Sheet Music...
Merging Metadata from Multiple Traditions: IN Harmony Sheet Music from Librar...
Challenges in the Nursery: Linking a Finding Aid with Online Content
Making Interoperability Easier: Creating Shareable Metadata
Ad

Recently uploaded (20)

PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
Cloud computing and distributed systems.
PPTX
A Presentation on Artificial Intelligence
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Electronic commerce courselecture one. Pdf
PDF
Machine learning based COVID-19 study performance prediction
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
Machine Learning_overview_presentation.pptx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
The Rise and Fall of 3GPP – Time for a Sabbatical?
Spectral efficient network and resource selection model in 5G networks
Dropbox Q2 2025 Financial Results & Investor Presentation
The AUB Centre for AI in Media Proposal.docx
Cloud computing and distributed systems.
A Presentation on Artificial Intelligence
Advanced methodologies resolving dimensionality complications for autism neur...
Electronic commerce courselecture one. Pdf
Machine learning based COVID-19 study performance prediction
MYSQL Presentation for SQL database connectivity
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Chapter 3 Spatial Domain Image Processing.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
Building Integrated photovoltaic BIPV_UPV.pdf
A comparative analysis of optical character recognition models for extracting...
Review of recent advances in non-invasive hemoglobin estimation
NewMind AI Weekly Chronicles - August'25-Week II
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Machine Learning_overview_presentation.pptx

Using Schematron for Analyzing Conformance to Best Practices for EAD, TEI, and MODS (and some other thoughts on workflow tools)

  • 1. Using Schematron for Analyzing Conformance to Best Practices for EAD, TEI, and MODS (and some other thoughts on workflow tools) Jenn Riley Metadata Librarian Indiana University Digital Library Program
  • 2. Consistency is a challenge Document-centric XML (TEI, EAD) is very difficult to create consistently  Some common tools to help:  ◦ ◦ ◦ ◦ ◦ ◦  Schema/DTD validation Tag libraries XML templates Example documents Keyboard macros Detailed encoding guidelines These are not enough! 7/11/09 ALA 2009 - ALCTS NRMIG 2
  • 3. Another possible tool layer Machine validation of a file against local encoding guidelines  Can only go so far, but that far is extremely helpful  Indiana University implemented using:  ◦ Schematron (http://www.schematron.com) ◦ <oXygen />plugin architecture 7/11/09 ALA 2009 - ALCTS NRMIG 3
  • 4. Inspiration: RLG EAD Report Card 7/11/09 ALA 2009 - ALCTS NRMIG 4
  • 5. 7/11/09 ALA 2009 - ALCTS NRMIG 5
  • 6. 7/11/09 ALA 2009 - ALCTS NRMIG 6
  • 7. 7/11/09 ALA 2009 - ALCTS NRMIG 7
  • 8. 7/11/09 ALA 2009 - ALCTS NRMIG 8
  • 9. 7/11/09 ALA 2009 - ALCTS NRMIG 9
  • 10. 7/11/09 ALA 2009 - ALCTS NRMIG 10
  • 11. 7/11/09 ALA 2009 - ALCTS NRMIG 11
  • 12. More info on Schematron ISO/IEC 19757 - Document Schema Definition Languages (DSDL) - Part 3: Rule-based validation – Schematron.  Be careful! http://www.schematron.com has the primary specs; http://schematron.com is for a particular company’s tool using them. (Weird.)  This is the page you want:  7/11/09 ALA 2009 - ALCTS NRMIG 12
  • 13. Using a Schematron file  Schematron home page provides two distributions: ◦ One for XSLT 1.0 processors and one for 2.0 processors ◦ Each includes a set of three stylesheets to be used in turn on the Schematron file ◦ Result of this processing is a stylesheet to be run on your XML instance document IU implementation wraps this all up into an <oXygen />plugin written in Java  You could also pipe them together with a shell script, a Windows .bat file, etc……  7/11/09 ALA 2009 - ALCTS NRMIG 13
  • 14. 7/11/09 ALA 2009 - ALCTS NRMIG 14
  • 15. 7/11/09 ALA 2009 - ALCTS NRMIG 15
  • 16. 7/11/09 ALA 2009 - ALCTS NRMIG 16
  • 17. Let’s step back  How can better tools revolutionize metadata creation workflows? ◦ Promoting consistency  This is hard and not something that humans are generally good at ◦ True interoperability between systems  Without futzing!  We spend too much valuable human time doing repetitive and low-value tasks as part of descriptive workflows 7/11/09 ALA 2009 - ALCTS NRMIG 17
  • 18. Were do we go from here?  Make better use of available technologies ◦ Automating ◦ Streamlining ◦ Validating  We can and must do our jobs better and more efficiently, with the help of better tools ◦ Providing comparable services with less ◦ Creating a convincing argument for more? 7/11/09 ALA 2009 - ALCTS NRMIG 18
  • 19. There is no excuse for not having usable metadata creation tools.  Smart systems are possible and necessary ◦ Configurable ◦ Modular ◦ Connected  Make it easy to do it well ◦ Consistent ◦ Complete ◦ Efficient   Make it hard to do it poorly We must pay attention to user interface design for cataloging tools 7/11/09 ALA 2009 - ALCTS NRMIG 19
  • 20. OK, rant over. Thank you!  jenlrile@indiana.edu ◦ (watch out for the invisible “l” in the middle)  Slides and handout: ◦ On ALA presentations Wiki <http://presentations.ala.org> ◦ On my home page <http://www.dlib.indiana.edu/~jenlrile/pres entations/nrmig2009/> 7/11/09 ALA 2009 - ALCTS NRMIG 20