SlideShare a Scribd company logo
Making Inter-operability Visible Visualising Interoperability: ARH, Aggregation, Rationalisation and Harmonisation Michael Currie, Meigan Geileskey, Liddy Nevile, Richard Woodman A Project  in the Victorian Department of Premier & Cabinet
Summary Introduction  disclaimer, context purpose, process The Case Study Some  after-thoughts...
Disclaimer Just what it is … Using info m’ment to teach maths Not a scientific paper ... Information scientists, systems architects, administrators, DC worshippers, …..
Context Dept PC developing Intranet Dept SD developing VOG (brochure-net) Most depts developing doc. m’ment systems  departmental agencies have (sometimes) implemented, more or less, DC-based Australian Government Locator Standard  the Government needs discovery for all resources   and information   interoperability
Context (2) a visualisation of interoperability to assist real-world deployment of metadata a few metadata records, but the future of the Whole-of-Government Intranet and all departments of government my role
Purpose of Project With minimal interference  To accommodate work on extranet Influence new document management systems to prepare for intranet To maintain local specificity and global discovery To increase interest and effort.
Previous Process Departmental mandate to manage information  Monthly meetings of those interested in metadata from various departments  Only collaboration, cooperation, … Weak government “policy”.
Case Study Process Literature review -  what is inter-operability Metadata review Emergent phenomena Bricolage development Aggregation, rationalisation, harmonisation Registry, repository, WOG search...
Interoperating - an action when one tries to mix and match across domains Agencies use many different strategies to make this happen….
Interoperability - a state where there is interchange and exchange without difficulty Government agencies need  complete  or  absolute  interoperability - every document  must  be discoverable
Mapping for interoperation
What is good enough? Moderate inter-operability is good enough for information re-use Departments like to maintain independence, operate in ‘silos’ Only perfect is good enough for Parliament public accountability is forcing the issue
Aggregation First list  who has metadata, for what Second list -  who uses what elements List all elements currently used is there a significant difference
Record review Live record
List review
Spreadsheet review Live spreadsheet
Aggregation - review Looking at the list, are there problems? It’s way too big Too many things in the list Variations in application profiles, errors, and variation in use of elements, formats, etc... Not all metadata is useful for discovery eg. random use of ‘DPC’  vs  ‘Department of Premier and Cabinet’  searchers may miss some documents.
Emergent goals - 1 Illuminate limitations in inter-operability resulting from existing metadata practices Articulate the cause of the problems Develop a shared strategy for improving the inter-operability
Emergent goals - 2 Encourage data managers to develop a single, comprehensive metadata application profile  derived from the current requirements and foci of all users with high local specificity and deep interoperability.
Analysis Look at the list of elements and decide which have  material  differences **Remembering that each agency values  all  its metadata content
Material differences Mis-use of available elements,  qualifiers etc Different expression of the same type of information Different granularity Different element name for the  same  information, … -> need to rationalise
Element Name Variants Inconsistent case  eg. DC.Title/TITLE/title EDNA.Userlevel/UserLevel Non-standard names  eg. DC.Keywords Non-standard qualifiers eg. DC.Description.Abstract Non-standard abbreviations  eg. DC.Lang
Field Selection Standard and non-standard element names eg. 'description' and DC.Description Locally created element names eg. Custodian
Value string Variants Despite DCMES recommendations … DC.Identifier:  other id numbers without qualifiers. DC.Date:  also used yyyy, yyyy/m/d, yyyy-dd-mm DC.Format:  Non-standard terms eg. VHS (PAL) Incorrect case eg. text/HTML
DC.Language: also used en, en-au, en-AU Qualifiers embedded in values: DC.Publisher CONTENT="corporateName=State..." Non-standard proper names  DPC for Department of Premier and Cabinet ->Generally inconsistent use of capitalisation and punctuation
Observation Most element variants due to non-standard use of capitals, punctuation, spelling Users seem to act independently of Application Profiles  Little use of collection specific qualifiers to enhance specificity
Rationalise! Add qualifiers to each type of element? no less elements but significantly increases semantic interoperability Dumb-down for interoperation? Loss of precision, too many documents from search but not everything is found Cross-map ontologies, one to another  Too cumbersome Map everything to a new ontology Blanchi, ‘Harmony’, etc.
Rationalise (a process?) Choose strategy Decide what is disposable/worth saving given chosen strategy Delete disposable elements  -> The list gets shorter, semantic inter-operability is improved.
Harmonisation Work together to choose appropriate formats and definitions for elements and qualifiers **Remembering that, locally, departments will want more and less precision
Harmonisation (process?) Eg  in DC.date - maybe just a different format so it’s obvious what to do in DC.subject - some use AGIFT, some use SCIS,  some use NRE’s geo-spatial thesaurus, etc. - consensus work to be done
Results from ARH - HA? DPC Project is working on towards an Intranet, harmonising the application profiles
Supporting granularity -  technologies Registries - leads to the well-defined full WOG ontology supporting evolving granularity Federated Metadata repositories preserve local control and remote interoperability
Key service providers - VOG VOG DoJ DoI DNRE
Key service providers - WOG WOG AP VOG AP
The grammar of DC metadata http://www.dlib.org/dlib/october00/baker/10baker.html
Resource has property (subject): ‘about land’ ‘about rural land’ ‘about Victorian rural land’  ‘about Victorian rural land in section 43’  ‘about Victorian rural land in section 43 in January 2002’ The grammar of DC metadata
Does your resource speak Dublin Core (AGLS)? “…  Pidgins are inherently limited in what they can express, but they are easy to learn and enormously useful. In real life, we talk one way to our professional colleagues and another way to visitors from other cultures. Our digital library applications need to do this as well. Simplicity and complexity are both appropriate, depending on context. If Dublin Core is too simple or generic to use as the native idiom of a particular application, pidgin statements may be extracted or translated from richer idioms that exist for specialized domains. This output should also be filtered to keep the fifteen buckets clear of encoding debris and semantic silt. One should treat digital tourists with courtesy and hide from them the complexities of a local application vocabulary or grammar. However sophisticated its local idiom may be, an application might also speak a pidgin that general users and generic search engines will understand. Simple, semantically clean, computationally obvious values will help us negotiate our way through a splendidly diverse and heterogeneous Internet.”     (Tom Baker,   http://www.dlib.org/dlib/october00/baker/10baker.html)
Pidgin vs Symbolic Languages Pidgin languages serve for good enough communication Symbolic languages serve for complete communication in abbreviated form structure dictionaries

More Related Content

PPT
Accessibility and Metadata
PPT
Introduction to the Semantic Web
PPT
Accessibility Issues
PPT
Converging research towards AccessForAll
PPT
Making working thesauri
PPT
Annotating for Individual experiences
PPT
Toward The Semantic Deep Web
PPT
Networked Digital Library Of Theses And Dissertations
Accessibility and Metadata
Introduction to the Semantic Web
Accessibility Issues
Converging research towards AccessForAll
Making working thesauri
Annotating for Individual experiences
Toward The Semantic Deep Web
Networked Digital Library Of Theses And Dissertations

What's hot (20)

PPT
interoperability: the value of recombinant potential
PDF
Using linguistic analysis to translate
PDF
Ontologies and semantic web
PPTX
Entity Relationship (ER) Model
PPT
test
PDF
Automatically converting tabular data to
PPT
Folksonomies: a bottom-up social categorization system
PPT
Object models and object representation
PPT
Access Management for Libraries by John Paschoud & Masha Garibyan
PPT
Repositories and the wider context
PPT
IKHarvester - Informal Knowledge Harvester
PPT
An introduction to OAI-ORE
PPT
Metadata issues and challenges: Link Data
PDF
Potential Future Directions for ePADD
PPTX
Knowledge Organization Systems (KOS): Management of Classification Systems in...
PPTX
Discovering and Accessing Email of Potential Historical Value with ePADD -- A...
PPT
Wiki meeting2
PPT
Semantic Web for Enterprise Architecture
PPT
Schemas and Schema-driven Metadata Software
PPTX
Multilingual Knowledge Organization Systems Management: Best Practices
interoperability: the value of recombinant potential
Using linguistic analysis to translate
Ontologies and semantic web
Entity Relationship (ER) Model
test
Automatically converting tabular data to
Folksonomies: a bottom-up social categorization system
Object models and object representation
Access Management for Libraries by John Paschoud & Masha Garibyan
Repositories and the wider context
IKHarvester - Informal Knowledge Harvester
An introduction to OAI-ORE
Metadata issues and challenges: Link Data
Potential Future Directions for ePADD
Knowledge Organization Systems (KOS): Management of Classification Systems in...
Discovering and Accessing Email of Potential Historical Value with ePADD -- A...
Wiki meeting2
Semantic Web for Enterprise Architecture
Schemas and Schema-driven Metadata Software
Multilingual Knowledge Organization Systems Management: Best Practices
Ad

Similar to Making Inter-operability Visible (20)

PPTX
Metadata: Digital Humanties
DOC
Notes for talk on 12th June 2013 to Open Innovation meeting, Glasgow
PDF
Domain-Driven Design
PPT
The Future of Cataloging and Catalogers
PDF
The Value and Benefits of Data-to-Text Technologies
PPT
OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...
PPT
MIDESS
PPT
New Directions in Metadata
PPT
IWMW 2002: The Value of Metadata and How to Realise It
PDF
Survey of Object Oriented Database
PPT
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
PDF
Document Based Data Modeling Technique
PPT
D.3.1: State of the Art - Linked Data and Digital Preservation
PPT
Mapping the content ecosystem
PPSX
Knowledge tree show
PPT
Faceted Navigation (LACASIS Fall Workshop 2005)
DOCX
Key aspects of big data storage and its architecture
PPT
Metadata practice and direction: a community perspective
PPT
DITA, Semantics, Content Management, Dynamic Documents, and Linked Data – A M...
PPTX
Content Architecture for Rapid Knowledge Reuse-congility2011
Metadata: Digital Humanties
Notes for talk on 12th June 2013 to Open Innovation meeting, Glasgow
Domain-Driven Design
The Future of Cataloging and Catalogers
The Value and Benefits of Data-to-Text Technologies
OAIS and It's Applicability for Libraries, Archives, and Digital Repositories...
MIDESS
New Directions in Metadata
IWMW 2002: The Value of Metadata and How to Realise It
Survey of Object Oriented Database
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
Document Based Data Modeling Technique
D.3.1: State of the Art - Linked Data and Digital Preservation
Mapping the content ecosystem
Knowledge tree show
Faceted Navigation (LACASIS Fall Workshop 2005)
Key aspects of big data storage and its architecture
Metadata practice and direction: a community perspective
DITA, Semantics, Content Management, Dynamic Documents, and Linked Data – A M...
Content Architecture for Rapid Knowledge Reuse-congility2011
Ad

Recently uploaded (20)

PDF
Electronic commerce courselecture one. Pdf
PPT
Teaching material agriculture food technology
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Approach and Philosophy of On baking technology
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Empathic Computing: Creating Shared Understanding
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Spectral efficient network and resource selection model in 5G networks
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
cuic standard and advanced reporting.pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PPTX
sap open course for s4hana steps from ECC to s4
Electronic commerce courselecture one. Pdf
Teaching material agriculture food technology
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
“AI and Expert System Decision Support & Business Intelligence Systems”
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Approach and Philosophy of On baking technology
Network Security Unit 5.pdf for BCA BBA.
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
20250228 LYD VKU AI Blended-Learning.pptx
Unlocking AI with Model Context Protocol (MCP)
MYSQL Presentation for SQL database connectivity
Empathic Computing: Creating Shared Understanding
Reach Out and Touch Someone: Haptics and Empathic Computing
Spectral efficient network and resource selection model in 5G networks
Programs and apps: productivity, graphics, security and other tools
cuic standard and advanced reporting.pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
sap open course for s4hana steps from ECC to s4

Making Inter-operability Visible

  • 1. Making Inter-operability Visible Visualising Interoperability: ARH, Aggregation, Rationalisation and Harmonisation Michael Currie, Meigan Geileskey, Liddy Nevile, Richard Woodman A Project in the Victorian Department of Premier & Cabinet
  • 2. Summary Introduction disclaimer, context purpose, process The Case Study Some after-thoughts...
  • 3. Disclaimer Just what it is … Using info m’ment to teach maths Not a scientific paper ... Information scientists, systems architects, administrators, DC worshippers, …..
  • 4. Context Dept PC developing Intranet Dept SD developing VOG (brochure-net) Most depts developing doc. m’ment systems departmental agencies have (sometimes) implemented, more or less, DC-based Australian Government Locator Standard the Government needs discovery for all resources and information interoperability
  • 5. Context (2) a visualisation of interoperability to assist real-world deployment of metadata a few metadata records, but the future of the Whole-of-Government Intranet and all departments of government my role
  • 6. Purpose of Project With minimal interference To accommodate work on extranet Influence new document management systems to prepare for intranet To maintain local specificity and global discovery To increase interest and effort.
  • 7. Previous Process Departmental mandate to manage information Monthly meetings of those interested in metadata from various departments Only collaboration, cooperation, … Weak government “policy”.
  • 8. Case Study Process Literature review - what is inter-operability Metadata review Emergent phenomena Bricolage development Aggregation, rationalisation, harmonisation Registry, repository, WOG search...
  • 9. Interoperating - an action when one tries to mix and match across domains Agencies use many different strategies to make this happen….
  • 10. Interoperability - a state where there is interchange and exchange without difficulty Government agencies need complete or absolute interoperability - every document must be discoverable
  • 12. What is good enough? Moderate inter-operability is good enough for information re-use Departments like to maintain independence, operate in ‘silos’ Only perfect is good enough for Parliament public accountability is forcing the issue
  • 13. Aggregation First list who has metadata, for what Second list - who uses what elements List all elements currently used is there a significant difference
  • 16. Spreadsheet review Live spreadsheet
  • 17. Aggregation - review Looking at the list, are there problems? It’s way too big Too many things in the list Variations in application profiles, errors, and variation in use of elements, formats, etc... Not all metadata is useful for discovery eg. random use of ‘DPC’ vs ‘Department of Premier and Cabinet’ searchers may miss some documents.
  • 18. Emergent goals - 1 Illuminate limitations in inter-operability resulting from existing metadata practices Articulate the cause of the problems Develop a shared strategy for improving the inter-operability
  • 19. Emergent goals - 2 Encourage data managers to develop a single, comprehensive metadata application profile derived from the current requirements and foci of all users with high local specificity and deep interoperability.
  • 20. Analysis Look at the list of elements and decide which have material differences **Remembering that each agency values all its metadata content
  • 21. Material differences Mis-use of available elements, qualifiers etc Different expression of the same type of information Different granularity Different element name for the same information, … -> need to rationalise
  • 22. Element Name Variants Inconsistent case eg. DC.Title/TITLE/title EDNA.Userlevel/UserLevel Non-standard names eg. DC.Keywords Non-standard qualifiers eg. DC.Description.Abstract Non-standard abbreviations eg. DC.Lang
  • 23. Field Selection Standard and non-standard element names eg. 'description' and DC.Description Locally created element names eg. Custodian
  • 24. Value string Variants Despite DCMES recommendations … DC.Identifier: other id numbers without qualifiers. DC.Date: also used yyyy, yyyy/m/d, yyyy-dd-mm DC.Format: Non-standard terms eg. VHS (PAL) Incorrect case eg. text/HTML
  • 25. DC.Language: also used en, en-au, en-AU Qualifiers embedded in values: DC.Publisher CONTENT="corporateName=State..." Non-standard proper names DPC for Department of Premier and Cabinet ->Generally inconsistent use of capitalisation and punctuation
  • 26. Observation Most element variants due to non-standard use of capitals, punctuation, spelling Users seem to act independently of Application Profiles Little use of collection specific qualifiers to enhance specificity
  • 27. Rationalise! Add qualifiers to each type of element? no less elements but significantly increases semantic interoperability Dumb-down for interoperation? Loss of precision, too many documents from search but not everything is found Cross-map ontologies, one to another Too cumbersome Map everything to a new ontology Blanchi, ‘Harmony’, etc.
  • 28. Rationalise (a process?) Choose strategy Decide what is disposable/worth saving given chosen strategy Delete disposable elements -> The list gets shorter, semantic inter-operability is improved.
  • 29. Harmonisation Work together to choose appropriate formats and definitions for elements and qualifiers **Remembering that, locally, departments will want more and less precision
  • 30. Harmonisation (process?) Eg in DC.date - maybe just a different format so it’s obvious what to do in DC.subject - some use AGIFT, some use SCIS, some use NRE’s geo-spatial thesaurus, etc. - consensus work to be done
  • 31. Results from ARH - HA? DPC Project is working on towards an Intranet, harmonising the application profiles
  • 32. Supporting granularity - technologies Registries - leads to the well-defined full WOG ontology supporting evolving granularity Federated Metadata repositories preserve local control and remote interoperability
  • 33. Key service providers - VOG VOG DoJ DoI DNRE
  • 34. Key service providers - WOG WOG AP VOG AP
  • 35. The grammar of DC metadata http://www.dlib.org/dlib/october00/baker/10baker.html
  • 36. Resource has property (subject): ‘about land’ ‘about rural land’ ‘about Victorian rural land’ ‘about Victorian rural land in section 43’ ‘about Victorian rural land in section 43 in January 2002’ The grammar of DC metadata
  • 37. Does your resource speak Dublin Core (AGLS)? “… Pidgins are inherently limited in what they can express, but they are easy to learn and enormously useful. In real life, we talk one way to our professional colleagues and another way to visitors from other cultures. Our digital library applications need to do this as well. Simplicity and complexity are both appropriate, depending on context. If Dublin Core is too simple or generic to use as the native idiom of a particular application, pidgin statements may be extracted or translated from richer idioms that exist for specialized domains. This output should also be filtered to keep the fifteen buckets clear of encoding debris and semantic silt. One should treat digital tourists with courtesy and hide from them the complexities of a local application vocabulary or grammar. However sophisticated its local idiom may be, an application might also speak a pidgin that general users and generic search engines will understand. Simple, semantically clean, computationally obvious values will help us negotiate our way through a splendidly diverse and heterogeneous Internet.” (Tom Baker, http://www.dlib.org/dlib/october00/baker/10baker.html)
  • 38. Pidgin vs Symbolic Languages Pidgin languages serve for good enough communication Symbolic languages serve for complete communication in abbreviated form structure dictionaries