SlideShare a Scribd company logo
Jay Henry
Chief Marketing Officer
Emerging Standards: Data and Data Exchange in Scholarly Publishing - Jay Henry at CSE 2016
DOI
ISSN
Author
ORCID
Author
Affiliations
(ISNIs or
RING IDs)
Title
Year
Published
Subjects
Circulation
Data
Abstract
Emerging Standards: Data and Data Exchange in Scholarly Publishing - Jay Henry at CSE 2016
Emerging Standards: Data and Data Exchange in Scholarly Publishing - Jay Henry at CSE 2016
*This means data that can be linked together through
unambiguous identification and exchanged with others
Governed
Trusted
Transparent
And contain appropriate metadata
In order to be effective, identifiers must be:
Persistent numeric or alpha-numeric
designations associated with a single entity
Entities can be an institution, person, or piece of
content (People, Places, & Things)
1. Disambiguate, aka enforce uniqueness
2. Enable linking, aka data integration and interoperability
In other words, they provide a simple
basis for data governance
◦ Break down silos
◦ Keep data current and
synchronised
◦ Enable staff to interact
with data more effectively
◦ Simplify data exchange
◦ Improve overall data
quality
Institutional
Identifiers
CRM
Electronic
document
storage
Usage
statistics
Author
Database
Fulfilment
system
Membersh
ip system
License
Validation
Manuscript
Submission
System
• Resources & personnel required to join existing
records to IDs or an authority file
• Build customized solutions mapping systems
together
• Improve data capture to require an ID upon record
creation
• Manual vs programmatic cost-benefit questions
• Design new reporting and analysis tools to
leverage newly linked datasets
Researchers – create Current Research Information
Systems (CRIS) – one portal to figure out how to
best conduct research, who to work with, who will
fund it, what else has been contributed to the
subject thus far, where is the best equipment to
help further the research.
Funders – Want to track areas of interest, identify
worthwhile pursuits, and see where their money
goes.
Institutions – Demonstrate research output more
accurately and precisely describe the institution’s
contribution and who is affiliated with that work.
Publishers – Facilitate transactions of all types from
content discovery to delivery of author royalties.
Improved market analysis and targeted advertising.
 ISO Standard 27729
 ISNI is designed to be a
“bridge identifier”
 Covers any type of entity
ISNI Number ISNI Number
Party ID 2Party ID 1
Proprietary
Information and/or
Metadata
Proprietary
Information and/or
Metadata
In cooperation with ProQuest, OCLC, and
other public and commercial entities,
Ringgold has been working to map ISNIs
to deeper datasets for the past two
years.
It’s taken time due to the problems with
the raw source data, and the policies for
assignment of the unique ISNI identifier.
At the same time ISNI records are loaded
to the Ringgold Identify Database we will
being issuing ISNIs for institutions.
ProQuest (Bowker) is a Registration
Agency as well, focusing on individuals.
ISNI
(OCLC tech)
Third
Parties
M
E
M
E
B
E
R
S
M
E
M
E
B
E
R
S
M
E
M
E
B
E
R
S
M
E
M
E
B
E
R
S
M
E
M
E
B
E
R
S
M
E
M
E
B
E
R
S
Proprietary
Databases
Members submit data to RAGs:
a. auto-match
b. audit match
c. RAG assigns new ISNIs
d. RAGs synch w/ ISNI
e. ISNI used as bridge via
Public Data
Members can access “full”
ISNI information but cannot
provide or assign numbers to
3rd parties-- ISNI data can
be used w/in internal
systems (e.g. library may
assign ISNIs to all individuals
and departments within their
institution
Emerging Standards: Data and Data Exchange in Scholarly Publishing - Jay Henry at CSE 2016
Emerging Standards: Data and Data Exchange in Scholarly Publishing - Jay Henry at CSE 2016
It was a desire to “help” authors differentiate and disambiguate
themselves that got ISNI started.
Along the way, a lot has been learned. A specific example, that
often doesn’t get a lot of attention, is the need for privacy
protection whenever there is an Identification process underway…
this holds true for individuals and institutions.
Our industry spends a great deal of time discussing “open data”,
but there are many times when that data should not (or cannot) be
made public (physicist romance author, animal tester, military
applications, etc….)
The Semantic Web cannot exist
without well structured data
Things take on a life
of their own
Vastness
Vagueness
Uncertainty
Inconsistency
Deceit
The challenges to creating a world
of content tagged with meaning:
Standard Identifiers can help with the
middle three – Artificial Intelligence
will handle Vastness and Deceit
Thank you
Jay Henry
Chief Marketing Officer
jay.henry@ringgold.com
www.ringgold.com

More Related Content

PPTX
Ringgold User Group Meeting 2016 (USA)
PPTX
Small Data, Big Benefits - Christine Orr at SSP 2016
PPTX
Emerging Standards: Data and Data Exchange in Scholarly Publishing
PPTX
Institutional Identifiers - Phil Nicolson at ALPSP 'Setting The Standard' 2015
PPTX
Persistent Identifiers - The 5 Things You Need To Know
PPTX
Institutional Identifiers in Practice: Christine Orr at CESSE 2015
PPT
Unique Identifiers for Business Partners: progress with ISNI, the Ringgold ID...
PPTX
Metadata & Standards in Scholarly Communication
Ringgold User Group Meeting 2016 (USA)
Small Data, Big Benefits - Christine Orr at SSP 2016
Emerging Standards: Data and Data Exchange in Scholarly Publishing
Institutional Identifiers - Phil Nicolson at ALPSP 'Setting The Standard' 2015
Persistent Identifiers - The 5 Things You Need To Know
Institutional Identifiers in Practice: Christine Orr at CESSE 2015
Unique Identifiers for Business Partners: progress with ISNI, the Ringgold ID...
Metadata & Standards in Scholarly Communication

What's hot (20)

PPTX
Persistent Identifiers in Scholarly Communications - Christine Orr at SSP 2016
PPT
Connecting people, places and things
PPTX
Institutional Identifiers in Practice
PDF
New Metadata Developments
PPT
What Publishers Need to Know About Web Scale Discovery
PDF
The reach of Crossref metadata and who is using it
PDF
Checking for Originality: Crossref Similarity Check
PPTX
Pulling Together: information flow throughout the scholarly supply chain
PDF
Getting started with registering content with Crossref
PDF
Organizational Identifiers - Crossref LIVE Hannover
PDF
Introduction to DataCite - Martin Fenner
PDF
Crossref Event Data and other new services
PDF
Managing changes to content
PDF
"Cool" metadata for FAIR data
PDF
Crossref LIVE: The Benefits of Open Infrastructure (APAC time zones) - 29th O...
PDF
New Initiatives - Geoffrey Bilder - London LIVE 2017
PDF
Introduction to Crossref
PDF
Preparing Data for Sharing: The FAIR Principles
PDF
Correcting and Updating the Scholarly Record through CrossMark
PDF
Crossref in your publishing workflow
Persistent Identifiers in Scholarly Communications - Christine Orr at SSP 2016
Connecting people, places and things
Institutional Identifiers in Practice
New Metadata Developments
What Publishers Need to Know About Web Scale Discovery
The reach of Crossref metadata and who is using it
Checking for Originality: Crossref Similarity Check
Pulling Together: information flow throughout the scholarly supply chain
Getting started with registering content with Crossref
Organizational Identifiers - Crossref LIVE Hannover
Introduction to DataCite - Martin Fenner
Crossref Event Data and other new services
Managing changes to content
"Cool" metadata for FAIR data
Crossref LIVE: The Benefits of Open Infrastructure (APAC time zones) - 29th O...
New Initiatives - Geoffrey Bilder - London LIVE 2017
Introduction to Crossref
Preparing Data for Sharing: The FAIR Principles
Correcting and Updating the Scholarly Record through CrossMark
Crossref in your publishing workflow
Ad

Viewers also liked (6)

PPTX
Metadata Standards: A Golden Age Arrives? - Christine Orr at STM
PPTX
Using Data to Drive Discovery of New Scholarly Works
PPTX
Spring Cleaning: Easy Ways to Tidy Your Customer Data
PPT
Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...
PPTX
Institutional Identifiers internally and throughout the supply chain
Metadata Standards: A Golden Age Arrives? - Christine Orr at STM
Using Data to Drive Discovery of New Scholarly Works
Spring Cleaning: Easy Ways to Tidy Your Customer Data
Ringgold Webinar Series: ProtoView - Publication Metadata to Drive Discovery,...
Institutional Identifiers internally and throughout the supply chain
Ad

Similar to Emerging Standards: Data and Data Exchange in Scholarly Publishing - Jay Henry at CSE 2016 (20)

PPTX
Ringgold Webinar Series: 2. Core Strength - Standard Identifiers as the Found...
PDF
Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implement...
PPTX
Who Are We, What Is It, What Can I Do With It and Why Does It Matter?
PPTX
IDPF Digicon Future of Metadata
PPTX
ISNI : a persistent identifier for creatives and associated organizations / T...
PDF
Metadata: Standards Basics for the Independent Publishing Community, with Gra...
PDF
Baer, International Standard Name Identifier, ISN - ISN
PPTX
Identify Database User Group Meeting 2017 UK
PDF
D'Agostino n"The International Standard Name Identifier & Identifying Textual...
PDF
D'Agostino "The International Standard Name Identifier & Identifying Textual ...
PDF
Metadata, identifiers and linking together content - Todd Carpenter
PDF
Henderson, Feick, Agnew, Guida, Rotenberg, D'Agostino, and Weissberg "What's ...
PDF
Wolven, Hickey, and Henderson, "Identifiers: New Problems, New Solutions, Par...
PDF
Drewry universal identifiers throughout production chain, overview and intero...
PDF
Introduction to ISNI
PPT
PPTX
Authority files - Jisc Digital Festival 2014
PPT
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
PPTX
Zen and the Art of Metadata Maintenance
PPT
OAI Metadata: Why and How
Ringgold Webinar Series: 2. Core Strength - Standard Identifiers as the Found...
Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implement...
Who Are We, What Is It, What Can I Do With It and Why Does It Matter?
IDPF Digicon Future of Metadata
ISNI : a persistent identifier for creatives and associated organizations / T...
Metadata: Standards Basics for the Independent Publishing Community, with Gra...
Baer, International Standard Name Identifier, ISN - ISN
Identify Database User Group Meeting 2017 UK
D'Agostino n"The International Standard Name Identifier & Identifying Textual...
D'Agostino "The International Standard Name Identifier & Identifying Textual ...
Metadata, identifiers and linking together content - Todd Carpenter
Henderson, Feick, Agnew, Guida, Rotenberg, D'Agostino, and Weissberg "What's ...
Wolven, Hickey, and Henderson, "Identifiers: New Problems, New Solutions, Par...
Drewry universal identifiers throughout production chain, overview and intero...
Introduction to ISNI
Authority files - Jisc Digital Festival 2014
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
Zen and the Art of Metadata Maintenance
OAI Metadata: Why and How

More from Ringgold Inc (7)

PPTX
Using your Data to Drive Revenue – Laura Cox at London Book Fair 2018
PPTX
Ringgold Webinar Series: 4. 30-Minute Workout - Quick Tips for Better Custome...
PPTX
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...
PPT
Ringgold Webinar Series: 1. Taking Stock – Commitment to Healthy Data
PPTX
Rubbish in Rubbish out: applying good data governance techniques to gain maxi...
PPTX
Identify database
PPTX
Using your Data to Drive Revenue – Laura Cox at London Book Fair 2018
Ringgold Webinar Series: 4. 30-Minute Workout - Quick Tips for Better Custome...
Ringgold Webinar Series: 3. Lean and Mean - Publication Metadata to Enhance D...
Ringgold Webinar Series: 1. Taking Stock – Commitment to Healthy Data
Rubbish in Rubbish out: applying good data governance techniques to gain maxi...
Identify database

Recently uploaded (20)

PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Getting Started with Data Integration: FME Form 101
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
August Patch Tuesday
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Heart disease approach using modified random forest and particle swarm optimi...
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
A comparative study of natural language inference in Swahili using monolingua...
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Mushroom cultivation and it's methods.pdf
PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
OMC Textile Division Presentation 2021.pptx
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Getting Started with Data Integration: FME Form 101
MIND Revenue Release Quarter 2 2025 Press Release
August Patch Tuesday
Assigned Numbers - 2025 - Bluetooth® Document
Reach Out and Touch Someone: Haptics and Empathic Computing
Heart disease approach using modified random forest and particle swarm optimi...
Encapsulation_ Review paper, used for researhc scholars
Agricultural_Statistics_at_a_Glance_2022_0.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
A comparative study of natural language inference in Swahili using monolingua...
Mobile App Security Testing_ A Comprehensive Guide.pdf
Mushroom cultivation and it's methods.pdf
Accuracy of neural networks in brain wave diagnosis of schizophrenia
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
A comparative analysis of optical character recognition models for extracting...
Per capita expenditure prediction using model stacking based on satellite ima...
OMC Textile Division Presentation 2021.pptx
Diabetes mellitus diagnosis method based random forest with bat algorithm

Emerging Standards: Data and Data Exchange in Scholarly Publishing - Jay Henry at CSE 2016

  • 6. *This means data that can be linked together through unambiguous identification and exchanged with others Governed Trusted Transparent And contain appropriate metadata In order to be effective, identifiers must be:
  • 7. Persistent numeric or alpha-numeric designations associated with a single entity Entities can be an institution, person, or piece of content (People, Places, & Things) 1. Disambiguate, aka enforce uniqueness 2. Enable linking, aka data integration and interoperability In other words, they provide a simple basis for data governance
  • 8. ◦ Break down silos ◦ Keep data current and synchronised ◦ Enable staff to interact with data more effectively ◦ Simplify data exchange ◦ Improve overall data quality Institutional Identifiers CRM Electronic document storage Usage statistics Author Database Fulfilment system Membersh ip system License Validation Manuscript Submission System
  • 9. • Resources & personnel required to join existing records to IDs or an authority file • Build customized solutions mapping systems together • Improve data capture to require an ID upon record creation • Manual vs programmatic cost-benefit questions • Design new reporting and analysis tools to leverage newly linked datasets
  • 10. Researchers – create Current Research Information Systems (CRIS) – one portal to figure out how to best conduct research, who to work with, who will fund it, what else has been contributed to the subject thus far, where is the best equipment to help further the research. Funders – Want to track areas of interest, identify worthwhile pursuits, and see where their money goes. Institutions – Demonstrate research output more accurately and precisely describe the institution’s contribution and who is affiliated with that work. Publishers – Facilitate transactions of all types from content discovery to delivery of author royalties. Improved market analysis and targeted advertising.
  • 11.  ISO Standard 27729  ISNI is designed to be a “bridge identifier”  Covers any type of entity ISNI Number ISNI Number Party ID 2Party ID 1 Proprietary Information and/or Metadata Proprietary Information and/or Metadata
  • 12. In cooperation with ProQuest, OCLC, and other public and commercial entities, Ringgold has been working to map ISNIs to deeper datasets for the past two years. It’s taken time due to the problems with the raw source data, and the policies for assignment of the unique ISNI identifier.
  • 13. At the same time ISNI records are loaded to the Ringgold Identify Database we will being issuing ISNIs for institutions. ProQuest (Bowker) is a Registration Agency as well, focusing on individuals.
  • 14. ISNI (OCLC tech) Third Parties M E M E B E R S M E M E B E R S M E M E B E R S M E M E B E R S M E M E B E R S M E M E B E R S Proprietary Databases Members submit data to RAGs: a. auto-match b. audit match c. RAG assigns new ISNIs d. RAGs synch w/ ISNI e. ISNI used as bridge via Public Data Members can access “full” ISNI information but cannot provide or assign numbers to 3rd parties-- ISNI data can be used w/in internal systems (e.g. library may assign ISNIs to all individuals and departments within their institution
  • 17. It was a desire to “help” authors differentiate and disambiguate themselves that got ISNI started. Along the way, a lot has been learned. A specific example, that often doesn’t get a lot of attention, is the need for privacy protection whenever there is an Identification process underway… this holds true for individuals and institutions. Our industry spends a great deal of time discussing “open data”, but there are many times when that data should not (or cannot) be made public (physicist romance author, animal tester, military applications, etc….)
  • 18. The Semantic Web cannot exist without well structured data Things take on a life of their own Vastness Vagueness Uncertainty Inconsistency Deceit The challenges to creating a world of content tagged with meaning: Standard Identifiers can help with the middle three – Artificial Intelligence will handle Vastness and Deceit
  • 19. Thank you Jay Henry Chief Marketing Officer jay.henry@ringgold.com www.ringgold.com

Editor's Notes

  • #3: Let’s take a moment to orient ourselves on the big picture…
  • #4: Our trees are interesting! Publications, vendors, authors… all the people places and things can be described using standard taxonomies and identifiers.
  • #5: This aerial view of our forest home provides a bit more perspective – but we’re really headed to a place where we can use standardized descriptions to develop new information
  • #6: Here, we’ve virtualized our understanding of the world by using data ---from this perspective, not only can we look at things far beyond our immediate sight, but are able to view our surroundings in different contexts and with much deeper analysis than by simply looking around– this is where we are headed when looking at the universe of people, the world of places, or all the stuff in it. We used to look at long lists of people we though might be customer, those that were already customers… and now we have ways to better understand who is really using our content, who is funding the most highly accessed research, and who’s are the individuals and institutions involved?
  • #7: You’ll note that I’ve used the term “Standard Identifiers” as opposed to just “Standards”… I’ll be focusing on using standard identifiers as the main data hooks that will allow us to aggregate information for the purpose of synthesizing knowledge. Interoperability implies communication; how we communicate something is very different than how we describe things. I should take a moment to clarify that the Ringgold ID is not a standard – not an ISO Certified standard, in any case, but in many cases, our data has become a defacto standard through application; some of you might be wondering what then constitutes a big “S” Standards – if any system uses a predefined taxonomy as an authority file to validate data (thereby achieving identify data entries for each and every instance it is needed) then a standard has been achieved. How data is exchanged is quite different than the data itself, and of course, standards may be applied to both. For my part, I’m going to talk about the data itself, not how it is exchanged– So, in terms of the data itself, what are we trying to standardize? Descriptions – the wrapper around highly unique content. More importantly, as an industry—as a species, really—we are creating data elements that can be interpreted by machines – I should say, easily interpreted by machines – I’ll come back to this topic near the end of my presentation.
  • #9: INTERNAL – Let’s look at your own ecosystem. Linking of data: Enable staff to use your data more efficiently, and keep the same view of an institution regardless of what system they are using. See overlaps and outliers when comparing two or more datasets. Example 1: Compare your fulfillment – active subscriber list – w your doc storage system, and see which subscribers have never submitted their license agreement.) Example 2: We’ve got a client that uses 3 systems to take and fulfill institutional subscriptions: CRM, authentication, and an accounting platform. Before linking these systems up with identifiers, there were disconnects that affected their clients: sometimes it was impossible to tell why the auth system was granting journal access to a particular institution – the access seemed unconnected to the payment.
  • #11: Loads of benefits: IF STANDARDS ARE INTEROPERABLE
  • #12: Bridge Identifier – this is an extremely important concept– there are identifiers, and there’s data… and while identifiers are data, not all identifiers operate or are maintained in the same way, and this is the important difference between and ISNI and a Ringgold ID.
  • #13: Mention that we are now board members (Laura).
  • #16: (ISNI straddles persons and institutions, so this will make a nice segue.) INTERNATIONAL STANDARD NAME IDENTIFIER, Iso standard. SCOPE: ISNI is meant to identify all things considered to be public parties, mostly which are creators of content, or otherwise appear in library & union catalogs (including fictional characters?). Typical records hold name variants, as you can see here. It is not limited to the scholarly or research sector, but covers all manner of popular authors, musicians, and contributors. (Original ISNI dataset was populated with VIAF records & other bibliographic sources like the Library of Congress and other international sources.) Our relationship: RIN is an ISNI registration agency, which means we will be working as a conduit for new record creation within our scope, which is primarily institutions in the scholarly supply chain. It is our plan to hold ISNIs for all institutions in our Identify database, and we are now we are working to ensure that all ISNI records which map to RINs are correct, and that we can achieve clean one to one matches. We are also working with them to create new ISNI IDs for insts in RIN, but that are not yet in ISNI. By mapping our database completely to theirs, we hope to put our clients at the starting line, so that our clients may maximize their supply chain linking.
  • #17: To look at a few specific records: Here’s an ISNI, but an institutional record rather than the personal record we saw earlier. Again, note all the name variants as they appear in library holdings records. I should mention that this record illustrates one of the biggest problems everyone is confronted with--- the Many-to-One (One “Golden Record” as one major publisher refers to their internal authorative record). Here we have many names for the same institution… all attributed to the 1 ISNI – this is not ulinke what Ringgold does– we have ‘alternate names’ for each Ringgold ID stored within the Identify database, and by the end of June, the ISNI names will also be linked to the Ringgold ID (mostly… there’s not a 100% 1|1 match between ISNI and Ringgold… that’s another story.