Global Names Architecture (GNA) Meeting, Paris, France, 2011. David Remsen, Senior Programme Officer, Global Biodiversity Information Facility (GBIF)
Somewhere around 2001
From T.E. Glover, The Fishes of Southwestern Japan, c.1870
Orthography: The long-finned squid, Loligo pealeii (Laseur)
Orthography: Reconciling different forms of the same name
  • Agalinus paupercula borealis
  • Agalinus pauperculum borealis
  • Agalinis paupercula var. Borealis
  • Agalinus pauperculum var. borealis
  • Agalinus paupercula var. borealis
  • Agalinus paupercula var. borealis Pennell
  • Agalinus paupercula Britton var. borealis Pennell
  • Agalinus paupercula (Gray) Britt. var. borealis Pennell
  • Agalinis paupercula (A.Gray) Britton var. borealis Pennell
  • Agalinus paupercula (Gray) Britton var. borealis (Pennell) Zenkert 1934
Nomenclature: The Bluefish, Temnodon saltator
Taxonomy: P. carinii sec 1; P. carinii sec 2; P. jiroveci
Higher Taxonomy: With access to authority information
Higher Taxonomy: Without authority information
Issues that are not unique, particularly in federated systems
Taxonomic Data Sources: Classification; Taxonomic Status; Heterotypic Synonymy; Taxon Identifiers. Nomenclatural Data Sources: Orthography; Nomenclatural Status; Objective Synonymy; Nomenclatural Identifiers. Addressed by…
Catalog of Life, Index Fungorum, Species Fungorum, Tropicos, LepIndex, GRIN, DSMZ, Euzeby index, IPNI, ITIS, Euro + Med Plantbase, Index Nominum Diptorum, Orthoptera Species File, The Plant List, NCBI Taxonomy, World Register of Marine Species, Angiosperm Phylogeny Group list, Solanaceae Source, Amphibian Species World, World Spider Catalogue, AlgaeBase, Index Nominum Algarum, Index Nominum Genericorum, ZooBank, ERMS, IUCN RedList, Mammal Species of World, Catalog of Fishes, FishBase, Catalog of Life, Index Animalium, ION, Nomenclator Zoologicus, Fauna Europaea, IRMNG, NZOR, Coleorrhyncha Species File. A lot of this…
Common Discovery Network; Documentation (metadata) model; Data Sharing Format; Data Sharing tools; Consensus Web Service methods; Few resolvable identifiers; No common resolution output; Little Integration. Not a lot of this
“All accumulated information of a species is tied to a scientific name, a name that serves as a link between what has been learned in the past and what we today add to the body of knowledge.” (nearly) All names matter
We need: Global discovery of nomenclature and taxonomic resources; Common access to these resources; Reconcile names labeling data and information to nomenclature and taxa; Embedded services that add value to these resources
uBio
An index of all names used with biodiversity information reconciled to authoritative nomenclators
An index of taxon resources and species checklists
 
Nice idea, no architecture
Without an architecture: Ad-hoc; Requires personal networking; No clear fit to a larger picture
Common approach to common tasks
Architecture: Global Registry for resource discovery; Common and documented data standards (Metadata, Data, Vocabularies); Data Sharing tools; Common web service methods; Resolvable identifiers (names/taxa)
Architecture: Enable global discovery of taxonomic and nomenclatural resources; Derivative products (regional and thematic species checklists); Enable resources to be shared in a consistent manner; Promote development of new derived products
Enable global discovery
Integrated Publishing Toolkit 2.0: Supports publication of species checklists (sensu lato); Supports EML as resource metadata format; Darwin Core Archive as output format; Possible to add ISO metadata output; TCS data output (lossy relative to source data)
Lowered the technical barriers to data publishing: Publishing with spreadsheets; Publishing via email; Publishing with no installed tools; Publishing with no tools at all
Darwin Core Archives
Lots of documentation: > 2500 downloads; English/French/Spanish
Many resources available
Promote Development of New Derived Products
GBIF involved but not integrated: Global Names Index; Global Name Usage Bank; Supported by what has been presented
Checklist Bank for GBIF network
Checklist Bank Status: Dev version in place. Integration with GBIF data portal 2011. http://ecat-dev.gbif.org/
i4Life
Vision: Common platform for multiple initiatives to discover and exchange taxonomic and nomenclatural information; New derived products that improve efficiency and utility of taxonomic process; Embed taxonomy within larger biodiversity informatics challenges
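
The orthography slide lists ten renderings of one varietal name that differ in genus spelling, epithet gender, rank marker and authorship. As a rough illustration of what "reconciling" them could involve, the sketch below reduces each string to a comparison key by dropping authorship and stripping a few Latin endings. The function names, the ending list and the rank-marker list are assumptions made for this example; this is not the method used by GBIF, uBio or the Global Names Index, and real name parsers handle many more cases.

```python
import re

# Assumed, deliberately short lists; production parsers cover far more cases.
RANK_MARKERS = {"var.", "subsp.", "ssp.", "f."}
LATIN_ENDINGS = ("um", "us", "is", "a")


def stem(word: str) -> str:
    """Strip one common Latin ending so gender and spelling variants compare equal."""
    for ending in LATIN_ENDINGS:
        if word.endswith(ending) and len(word) > len(ending) + 2:
            return word[: -len(ending)]
    return word


def match_key(name: str) -> str:
    """Reduce a name string to stemmed genus and epithets, ignoring authorship."""
    tokens = name.split()
    parts = [tokens[0].lower()]  # first token is taken to be the genus
    after_rank = False
    for token in tokens[1:]:
        if token in RANK_MARKERS:
            after_rank = True
            continue
        # Treat as an epithet: an all-lowercase word, or whatever follows a rank marker.
        # Limitation: lowercase author particles such as "ex" would be misread as epithets.
        if after_rank or re.fullmatch(r"[a-z-]+", token):
            parts.append(token.lower())
        after_rank = False
    return " ".join(stem(p) for p in parts)


VARIANTS = [
    "Agalinus paupercula borealis",
    "Agalinus pauperculum borealis",
    "Agalinis paupercula var. Borealis",
    "Agalinus pauperculum var. borealis",
    "Agalinus paupercula var. borealis",
    "Agalinus paupercula var. borealis Pennell",
    "Agalinus paupercula Britton var. borealis Pennell",
    "Agalinus paupercula (Gray) Britt. var. borealis Pennell",
    "Agalinis paupercula (A.Gray) Britton var. borealis Pennell",
    "Agalinus paupercula (Gray) Britton var. borealis (Pennell) Zenkert 1934",
]

keys = {match_key(v) for v in VARIANTS}
print(keys)  # all ten variants collapse to one key
```

A key like this only groups candidate matches. Each source can still keep its own preferred spelling, with the key serving as the link between them.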


Global Names Architecture - Remsen


Editor's Notes

  • #4: I was digitising resources like this gem – beautiful plates and lots of nice metadata to index them with. The only problem was that the only really useful biological metadata was the scientific name that labeled each picture. Had I only known what I was getting into.
  • #5: This is another early digitisation project. A seminal work. Only the name is spelled incorrectly, or at least incorrectly according to today's spelling. I can't change the spelling in the source, so what can I do about it? At the time I was short on ideas.
  • #6: Pictures and specimens and gene sequences are labelled with names like this. Eventually I learned these are all different forms of the same name. For various reasons, however, different sources often prefer to retain their versions of the name.
  • #7: This paper contains historic catch and distribution data. But the name represents a combination that is no longer in use. How can we retrieve historic information on species where the name has changed from what is in use today?
  • #8: In 1999, the fungus responsible for one of the major causes of death in people with HIV was renamed, following a split of the species. This requires those who work on this disease to be aware of the new name in order to access information related to the species.
  • #9: Here is a map showing a distribution of data related to hummingbirds. It is assembled because we have a taxonomic authority source that specifies all the names of the species within the hummingbird family. As a result the map is an accurate representation (aside from some stray data with clear geospatial issues).
  • #10: Without access to sufficient authoritative taxonomic data, we have been forced to rely on less accurate classification data originating in occurrence datasets. These datasets often contain errors, as illustrated here, where a European bird species was mistakenly placed in the hummingbird family.
  • #30: This outreach extends to a new suite of data publishing guides and tools that provide details on data formats, checklist metadata, and checklist publishing tools.
  • #31: In 2011 the number of taxonomic authority files published through the network has doubled thanks to promotional efforts within the GBIF network and partnerships that include other taxonomic initiatives.
  • #35, #36: in use by ALA/ . Consultation through the GNA. Soliciting feedback about the APIs. Require discussion with community about attribution? Something about it being the component in the evolved portal that will provide all taxonomic services, including means to organise content
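
Note #7 asks how historic information can be retrieved when a name has changed. One way to picture the answer is a small synonym index that expands a query to every name sharing the same accepted name. The sketch below is illustrative only: the `NameRecord` type, the `search_terms` function and the two hand-entered records are assumptions for this example, including the assumption that the currently accepted name for the Bluefish is Pomatomus saltatrix. It does not reflect the data model of Checklist Bank or any other service named in the slides.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NameRecord:
    name: str
    status: str         # "accepted" or "synonym"
    accepted_name: str  # equals `name` when status is "accepted"


# Hypothetical, hand-built index; a real one would come from a nomenclator or checklist.
INDEX = {
    "Temnodon saltator": NameRecord("Temnodon saltator", "synonym", "Pomatomus saltatrix"),
    "Pomatomus saltatrix": NameRecord("Pomatomus saltatrix", "accepted", "Pomatomus saltatrix"),
}


def search_terms(name: str, index: dict[str, NameRecord]) -> set[str]:
    """Return every indexed name that shares an accepted name with `name`."""
    record = index.get(name)
    if record is None:
        return {name}  # unknown name: search for it as given
    return {r.name for r in index.values() if r.accepted_name == record.accepted_name}


# A query on the current name also picks up the historic combination.
print(sorted(search_terms("Pomatomus saltatrix", INDEX)))
```

A flat lookup like this covers simple renaming only. A species split such as the one described in note #8 needs concept-level information ("sec") that it does not model.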