Donat Agosti, Plazi
ISH Conference
Cuzco, 23.7.2014
A Step Towards
(From) Read to Write Access to
Taxonomic Publications
Acknowledgement
Pensoft
Pro-iBiosphere / EU FP7
Zenodo / CERN
Why do you publish?
What do you expect from a publication?
What might others expect from publications?
What is a taxonomic publication?
DNA
Specimens Observations
Institution
Pharmacology/epidemiology
Publication
Treatment
Treatment
Treatment
Table
Appendix
Biology/ecology
Reference to other biota
Publication
Treatment
Publication
Taxonomic publication
Var sections
Bib. refs
Treatment
Treatment
Treatment
Treatment
publications or (more frequently) sections of
publications documenting the features or distribution
of a related group of organisms (called a “taxon”,
plural “taxa”) in ways adhering to highly formalized
conventions. Some of these are over a century old.
[Catapano, 2011]
Treatment
Each taxonomic name usage has its treatment
Treatment
Formica obsoleta Linnaeus, 1758: 580
Treatments as standard containers
http://en.wikipedia.org
Treatment
Plazi treatment elements schema
Treatment text
Cite Treatment 1
Cite DNA sequence 1
Cite Figure 1
Cite Abbreviation 1
Cite Table 1
Cite Appendix 1
Cite Reference 1
Cite supplementary
materials
Interaction sp. X
Trait 1 value = Y
Inline table
Treatment text, continued
Materials citation
Species interacted with
Cited by treatment
httpUri
httpUri
Map, Dashboard charts
Abbreviation 1
treatment x GUID
Trait database
Sequence in database
httpUri
RDF
Treatment citation ontology
Treatment (Plazi)
httpUri ?
Table 1
Appendix 1
Figure caption
Supplementary materials
Reference 1
Plazi treatment
Linked treatment elements
Other online source
Other links
Content from citations, instances
Content output from treatment
Hyperlink
httpUri
Figure image
Treatment
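The schema above treats a treatment as a container of text plus citations that may or may not resolve to a link. A minimal sketch of that idea follows. The class names, field names, and the `example.org` URI are illustrative assumptions, not Plazi's actual schema or identifiers.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Citation:
    """A pointer from treatment text to another element (hypothetical model)."""
    kind: str                       # e.g. "treatment", "figure", "table", "reference"
    label: str                      # label as it appears in the text
    http_uri: Optional[str] = None  # resolvable link, if one is known


@dataclass
class Treatment:
    """One taxonomic treatment with its outgoing citations (hypothetical model)."""
    guid: str
    taxon_name: str
    text: str
    citations: List[Citation] = field(default_factory=list)

    def linked(self) -> List[Citation]:
        """Citations that already carry a resolvable URI."""
        return [c for c in self.citations if c.http_uri]

    def unlinked(self) -> List[Citation]:
        """Citations that are still plain text and need a link."""
        return [c for c in self.citations if not c.http_uri]


t = Treatment(
    guid="treatment-x",  # placeholder, not a real identifier
    taxon_name="Formica obsoleta Linnaeus, 1758",
    text="Treatment text ...",
    citations=[
        Citation("figure", "Figure 1", "https://example.org/figure/1"),  # invented URI
        Citation("table", "Table 1"),
        Citation("reference", "Reference 1"),
    ],
)
print(len(t.linked()), len(t.unlinked()))
```

Separating linked from unlinked citations makes visible how much of a treatment is already connected to other resources and how much is still plain text.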
Countries (Region)
Australia (Queensland)
Export species materials citations (DwC)
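As an illustration of what such an export could look like, the sketch below writes materials citations as CSV with Darwin Core term names as column headers. The function name and the sample record are invented for the example. Only the country and region come from the slide, and this is not a description of Plazi's own export code.

```python
import csv
import io

# Darwin Core term names used as column headers.
DWC_FIELDS = [
    "scientificName", "country", "stateProvince",
    "decimalLatitude", "decimalLongitude", "recordedBy",
]


def to_dwc_csv(materials):
    """Serialize a list of materials-citation dicts as Darwin Core style CSV."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf, fieldnames=DWC_FIELDS, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    for m in materials:
        writer.writerow(m)  # keys missing from a record become empty cells
    return buf.getvalue()


# Invented sample record; every value except country and region is a placeholder.
record = {
    "scientificName": "Example species",
    "country": "Australia",
    "stateProvince": "Queensland",
    "decimalLatitude": -16.9,
    "decimalLongitude": 145.7,
    "recordedBy": "Hypothetical Collector",
}
csv_text = to_dwc_csv([record])
print(csv_text)
```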
Pseudomyrmex ants and Vachellia ant-acacias
are a classic example of mutualism in biology.
allenii
melanoceras
ruddiae
chiapensis
collinsii
cookii
cornigera
globulifera
hindsii
janzenii
mayana
sphaerocephala
boopis
flavicornis
hesperius
ita
janzeni
kuenckeli
mixtecus
nigrocinctus
nigropilosus
opaciceps
particeps
peperi
reconditus
satanicus
simulans
spinicola
subtilissimus
veneficus
ferrugineus
gentlei
gracilis
Transbiotic link network
Associated species linked through
references in taxonomic treatments
Acacia-ant species: Pseudomyrmex gracilis
Treatment: redescription
Associated ant-acacia: Acacia gentlei
Ants Plants
Photocredits: Alex Wild
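The link network can be pictured as an undirected graph in which an edge joins a treated species to a species cited in its treatment. A minimal sketch follows, using the one ant and plant pair named on the slide. The function names are assumptions, and a real list of pairs would have to come from marked-up treatments.

```python
from collections import defaultdict

# Each pair is (treated taxon, taxon cited in its treatment).
# This single pair is the example shown on the slide.
links = [
    ("Pseudomyrmex gracilis", "Acacia gentlei"),
]


def build_network(pairs):
    """Build an undirected adjacency map from (taxon, cited taxon) pairs."""
    graph = defaultdict(set)
    for a, b in pairs:
        graph[a].add(b)
        graph[b].add(a)
    return graph


def associates(graph, taxon):
    """Return the taxa linked to `taxon`, sorted by name."""
    return sorted(graph.get(taxon, set()))


network = build_network(links)
print(associates(network, "Acacia gentlei"))
```

Because edges are stored in both directions, the same structure answers "which plants does this ant use?" and "which ants use this plant?".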
Treatment
Treatments linked
through citations
Treatment
Treatment
Linking of treatments using persistent identifiers
Treatment
citation
Treatment
identifier
Treatment
Linking of treatments to external resources
Treatment
Plazi Search and Retrieval
Server: Access to data
DwC-A
You
You
You
human
machine
What are taxonomic publications?
Taxonomic publications
Journal of Hymenoptera Research
5170 specimens
4062 plottable specimens from
1138 unique locations
Brazil
All content in Plazi (34,000 treatments)
14,590 specimens
8900 plottable specimens from
1138 unique locations
200,000,000+ printed pages
1,900,000 species described
20,000,000+ species treatments
17,000 new species per year
BUT: The data are hidden
Incomplete digitization
Publications are not
semantically enhanced
Collections are incomplete
Data is not linked
Most data are not open
Taxonomic publications
Taxonomic publications
PDFs are stupid –
only humans can
understand them
but…
GoldenGATE editor
Conversion
Find the right mix of generic and domain specific solutions
Plazi
SRS
find scan «OCR» markup store
?
domain domaingeneric
Digitization and Markup Workflow:
$$$$ ?
Solution for the future
Publish semantically enhanced:
Journal of Hymenoptera Research
Solution
Open Access
Solution
Blue List
elements of taxonomic information that
are not subject to copyright… e.g.
treatments
(Patterson et al., 2014)
Solution
Bouchout Declaration
Solution
As signatories, we encourage an overarching approach to Open Biodiversity
Knowledge Management which is based on the following fundamental
principles:
Support reliable and permanent open access to digital biodiversity records
Create identifiers, link, and provide direct access to digital objects of biodiversity literature, specimens, multimedia, genes, etc.
Ensure global interoperability and sharing of biodiversity data, information and knowledge
Maintain an ongoing dialogue to refine the concept and implementation
http://bouchoutdeclaration
Solution
Initial Signatories (June 12, 2014)
Institutional: 57; Individual: 58
Large Natural History Institutions (e.g. BGBM, Naturalis, MfN, RGBM, INBio, MCZ, CAS)
Scientific Networks (e.g. IUBS, Vertnet, CRIA, TaiBIF, Canadensys, Creative Commons, DataOne)
Scientists
Global
Solution
Solution
Solution
Linking
Solution
DOI
persistent identifiers
Solution: Digital object identifiers
Publications
Treatments
Images / digital objects
Data
Solution
DOI for publications
DOI
Solution
DOI: CrossRef
doi/10.5281/zenodo.10
doi/10.5281/zenodo.10697
doi/10
DOI: DataCite (Biodiversity Literature Repository)
DOI for publications
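A DOI becomes a link by prefixing it with the address of the DOI resolver. The sketch below shows this with a loose syntax check. The function name and the pattern are illustrative assumptions, and the pattern is a simplification rather than the full DOI syntax. The sample DOI is the one shown on the slide. Whether it resolves to the intended record has not been checked here.

```python
import re

# Loose syntactic check: "10.", a registrant code, "/", then a non-empty suffix.
DOI_PATTERN = re.compile(r"^10\.\d{4,9}/\S+$")


def doi_to_url(doi: str) -> str:
    """Turn a bare DOI into a resolver URL, rejecting strings that are not DOI-shaped."""
    doi = doi.strip()
    if not DOI_PATTERN.match(doi):
        raise ValueError(f"not a DOI: {doi!r}")
    return "https://doi.org/" + doi


print(doi_to_url("10.5281/zenodo.10697"))
```

The same function works for a publication, a treatment, or an image, because the resolver does not care what kind of object the DOI identifies.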
Solution
Solution
ZENODO @CERN
ZENODO builds and operates a simple and innovative service that
enables researchers, scientists, EU projects and institutions to
share and showcase multidisciplinary research results (data and
publications) that are not part of the existing institutional or
subject-based repositories of the research communities.
Zenodo is the digital repository of CERN
Zenodo agreed to host and support the Biodiversity Literature
Repository
Solution
Solution
Solution
Services or what you get:
Wide access
Refindit
Refbank
Legal Issues
Archiving
Distribution of publications resolved
Your legacy literature is directly accessible to everybody
from your publications
Solution
Why don’t we ensure that all the legacy taxonomic
literature is in the
Biodiversity Literature Repository?
The future
Why not ensure that all the legacy taxonomic
literature is in the Biodiversity Literature
Repository?
Why not make our community the first that can
publish in its journal with all publications linked
to a digital copy?
Links
Further reading: http://plazi.org/?q=plazi_publications
Catapano, 2011 (http://www.ncbi.nlm.nih.gov/books/NBK47081/)
Bouchout Declaration (http://bouchoutdeclaration.org)
Blue List (http://plazi.org/?q=blue_list)
Biodiversity Literature Repository (https://zenodo.org/collection/user-biosyslit)
Zenodo (https://zenodo.org/about)
Refindit (http://refindit.org)
Refbank (http://refbank.org)
Pro-iBiosphere (http://pro-ibiosphere.eu/)
Introduction to persistent identifiers (http://wiki.pro-ibiosphere.eu/wiki/Best_practices_for_stable_URIs)
Twitter
@plazi_treat, @bouchoutdec, @myrmoteras
Thank you!
Donat Agosti
agosti@plazi.org
