www.iDigBio.org
Update from the
Entomological Collections Network (ECN) 2012
November 10 – 11, Knoxville, TN
Debbie Paul, Greg Riccardi, Gil Nelson
HUB
University of Florida
Florida State University
Advancing Digitization of Biodiversity Collections
• Facilitate use of biodiversity data to address environmental and economic
challenges
– Researchers
– Educators
– General public
– Policy-makers
• Enable digitization of biodiversity collections data
– Develop efficient and effective digitization standards and workflows
– Respond to cyberinfrastructure needs
• Provide portal access to biodiversity data in a cloud-computing environment
• Plan for long-term sustainability of the national digitization effort
– Expand participation: partners and data sources
Seven Thematic Collections Networks (TCNs)
• InvertNet: An Integrative Platform for Research on Environmental Change, Species Discovery and
Identification (Illinois Natural History Survey, University of Illinois) http://invertnet.org
• Plants, Herbivores, and Parasitoids: A Model System for the Study of Tri-Trophic Associations
(American Museum of Natural History) http://tcn.amnh.org
• North American Lichens and Bryophytes: Sensitive Indicators of Environmental Quality and Change
(University of Wisconsin – Madison) http://symbiota.org/nalichens/index.php
http://symbiota.org/bryophytes/index.php
• Digitizing Fossils to Enable New Syntheses in Biogeography-Creating a PALEONICHES-TCN (University of
Kansas)
• The Macrofungi Collection Consortium: Unlocking a Biodiversity Resource for Understanding Biotic
Interactions, Nutrient Cycling and Human Affairs (New York Botanical Garden)
• Mobilizing New England Vascular Plant Specimen Data to Track Environmental Change (Yale University)
• Southwest Collections of Arthropods Network (SCAN): A Model for Collections Digitization to Promote
Taxonomic and Ecological Research (Northern Arizona University)
http://hasbrouck.asu.edu/symbiota/portal/index.php
National Resource (iDigBio), Thematic Collections Networks
(TCNs), and Collaborators
Currently 7 TCNs with 130 participating institutions
• iDigBio HUB v1 Portal -- 12/12/12
• Anticipating real data -- January 2013
– Symbiota
Lichens, Bryophytes and Climate Change TCN
– VertNet
HUB v1
• iDigBio API v1 (Releasing 11/9/12)
– Adds support for storage of taxon data, people, and organizations
– Improved handling of inter-type relationships
– Ready to enable write access for a limited set of beta-testers.
• iDigBio Portal v1 (Releasing 12/12/12)
– Integrated with iDigBio Web authentication.
• Slowly expanding interactive features; the focus is on backend and process
right now
• Creation of custom record groups (of people, specimens, or whatever)
from searches
– Can download groups as CSV and view a group's points with coordinates
on a map
• Added taxon, people, and organization indexes and searching.
• Overall presentation of field names is much more consistent across the
portal.
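
The record-group idea above (build a group from a search, download it as CSV, map the points that have coordinates) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the record fields, the identifiers, and the function names `search`, `group_to_csv`, and `mappable_points` are all hypothetical and are not taken from the iDigBio API or portal.

```python
import csv
import io

# Hypothetical specimen records; field names and identifiers are illustrative only.
RECORDS = [
    {"guid": "example-guid-0001", "taxon": "Apis mellifera", "lat": 29.65, "lon": -82.32},
    {"guid": "example-guid-0002", "taxon": "Bombus impatiens", "lat": None, "lon": None},
    {"guid": "example-guid-0003", "taxon": "Apis mellifera", "lat": 30.44, "lon": -84.28},
]

FIELDS = ["guid", "taxon", "lat", "lon"]


def search(records, taxon):
    """Return records whose taxon matches exactly (a stand-in for a portal search)."""
    return [r for r in records if r["taxon"] == taxon]


def group_to_csv(group):
    """Serialize a record group to CSV text with a header row."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(group)
    return buf.getvalue()


def mappable_points(group):
    """Keep only (lat, lon) pairs for records that actually carry coordinates."""
    return [
        (r["lat"], r["lon"])
        for r in group
        if r["lat"] is not None and r["lon"] is not None
    ]


if __name__ == "__main__":
    group = search(RECORDS, "Apis mellifera")
    print(group_to_csv(group))
    print(mappable_points(group))
```

Records without coordinates stay in the CSV export but are dropped from the map points, which is one plausible reading of "points with coordinates".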
Building the iDigBio Cloud
• Cloud-based strategy
– Providing useful services/APIs (programmatic and web-based Application
Programming Interface)
– Federated scalable object storage and information processing
– Digitization-oriented virtual appliances
– Reliance on standards, proven solutions and sustainable software
• Continuous consultation with stakeholders
– Surveys, working groups, workshops, person-to-person
What Makes iDigBio Unique?
• Ingest all contributed data with emphasis on GUIDs, not only a restricted set of
selected data elements
• Maintain persistent datasets and versioning, allowing new and edited records to
be uploaded as needed
• Ingest textual specimen records, associated still images, video, audio, and other
media
• Ingest linked documents and associated literature, including field notes, ledgers,
monographs, related specimen collections, etc.
• Provide virtual annotation capabilities and track annotations back to the
originating collection (provider)
• Facilitate sharing and integration of data relevant to biodiversity research
• Provide computational services for biodiversity research
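
A minimal sketch of what "persistent datasets and versioning" keyed on GUIDs could look like, assuming an in-memory store. The class name `VersionedStore` and its methods are hypothetical, chosen for illustration; the slides do not describe how iDigBio implements this.

```python
import uuid


class VersionedStore:
    """Toy store: every GUID maps to an append-only list of record versions."""

    def __init__(self):
        self._history = {}  # guid -> list of record dicts, oldest first

    def ingest(self, record, guid=None):
        """Store a new record, or a new version of an existing one.

        If no GUID is supplied, one is minted (assumption: a random UUID).
        Earlier versions are never overwritten.
        """
        if guid is None:
            guid = str(uuid.uuid4())
        self._history.setdefault(guid, []).append(dict(record))
        return guid

    def latest(self, guid):
        """Return the most recently ingested version for this GUID."""
        return self._history[guid][-1]

    def version(self, guid, number):
        """Return a specific version, counting from 1 (the first upload)."""
        return self._history[guid][number - 1]

    def version_count(self, guid):
        """Return how many versions have been ingested for this GUID."""
        return len(self._history[guid])


if __name__ == "__main__":
    store = VersionedStore()
    guid = store.ingest({"taxon": "Apis", "collector": "unknown"})
    store.ingest({"taxon": "Apis mellifera", "collector": "unknown"}, guid=guid)
    print(store.version_count(guid), store.latest(guid))
```

Keeping the full history per GUID means an edited record can be re-uploaded at any time without losing what was published before.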
https://www.idigbio.org/content/idigbios-train-trainers-georeferencing-update-ii-out-dark-ages
• Join an existing Working Group
• Propose a working group
• Apply to become a visiting scholar
• Apply to attend a workshop
• Propose a workshop
• Host/Co-host workshops at your next professional meeting
• Contribute your questions to our forums
• Encourage your students to contribute to our forums
• Encourage students to apply to our workshops
• Encourage students to get involved in a working group
https://www.idigbio.org/wiki/index.php/IDigBio_Working_Groups
Recent and Ongoing Activities
• Assessment of common and effective practices (paper in ZooKeys)
• MISC - Minimum information for scientific collections working group
• Collaborative georeferencing pilot project at Godfrey Herbarium
• DROID - Digitization workflows working groups
• Public Participation in Digitization of Biodiversity Specimens workshop
• GWG - Georeferencing working group & train-the-trainers workshop
• AOCR - OCR/natural language processing working group
• Linked data workshop
• Series of digitization training workshops
• Call for appliances – tool integration
• Call for working groups
• Cyberinfrastructure working group
• Specimen data portal v0 implementation
• Server hosting
o Jan 2013: Field Notes (Smithsonian)
o Feb 12: iDigBio Augmenting OCR Working Group, 2013 iSchools iConference Panel
o Feb 13-14: iDigBio Augmenting OCR Hackathon, BRIT, Ft Worth TX
o March 6-7: Wet Collections Digitization Workshop, KU
o April 10-13: ?Workshop at ASB
o April 2013: Entomology Digitization Workshop, Field Museum
o Summer 2013: iDigBio GWG Georeferencing Train the Trainers II
o June 17-22: Society for the Preservation of Natural History Collections
o July 24-Aug 1: Botany 2013
o Sept 2013: Paleo Digitization Workshop (place tbd)
This material is based upon work supported by the National Science Foundation under
Cooperative Agreement EF-1115210. Any opinions, findings, and conclusions or
recommendations expressed in this material are those of the author(s) and do not necessarily
reflect the views of the NSF.

Editor's Notes

  • #4: Network Integrated Collections Alliance
  • #11: services / tools – near future
  • #12: iDigBio uses a cloud-based strategy for building its cyberinfrastructure. The strategy is to build federated, scalable object storage and information processing that can be accessed through programmatic and web-based interfaces, always relying on standards, proven solutions and sustainable software that can be packaged in virtual appliances that facilitate the digitization workflow. This process is transparent, with continuous consultation with stakeholders through surveys, working groups, etc. The picture abstracts the main components being built into the iDigBio cyberinfrastructure; suffice it to say that it is being built with the strategy just described in mind.
    If the presenter is asked about the details of the figure, it shows:
    a) The physical hardware at the bottom layer (“Cloud Nodes”).
    b) The storage of both text and binary objects (“Bulk Text Storage” and “Binary Object Storage”) on top of the physical nodes.
    c) Various types of customized indexing that accelerate searches of the data (“Full-text and Faceted Indexing”, “Geo-spatial and Range Indexing”).
    d) The different types of Application Programming Interfaces (APIs) that will be exposed to the public (“Search API”, “Metadata API”, “Object API”). The APIs allow other applications, especially those of other stakeholders, to ingest and retrieve the data in iDigBio programmatically.
    e) The “iDigBio Specimen Portal”, which provides a web-based user interface that also makes use of the exposed APIs.
    f) Different types of “Appliances” and “Third Party Consumers”, which also interact with the exposed APIs to facilitate the digitization process by packaging existing biodiversity tools.
    g) All components will need to be integrated and managed securely (authentication, authorization, access control, auditing).
    The names in smaller fonts inside each box (e.g., Apache, Solr) are particular software packages considered for delivering the expected capability.
  • #15: How to get involved (join working groups, attend workshops, propose needed working groups / workshops, host or co-host workshops at existing meetings, propose tool integration projects, post your questions to our forums; put the collective knowledge of our staff and working groups to good use)