Sharing of biodiversity data with Web Services: Demonstration of BioCASE
TOPICS
  • Biodiversity data
  • Data standards
  • Data exchange tools
  • The BioCASE data provider software
  • Decentralized data network
Biodiversity collections data
Different kinds of biodiversity collection data describe very similar data objects:
  • Preserved reference collections, such as those in museums and herbaria.
  • Living collections, such as botanical and zoological gardens, aquaria, seed banks, microbial strain cultures and tissue collections.
  • Data collections from surveys of objects in the field, such as observations.
These collections have most of their attributes in common, although the terminology used to describe them may differ substantially. [http://www.bgbm.org/TDWG/CODATA/ABCD-Evolution.htm]
Germplasm data, seed genebanks
Germplasm genebanks are biodiversity collections.
  • Collection level data: metadata about genebank institutes and the germplasm collections they hold.
  • Unit level data: for germplasm collections, the units are the accessions. Genebank accessions have most of the same properties and attributes as other biodiversity specimens.
Data Standards
Crop Descriptors
  • The IPGRI crop descriptors (as well as those of other networks) are developed to meet the specific needs of individual crops.
  • The Multi-crop Passport Descriptors (MCPD) list is designed to be compatible with the IPGRI crop-specific descriptor lists and the FAO World Information and Early Warning System (WIEWS).
  • The MCPD descriptor list is compatible with ABCD (2.06).
Taxonomic Databases Working Group (TDWG): standards development and maintenance
  • Darwin Core 2 - "Element definitions designed to support the sharing and integration of primary biodiversity data". [http://darwincore.calacademy.org/]
  • Access to Biological Collection Data (ABCD) 2.06 - "An evolving comprehensive standard for the access to and exchange of data about specimens and observations (a.k.a. primary biodiversity data)". [http://www.bgbm.org/TDWG/CODATA/Schema/]
ABCD: Access to Biological Collection Data
  • ABCD is a common data specification for data on biological specimens and observations (including plant genetic resources held in seed banks).
  • The design goal is to be both comprehensive and general (about 1200 elements).
  • Development of ABCD started after the 2000 meeting of the TDWG.
  • ABCD was developed with support from TDWG/CODATA, ENHSIN, BioCASE, and GBIF.
  • The MCPD descriptor list is now completely mapped to, and compatible with, ABCD 2.06. [http://www.bgbm.org/TDWG/CODATA/Schema/]
PGR (plant genetic resources) sub-unit of ABCD
Generation Challenge Program: GCP_Passport_1.03
In the context of the Generation Challenge Program (GCP), the GCP Passport data exchange schema was developed. Similar XML schemas are under development for Phenotype (trait data) and Genotype.
Demo Data Portal A demo data portal was developed, providing live access to selected BioCASE data providers. [http://geifir.ngb.se/abcdproto/default.jsp]
Create your own BioCASE data schema
  • Create an XML schema (xsd file) of your data model and copy the schema online (http://...).
  • Create a Concept Mapping Configuration (CMF) file from the XML schema: [http://ww3.bgbm.org/biocase/utilities/process_schema.html] (or use your own BioCASE installation, ... /utilities/process_schema.html).
  • Save the resulting XML (the CMF file) into the cmf folder of your BioCASE installation to make it available for local mapping: .../biocase/configuration/templates/cmf/cmf_your-preferred-file-name.xml
Visit [http://ww3.bgbm.org/bps2/GenerateCmFiles] for more info!
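As a rough illustration of the first step, a minimal XML Schema for a very small data model might look like the sketch below. The element names (Accession, AccessionNumber, Genus, Species) and the target namespace are hypothetical and not taken from any BioCASE or ABCD schema; the Python helper only parses the schema text and lists the simple elements it declares.

```python
import xml.etree.ElementTree as ET

# Hypothetical minimal data model: one accession with three text fields.
# Element names and namespace are invented for illustration only.
MINIMAL_XSD = """<?xml version="1.0" encoding="UTF-8"?>
<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema"
           targetNamespace="http://example.org/my-genebank"
           elementFormDefault="qualified">
  <xs:element name="Accession">
    <xs:complexType>
      <xs:sequence>
        <xs:element name="AccessionNumber" type="xs:string"/>
        <xs:element name="Genus" type="xs:string"/>
        <xs:element name="Species" type="xs:string" minOccurs="0"/>
      </xs:sequence>
    </xs:complexType>
  </xs:element>
</xs:schema>
"""

XS = "{http://www.w3.org/2001/XMLSchema}"


def list_leaf_elements(xsd_text):
    """Return the names of elements declared with a simple type attribute."""
    root = ET.fromstring(xsd_text.encode("utf-8"))
    return [e.get("name") for e in root.iter(XS + "element") if e.get("type")]


if __name__ == "__main__":
    print(list_leaf_elements(MINIMAL_XSD))
```

A real schema would be much richer; the point is only that the xsd file is an ordinary XML document describing the elements that the CMF generation step later turns into mappable concepts.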
Biodiversity informatics data exchange tools
Data Provider Software
A distributed network of data providers retrieves structured data from multiple, distributed, heterogeneous databases across the Internet.
  • DiGIR, Distributed Generic Information Retrieval. [http://digir.net]
  • BioCASE, the Biological Collection Access Service for Europe. [http://www.biocase.org/]
Protocol integration - TAPIR
  • There is a need to integrate the current protocols in use by different biodiversity informatics community networks.
  • During the TDWG meeting in 2004, the unified protocol was presented and named TAPIR, the TDWG Access Protocol for Information Retrieval.
  • New BioCASE and DiGIR software will implement the TAPIR protocol.
  • Will TAPIR also help us to integrate GBIF with the BioMOBY community? [http://ww3.bgbm.org/tapir]
BioMOBY
  • BioMOBY is an international research project on methodologies for biological data representation, distribution, and discovery.
  • BioMOBY has been chosen as the web service framework for the Generation Challenge Program. [http://www.biomoby.org/]
  • Work is in progress to develop BioMOBY and BioCASE interoperability.
BioCASE data provider software
BioCASE: Biological Collection Access Service for Europe [http://www.biocase.org/]
BioCASE: Biological Collection Access Service for Europe
  • BioCASE establishes unified, web-based access to biological collections in Europe while leaving control of the information with the collection holders.
  • ABCD is the main data definition used by BioCASE.
  • The software is designed generically, to handle any schema and to connect to any SQL-capable database.
  • BioCASE provides full access to its registry for GBIF. Being a BioCASE provider thus means being a GBIF provider. [http://www.biocase.org/]
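The idea of one query being answered by several independent providers can be sketched in a few lines of Python. This is only an illustration under stated assumptions: each provider is represented by a stand-in function rather than a real DiGIR or BioCASE endpoint, and the provider names and record fields are hypothetical.

```python
def query_providers(providers, genus):
    """Send the same query to every provider and merge the answers.

    providers: dict mapping a provider name to a callable that takes a
    genus and returns a list of record dicts (a stand-in for a web
    service call). Returns (records, failures).
    """
    records, failures = [], []
    for name, fetch in providers.items():
        try:
            for record in fetch(genus):
                # Tag each record with its origin so ownership stays visible.
                records.append({"provider": name, **record})
        except Exception as exc:
            # A slow or offline provider should not break the whole query.
            failures.append((name, str(exc)))
    return records, failures


if __name__ == "__main__":
    def offline(genus):
        raise TimeoutError("no response")

    demo = {
        "genebank_a": lambda g: [{"genus": g, "accession": "A-1"}],
        "genebank_b": lambda g: [{"genus": g, "accession": "B-7"}],
        "genebank_c": offline,
    }
    print(query_providers(demo, "Hordeum"))
```

Collecting failures separately, rather than raising, reflects the practical situation in a distributed network where some databases are temporarily unreachable.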
BioCASE [http://www.biocase.org/]
  • BioCASE runs on MS Windows, Mac OS X, Linux, BSD, Solaris...
  • BioCASE works with many different databases: PostgreSQL, MySQL, Oracle, MS Access, MS SQL Server...
  • BioCASE works with UNICODE: ضاإطقكغب   שּׁשׁﭻﭗﭼﱠ אָבּדּוּ
  • BioCASE is open source.
  • BioCASE is developed in the Python programming language.
  • CVS
Distributed BioCASE network
BioCASE protocol stack 
BioCASE Provider Software v 2.3.1
Required configuration:
  • Web server: any CGI-compliant web server (Apache, IIS, etc.).
  • Database: major databases are supported, including MySQL, Oracle, SQLServer, Sybase, Access and PostgreSQL. Theoretically, any database with a Python library should work.
  • Python: BioCASE is developed in the Python programming language. Install version 2.3 or later.
[http://ww3.bgbm.org/bps2/DocumentationToc] [http://www.biocase.org/products/provider_software/index.shtml]
BioCASE installation
  • Download the provider software and unzip the archive file [provider_software_2.3.1.tar.gz]. For example, uncompress it into [C:\biocase\].
  • Configure your web server to publish the www folder. Example: [C:\biocase\] to be accessible through [http://localhost/biocase/].
  • Download and install the latest Python software. [http://www.python.org/download/]
  • Execute the [C:\biocase\setup.py] script. For a UNIX-like system:
      %> cd biocase
      %> python setup.py
  • Test your installation. [http://localhost/biocase]
[http://ww3.bgbm.org/bps2/Installation]
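The unpack-and-setup steps can also be scripted. The sketch below is an assumption-laden illustration, not part of the BioCASE distribution: it uses a current Python 3 interpreter only as a driver, and it assumes that setup.py ends up directly inside the target folder after extraction, as the C:\biocase\setup.py example suggests. The helper name unpack_and_setup is hypothetical.

```python
import subprocess
import sys
import tarfile
from pathlib import Path


def unpack_and_setup(archive, target_dir, run=False):
    """Unpack a .tar.gz archive into target_dir and locate setup.py.

    With run=True the setup script is executed with the current Python
    interpreter. Assumes setup.py sits at the top level of target_dir
    after extraction (an assumption about the archive layout).
    """
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive, "r:gz") as tar:
        tar.extractall(target)
    setup_script = target / "setup.py"
    if run:
        # Equivalent to: cd <target>; python setup.py
        subprocess.run([sys.executable, str(setup_script)], cwd=target, check=True)
    return setup_script


if __name__ == "__main__":
    script = unpack_and_setup("provider_software_2.3.1.tar.gz", "biocase")
    print("setup script found:", script.exists())
```

Only unpack archives obtained from a source you trust, since extraction writes whatever paths the archive contains.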
BioCASE: install third-party software [http://localhost/biocase/utilities/testlibs.cgi]
  • Follow the links from the Library test page.
  • After a successful installation, the "installed version" column displays the version that was installed.
To update the BioCASE software:
  • Download the new release.
  • Unzip it to a temporary folder.
  • Execute setup.py and follow the instructions.
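The kind of check a library test page performs can be approximated with the standard library. The following is a minimal sketch, not the actual testlibs.cgi code; the module names passed in are examples only, since the slides do not list which third-party libraries BioCASE needs.

```python
import importlib.util


def check_libraries(module_names):
    """Report, for each top-level module name, whether it can be imported."""
    return {
        name: importlib.util.find_spec(name) is not None
        for name in module_names
    }


if __name__ == "__main__":
    # Example names only: common Python database driver modules.
    for name, found in check_libraries(["sqlite3", "psycopg2", "MySQLdb"]).items():
        print(f"{name:10s} {'installed' if found else 'missing'}")
```

find_spec only locates the module without importing it, so the check is cheap and has no side effects.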
BioCASE configuration
After successful installation you will need to configure your data provider. Follow the instructions in the BioCASE documentation to configure:
  • Data sources. If you provide several datasets or several databases, each is configured as an individual data source.
  • Database connection, so that the software can access your database.
  • Database structure. Define the relevant tables, the primary keys and the foreign keys.
  • Data model. Map your database model to the standard represented by the XML Schemas you choose.
[http://ww3.bgbm.org/bps2/Configuration]
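The four configuration parts fit together roughly as in the sketch below. It is a conceptual model only: the class names, the connection string and the "table.column" mapping notation are hypothetical and do not reproduce the configuration files that BioCASE itself writes.

```python
from dataclasses import dataclass, field


@dataclass
class Table:
    name: str
    primary_key: str
    columns: list
    foreign_keys: dict = field(default_factory=dict)  # column -> referenced table


@dataclass
class DataSource:
    name: str      # one data source per dataset or database
    dsn: str       # database connection string (placeholder)
    tables: list   # database structure
    mapping: dict  # schema concept path -> "table.column"


def validate(source):
    """Return a list of problems found in a data source description."""
    problems = []
    by_name = {t.name: t for t in source.tables}
    for t in source.tables:
        if t.primary_key not in t.columns:
            problems.append(f"{t.name}: primary key {t.primary_key!r} is not a column")
        for col, ref in t.foreign_keys.items():
            if ref not in by_name:
                problems.append(f"{t.name}.{col}: unknown table {ref!r}")
    for concept, target in source.mapping.items():
        table, _, column = target.partition(".")
        if table not in by_name or column not in by_name[table].columns:
            problems.append(f"{concept}: unknown column {target!r}")
    return problems


if __name__ == "__main__":
    accession = Table("accession", "id", ["id", "taxon_id"], {"taxon_id": "taxon"})
    taxon = Table("taxon", "id", ["id", "genus"])
    source = DataSource("demo", "dbname=demo", [accession, taxon],
                        {"singer:/taxonomy/genus": "taxon.genus"})
    print(validate(source))
```

Checking the mapping against the declared tables before publishing catches the most common mistake, a concept mapped to a column that does not exist.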
Example of a service request
All exchanged data is formatted with XML tags.
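Since the request shown on the original slide is not part of this transcript, the sketch below only illustrates the general idea of an XML-formatted search request. The element names (request, header, search, filter, like) are illustrative and have not been checked against the BioCASE protocol schema; consult the protocol documentation for the real structure.

```python
import xml.etree.ElementTree as ET


def build_search_request(concept_path, pattern, limit=10):
    """Build an illustrative XML search request as a string.

    Element and attribute names are assumptions made for this sketch,
    not the verified BioCASE protocol vocabulary.
    """
    request = ET.Element("request")
    header = ET.SubElement(request, "header")
    ET.SubElement(header, "type").text = "search"
    search = ET.SubElement(request, "search")
    # Paging: ask for at most `limit` records, starting at the first one.
    ET.SubElement(search, "responseFormat", start="0", limit=str(limit))
    flt = ET.SubElement(search, "filter")
    like = ET.SubElement(flt, "like", path=concept_path)
    like.text = pattern
    return ET.tostring(request, encoding="unicode")


if __name__ == "__main__":
    print(build_search_request("singer:/taxonomy/genus", "Hordeum*"))
```

The request names a concept path from the shared schema rather than a database column, which is what lets the same query be sent unchanged to providers with different database designs.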
Example of a service response
TAPIR
TAPIR will offer you more advanced request formats.
TAPIR service request
TAPIR will offer you more advanced request formats.
TAPIR service response
  • singer:/sourcename
  • singer:/taxonomy/genus
  • singer:/taxonomy/species
  • singer:/taxonomy/subspecies
  • singer:/holding/ID
  • singer:/holding/name
  • singer:/origin/collecting/countrysource
  • singer:/origin/collecting/countrysourceID
  • singer:/status/biologicalstatus
  • singer:/status/biologicalstatusID
  • ...
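How a provider might fill such concept paths from a database row can be sketched as follows. The concept paths are the ones listed on the slide; the column names on the right-hand side are hypothetical, and only a subset of the paths is mapped to keep the example short.

```python
# Concept path (from the slide) -> local database column (hypothetical names).
CONCEPT_TO_COLUMN = {
    "singer:/sourcename": "source_name",
    "singer:/taxonomy/genus": "genus",
    "singer:/taxonomy/species": "species",
    "singer:/taxonomy/subspecies": "subspecies",
    "singer:/holding/ID": "holding_id",
    "singer:/holding/name": "holding_name",
}


def to_concepts(row):
    """Translate one database row (a dict) into concept path/value pairs.

    Columns that are missing or empty (None) are left out of the response.
    """
    return {
        concept: row[column]
        for concept, column in CONCEPT_TO_COLUMN.items()
        if row.get(column) is not None
    }


if __name__ == "__main__":
    row = {"source_name": "NGB", "genus": "Hordeum",
           "species": "vulgare", "subspecies": None}
    for concept, value in to_concepts(row).items():
        print(concept, "=", value)
```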
Decentralized data network with web services
Data warehouse model (Slide by Samy Gaiji, IPGRI)
Decentralized model (Slide by Samy Gaiji, IPGRI)
Data flow from genebanks to EURISCO and ECCDBs
Decentralized model
Genebanks on BioCASE
The BioCASE data provider software was implemented at almost all the CGIAR germplasm centers during the autumn of 2005. Several other genebanks have installed the GBIF web service technology: Nordic Gene Bank, IPK Gatersleben, IHAR (DiGIR), USDA GRIN and CGN, with more to follow soon.
Germplasm data indexing tools
We are building data indexing methodologies for access to germplasm data with BioCASE. The plan is to use them to build a Germplasm Clearing House Mechanism. Development is in cooperation with GBIF, which itself indexes basic biodiversity data using a similar approach. [http://chm.grinfo.net/index.php]
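The indexing idea can be illustrated with a very small sketch: records harvested from the providers are reduced to a lookup table that says which provider holds material of which genus. The record layout and provider names are hypothetical, and a real index would cover many more fields.

```python
from collections import defaultdict


def build_index(harvested):
    """Build a genus -> providers lookup from harvested records.

    harvested: iterable of (provider_name, record_dict) pairs, where each
    record may carry a "genus" key. Genus names are normalised to lower
    case so that spelling variants in capitalisation collapse.
    """
    index = defaultdict(set)
    for provider, record in harvested:
        genus = record.get("genus")
        if genus:
            index[genus.strip().lower()].add(provider)
    return {genus: sorted(providers) for genus, providers in index.items()}


if __name__ == "__main__":
    harvested = [
        ("genebank_a", {"genus": "Hordeum"}),
        ("genebank_b", {"genus": "hordeum "}),
        ("genebank_b", {"genus": "Vicia"}),
    ]
    print(build_index(harvested))
```

A search portal can answer from such an index immediately and contact the individual providers only when full records are needed.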
BioCASE and germplasm data [http://chm.grinfo.net/index.php?app=data_providers]
Works in progress
  • Global Unique Identifiers, GUID (LSID, Life Science Identifiers) [http://lsid.sourceforge.net/]
  • Biodiversity informatics workflow tools (BioMOBY and Taverna, Kepler and SEEK...)
  • Germplasm Clearing House Mechanism [http://chm.grinfo.net/]
  • TAPIR
Thank you for listening!

BioCASE web services for germplasm data sets, at FAO, Rome (2006)


Editor's Notes

  • #4: * Text formulation source [http://www.bgbm.org/TDWG/CODATA/ABCD-Evolution.htm] wording above is modified. * Photo (top): Beetle collection in Benin, West Africa (March 24 2004). Photographer Dag Endresen. [http://r142b.ngb.se/ngb/2003-03_Cotonou_Benin/index.php?offset=85&size=medium&stp=1]
  • #5: Photo (top) * Seed storage in Benin, West Africa (March 24 2004). Photographer Dag Endresen. [http://r142b.ngb.se/ngb/2003-03_Cotonou_Benin/index.php?offset=108&size=medium&stp=1] Photo (below)* VIR seed collection. St. Petersburg. Photographer Eva Thörn (NGB Picture Archive, image 001319).
  • #6: Photo: Field been from Boreal, accession NGB11518, 2005-03-05, Dag Endresen [http://r142b.ngb.se/ngb/2005-03--the-making-of-seeds-pictures/index.php?offset=19&size=medium&stp=1]
  • #7: * IPGRI Descriptors lists [http://www.ipgri.cgiar.org/system/page.asp?frame=programmes/inibap/home.htm] (119 descriptor lists, 2005) * MCPD [http://www.ipgri.cgiar.org/publications/pdf/333.pdf] * UPOV - International Union for the Protection of New Varieties of Plants (UPOV) [ http://www.upov.int/] * UPOV - The International Union for the Protection of New Varieties of Plants or UPOV (French: Union internationale pour la protection des obtentions végétales) is an intergovernmental organization with headquarters in Geneva, Switzerland. [http://en.wikipedia.org/wiki/UPOV] * COMECON - The Council for Mutual Economic Assistance (COMECON / Comecon / CMEA / CEMA), 1949 – 1991, was an economic organisation of communist states and a kind of Eastern European equivalent to the European Economic Community. The military counterpart to the Comecon was the Warsaw Pact. [http://en.wikipedia.org/wiki/Comecon] * Multi-crop Passport Descriptors (MCPD) [http://www.ipgri.cgiar.org/publications/pdf/124.pdf] F AO (Food and Agricultural Organization of the United Nations) - IPGRI (International Plant Genetic Resources Institute). This is a revised version (December 2001) of the 1997 MCPD List. * FAO World Information and Early WarningSystem ( WIEWS) [http://apps3.fao.org/wiews/] * 19 Plant Uses Categories based on categories developed for the Working Group on Taxonomic Databases (TDWG) (Cook, Frances E.M., 1995. Economic Botany: Data Collection Standard. Royal Botanic Gardens Kew). [ http://www.ecpgr.cgiar.org/epgris/Training/MCPD-1998.doc] * The mapping of MCPD to ABCD was started in 2004 by Helmut Knüpffer and Walter Berendsohn, and continued by Javier de la Torre and Dag Terje Filip Endresen in 2005. [ http://ww3.bgbm.org/MCDPH] [ http://www.bgbm.org/TDWG/CODATA/Schema/Mappings/EURISCO-2-ABCD.pdf ]
  • #8:
    * Illustration: Corn earworm pupae that will be used to produce control parasites for release in the field. Photo by Scott Bauer. [http://www.ars.usda.gov/is/graphics/photos/k5554-2.htm]
    * UBIF is an attempt to define a common foundation for several TDWG/GBIF standards such as SDD (see SDD WIKI), ABCD (see ABCD content schema homepage), or TaxonConceptNames (see Taxonomic Concept Transfer Schema WIKI).
    * Unified Biosciences Information Framework (UBIF): XML schema for data exchange and integration across knowledge domains. The schema has been designed for biological data, but is applicable to other knowledge areas as well. It is based on work of the TDWG SDD and ABCD subgroups and is currently jointly authored by the SDD, ABCD, and TaxonName subgroups and by GBIF (Global Biodiversity Information Facility). The framework may be used without changes for new schemata; no registration is necessary.
    * Complex types are part of the UBIF infrastructure (TDWG common complex types shared by several schemas: ABCD, SDD, TCS, Linnean Core, etc.)
  • #9: * The mapping of MCPD to ABCD was started in 2004 by Helmut Knüpffer and Walter Berendsohn, and continued by Javier de la Torre and Dag Terje Filip Endresen in 2005. [http://ww3.bgbm.org/MCDPH] [http://www.bgbm.org/TDWG/CODATA/Schema/Mappings/EURISCO-2-ABCD.pdf]
  • #11: GCP_Passport v 1.03 [http://tor.ngb.se/dev/temp/gcp_passport_01_03.xsd]
  • #12: * Demo Data Portal [http://geifir.ngb.se/abcdproto/default.jsp] The work on the demo portal has been replaced by routines to harvest and index remote data. Live remote access proved too slow and unreliable. See the Germplasm Clearing House Mechanism for more info [http://chm.grinfo.net/].
  • #16: * Illustration: Tapir - © 1999-2005 www.barrysclipart.com (Licence “Feel free to use Barrys Clipart Server content in personal/ non profit projects to create webpages…”) [http://www.barrysclipart.com/barrysclipart.com/showphoto.php?photo=13376&papass=&sort=1&thecat=149] Not Quality counts: Chemist Gary List checks soybeans. Photo by Keith Weller. [ http://www.ars.usda.gov/is/graphics/photos/k5256-2.htm]
  • #17: Photo: PICT0173.jpg. Sub-section from Whale Safari to Kaikoura, New Zealand. Photo by Dag Terje Filip Endresen [http://r142b.ngb.se/ngb/2004-10-New-Zealand-Australia/index.php?offset=79&size=medium&stp=1]
  • #18: [http://www.biocase.org/index.shtml]
  • #19: BioCASE development is coordinated by the Botanischer Garten und Botanisches Museum Berlin-Dahlem – BGBM.
  • #21: http://www.biocase.org/whats_biocase/unit_net.shtml
  • #22: http://www.biocase.org/whats_biocase/unit_net.shtml
  • #29: http://tapir.grinfo.net/queryforms/qf_tapir.cgi?url=http://tapir.grinfo.net//pywrapper.cgi?dsa=SINGER
  • #30: http://tapir.grinfo.net/queryforms/qf_tapir.cgi?url=http://tapir.grinfo.net//pywrapper.cgi?dsa=SINGER
    <?xml version='1.0' encoding='UTF-8'?>
    <request>
      <header />
      <inventory count='true' start='0' limit='40' xmlns:singer='http://singer.grinfo.net/schema/SINGER-1.0'>
        <concepts>
          <concept path='singer:/sourcename' />
          <concept path='singer:/taxonomy/genus' />
          <concept path='singer:/taxonomy/species' />
          <concept path='singer:/taxonomy/subspecies' />
          <concept path='singer:/holding/ID' />
          <concept path='singer:/holding/name' />
          <concept path='singer:/origin/collecting/countrysource' />
          <concept path='singer:/origin/collecting/countrysourceID' />
          <concept path='singer:/status/biologicalstatus' />
          <concept path='singer:/status/biologicalstatusID' />
        </concepts>
        <filter>
          <like>
            <concept path='singer:/taxonomy/genus' />
            <literal value='cice*' />
          </like>
        </filter>
      </inventory>
    </request>
  • #31: http://tapir.grinfo.net/queryforms/qf_tapir.cgi?url=http://tapir.grinfo.net//pywrapper.cgi?dsa=SINGER http://apps3.fao.org/wiews/wiews.jsp
  • #33: Slide by Samy Gaiji, from the presentation “Information Networking - Challenges for the Plant Genetic Resources Communities”, 2004.
  • #34: Slide by Samy Gaiji, from the presentation “Information Networking - Challenges for the Plant Genetic Resources Communities”, 2004.
  • #36: http://apps3.fao.org/wiews/wiews.jsp
  • #37: Photo (top) IRRI genebank. Los Banos, Philippines [ http://www.cgiar.org/images/irrigenebank1.jpg] Photo (below) CIP genebank. Lima, Peru [http://www.cgiar.org/images/cipgenebank.jpg]
  • #39: http://chm.grinfo.net/index.php?app=data_providers
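
The inventory request in the note for slide #30 can also be assembled programmatically. The following is a minimal sketch in Python, assuming only the element and attribute names visible in that note; the function name `build_inventory_request` and its parameters are hypothetical, the namespace string is copied as it appears in the note, and the output has not been validated against any TAPIR schema. Sending the request to a provider is not shown.

```python
import xml.etree.ElementTree as ET

# Namespace string copied as it appears in the slide #30 note (assumption:
# the provider expects exactly this value for the "singer" prefix).
SINGER_NS = "http://singer.grinfo.net/schema/SINGER-1.0"


def build_inventory_request(concept_paths, filter_path, pattern,
                            start=0, limit=40, namespace=SINGER_NS):
    """Return an inventory request shaped like the one in the slide #30 note.

    concept_paths: concept paths to list, e.g. 'singer:/taxonomy/genus'
    filter_path:   concept path the <like> filter is applied to
    pattern:       wildcard pattern for the filter, e.g. 'cice*'
    """
    request = ET.Element("request")
    ET.SubElement(request, "header")

    # The prefix declaration is written as a plain attribute so that the
    # 'singer:' prefix used inside the path values has a declared namespace.
    inventory = ET.SubElement(request, "inventory", {
        "count": "true",
        "start": str(start),
        "limit": str(limit),
        "xmlns:singer": namespace,
    })

    # One <concept> element per requested concept path.
    concepts = ET.SubElement(inventory, "concepts")
    for path in concept_paths:
        ET.SubElement(concepts, "concept", {"path": path})

    # A single <like> filter: concept path plus wildcard literal.
    like = ET.SubElement(ET.SubElement(inventory, "filter"), "like")
    ET.SubElement(like, "concept", {"path": filter_path})
    ET.SubElement(like, "literal", {"value": pattern})

    return ET.tostring(request, encoding="unicode")


if __name__ == "__main__":
    print(build_inventory_request(
        ["singer:/sourcename", "singer:/taxonomy/genus",
         "singer:/taxonomy/species"],
        filter_path="singer:/taxonomy/genus",
        pattern="cice*",
    ))
```

Keeping the concept paths as plain strings mirrors the note, where the `singer:` prefix appears inside attribute values and not in element names.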