Opening and Integration of CASDD and 
Germplasm Data to AGRIS 
Prof. Xuefu Zhang & Dr. Guojian Xian 
Agricultural Information Institute of CAAS 
Research Data Alliance Fourth Plenary Meeting, 22-24 September, 2014, Amsterdam
Contents
• Open CASDD as RESTful APIs
• Open Germplasm as RESTful APIs
• Integration and Extension to AGRIS
• Fruitful Results
Main Materials 
• Chinese Agricultural Sci-tech Documents Database (CASDD) 
– 440,113 records 
• CGRIS Germplasm Data 
• AGROVOC 
– agrovoc_2013-12-17_core.rdf 
• Chinese Agricultural Thesaurus (CAT)
• KOS Mapping Results: 
– AGROVOC_CAT.nt 
• AGRIS 2.0 
– (Latest version: 20140427) 
About CASDD
• The Chinese Agricultural Sci-tech Documents Database
(CASDD) is an agricultural bibliographic/abstracts database
developed by CAAS. Among such databases in China, it has the
largest number of records and the longest time span of documents.
• It covers over 1,000 agricultural academic journals and
other materials, with over 6 million records in the fields of
agronomy, horticulture, plant protection, soil sciences,
animal husbandry, veterinary science, agricultural engineering,
agricultural products processing, agricultural economics, etc.
• It is the most comprehensive, reliable and accessible
source of agricultural science and technology information
from research institutions, education and related departments.
Refining and Analyzing CASDD
Tagging CAT and AGROVOC concepts to CASDD
[Architecture diagram. Components: CASDD; CAT; AGROVOC RDF Core; mapping of CAT and AGROVOC; Virtuoso triple store; Java application (MMseg4J/IKAnalyzer); Solr 4.7 with SQE plugin; CASDD index. Labeled flows: Tagging (URI, prefLabel); SPARQL query; Indexing; Write & Read.]
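The tagging step can be illustrated with a minimal sketch. It assumes simple whole-word matching of preferred labels against English text, which stands in for the MMseg4J/IKAnalyzer segmentation named on the slide. The `Concept` class, the `tag_record` function and the example.org URIs are hypothetical placeholders, not real AGROVOC or CAT identifiers.

```python
import re
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Concept:
    uri: str         # concept URI (placeholder values below)
    pref_label: str  # preferred label used for matching


# Hypothetical concepts for illustration only.
CONCEPTS = [
    Concept("http://example.org/agrovoc/barley", "barley"),
    Concept("http://example.org/agrovoc/plant-protection", "plant protection"),
    Concept("http://example.org/cat/soil", "soil"),
]


def tag_record(text: str, concepts: List[Concept]) -> List[Concept]:
    """Return the concepts whose preferred label occurs in the text as whole words."""
    lowered = text.lower()
    tagged = []
    for concept in concepts:
        pattern = r"\b" + re.escape(concept.pref_label.lower()) + r"\b"
        if re.search(pattern, lowered):
            tagged.append(concept)
    return tagged


if __name__ == "__main__":
    for concept in tag_record("Plant protection measures for barley.", CONCEPTS):
        print(concept.uri, concept.pref_label)
```

Each tag carries both the URI and the preferred label, mirroring the "Tagging (URI, prefLabel)" flow in the diagram. Chinese text would need a segmenter instead of the word-boundary pattern used here.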
English Coverage Analysis of CASDD Records
Total Records: 440,113
Field              Records   Percentage
English Title      289,314   65.74%
English Keywords   286,032   64.99%
English Abstract   286,921   65.19%
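The percentages follow directly from the record counts divided by the total of 440,113. A short sketch that reproduces them (the names `coverage_percent` and `ENGLISH_FIELD_COUNTS` are illustrative only):

```python
TOTAL_RECORDS = 440_113

# Record counts per English field, as reported on the slide.
ENGLISH_FIELD_COUNTS = {
    "English Title": 289_314,
    "English Keywords": 286_032,
    "English Abstract": 286_921,
}


def coverage_percent(count: int, total: int) -> float:
    """Share of records that have the field, as a percentage rounded to two decimals."""
    return round(100 * count / total, 2)


if __name__ == "__main__":
    for field, count in ENGLISH_FIELD_COUNTS.items():
        print(f"{field}: {coverage_percent(count, TOTAL_RECORDS)}%")
```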
The CAT Concepts Coverage in CASDD
The AGROVOC Concepts Coverage in CASDD
TermFreq   TermCount   Record Number   Match Ratio
>=3        >=1         400,009         90.89%
>=3        >=2         320,472         72.82%
>=3        >=3         227,481         51.69%
>=3        >=5          83,992         19.08%
>=5        >=3          51,726         11.75%
CASDD RESTful API (Architecture)
[Architecture diagram. Components: CASDD database; Solr 4.7 index (SQE plugin) with CAT + AGROVOC + mapping; Tomcat container (Jersey API) hosting the CASDD RESTful web service (API) endpoint; third-party application container; AGRIS; agINFRA. Labeled flows: Reading Only; Accessing & Linking.]
CASDD RESTful API (Features)
• Aims to provide a lightweight solution for exposing
CASDD records to third-party applications.
• Provides several ways to access the records, such as
queries by keywords, ARN, PublicationDate, AGROVOC
concept URIs, and Chinese Agricultural Thesaurus (CAT)
URIs.
• Results support pagination and sorting.
• Output formats include RDF/XML following the
AGRIS AP standard and plain JSON.
• Authentication and detailed logging for evaluation
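A client call against such an API might look like the sketch below. The slides do not give the endpoint or parameter names, so the base URL and the names `q`, `agrovocUri`, `catUri`, `page`, `pageSize`, `sort` and `format` are all assumptions made for illustration.

```python
from typing import Optional
from urllib.parse import urlencode

# Hypothetical endpoint; the real CASDD API address is not given in the slides.
BASE_URL = "http://example.org/casdd/api/records"


def build_query_url(
    base_url: str,
    *,
    keywords: Optional[str] = None,
    agrovoc_uri: Optional[str] = None,
    cat_uri: Optional[str] = None,
    page: int = 1,
    page_size: int = 10,
    sort: Optional[str] = None,
    fmt: str = "json",
) -> str:
    """Build a query URL with optional filters plus pagination, sorting and format."""
    params = {}
    if keywords:
        params["q"] = keywords
    if agrovoc_uri:
        params["agrovocUri"] = agrovoc_uri
    if cat_uri:
        params["catUri"] = cat_uri
    if sort:
        params["sort"] = sort
    params["page"] = page
    params["pageSize"] = page_size
    params["format"] = fmt  # assumed values: "json" or "rdfxml"
    return base_url + "?" + urlencode(params)


if __name__ == "__main__":
    print(build_query_url(BASE_URL, keywords="barley", page=2))
```

Concept URIs are percent-encoded by `urlencode`, so they can be passed as ordinary query parameters.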
CASDD RESTful API (Samples)
• Browsing records with pagination
• Get records with AGROVOC URI
Germplasm Data of CGRIS
• The CGRIS germplasm database is a central
repository for all types of plant genetic
resources information in China. At present,
CGRIS stores over 4,000 MB of data covering
200 kinds of crops and 410,000 germplasm
accessions.
The Germplasm RESTful API (Architecture)
[Architecture diagram. Components: CGRIS germplasm database; Tomcat container (Jersey API) hosting the CGRIS Germplasm RESTful API; AGROVOC and CAT prefLabel-to-URI mapping; CGRIS website; third-party application. Labeled flows: Reading Only; Accessing & Linking; Redirect to Detail.]
The Germplasm RESTful API (Features)
• Aims to provide a lightweight solution for exposing
CGRIS Germplasm records to third-party
applications.
• Provides several ways to access the records, such
as queries by scientific name, vernacular name,
catalogNumber, AGROVOC concept URI, and Chinese
Agricultural Thesaurus (CAT) URI.
• Output formats include RDF/XML following the
darwincore-germplasm schema and plain JSON.
• Authentication and detailed logging for evaluation
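The lookup options listed above can be sketched with an in-memory record list. This is an illustration only: the `GermplasmRecord` class, the `find_records` function and the sample rows are made up, and the JSON keys borrow the Darwin Core term names `catalogNumber`, `scientificName` and `vernacularName` rather than the API's actual output layout, which the slides do not show.

```python
import json
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class GermplasmRecord:
    catalog_number: str
    scientific_name: str
    vernacular_name: str

    def to_json(self) -> str:
        """Serialize using Darwin Core style key names (assumed layout)."""
        return json.dumps({
            "catalogNumber": self.catalog_number,
            "scientificName": self.scientific_name,
            "vernacularName": self.vernacular_name,
        })


# Made-up sample rows, not real CGRIS accessions.
SAMPLE_RECORDS = [
    GermplasmRecord("DEMO-0001", "Hordeum vulgare", "barley"),
    GermplasmRecord("DEMO-0002", "Oryza sativa", "rice"),
    GermplasmRecord("DEMO-0003", "Hordeum vulgare", "barley"),
]


def find_records(
    records: List[GermplasmRecord],
    *,
    scientific_name: Optional[str] = None,
    vernacular_name: Optional[str] = None,
    catalog_number: Optional[str] = None,
) -> List[GermplasmRecord]:
    """Return records matching every filter that was supplied (case-insensitive)."""
    def matches(record: GermplasmRecord) -> bool:
        checks = [
            (scientific_name, record.scientific_name),
            (vernacular_name, record.vernacular_name),
            (catalog_number, record.catalog_number),
        ]
        return all(want is None or want.lower() == have.lower() for want, have in checks)

    return [record for record in records if matches(record)]


if __name__ == "__main__":
    for record in find_records(SAMPLE_RECORDS, vernacular_name="barley"):
        print(record.to_json())
```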
The Germplasm RESTful API (Samples)
• Get records with vernacular name
• Get records with scientific name
• Get records with AGROVOC URI
The Extended AGRIS in Chinese
[Architecture diagram with three layers: services layer, tools layer, data layer. Labels shown: AGRIS; query; statistics; search result browsing; single record; mashups (CASDD, Germplasm); RESTful API; Read; Java application; custom modules; CASDD new page; CASDD box; Chinese query; Solr 4.7 with SQE plugin; CASDD; Germplasm; CAT; AGROVOC RDF Core; mapping of CAT and AGROVOC; other resources.]
Enhanced Search in Chinese
• Semantic Query Extension
– Solr Query Expander (SQE) 2.0
• Integrating and Linking CASDD API
• Integrating and Linking Germplasm API
• Other Improvements:
– Automatic suggestions for user queries
– Updated the AGRIS AP XML file indexer to Solr 4.7
– Integrated the Bing Cloud Dictionary
Improved and Updated SQE 2.0
• Fully compliant with Solr 4.5.
• Works with SKOS files with the suffixes .rdf (RDF/XML), .n3
(N3), .ttl (Turtle) and .zip (ZIP)
• Supports loading more than one SKOS file at a time
• Supports expansion over customized relationship types,
such as PREF, ALT, HIDDEN, BROADER, NARROWER,
BROADERTRANSITIVE, RELATED.
• Excellent performance with the improved version of
IKAnalyzer2012FF (which supports English phrase analysis
and tagging based on an English dictionary)
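The idea of expanding a query over selected relationship types can be sketched as follows. This is not SQE's implementation or configuration. It is a plain in-memory illustration, and the `THESAURUS` entries, `expand_query` and `to_or_query` are hypothetical.

```python
from typing import Dict, List, Sequence

# Hypothetical thesaurus entry keyed by preferred label; real data would come from SKOS files.
THESAURUS: Dict[str, Dict[str, List[str]]] = {
    "barley": {
        "ALT": ["Hordeum vulgare"],
        "BROADER": ["cereals"],
        "NARROWER": ["malting barley"],
        "RELATED": [],
    },
}


def expand_query(
    term: str,
    thesaurus: Dict[str, Dict[str, List[str]]],
    relations: Sequence[str] = ("ALT",),
) -> List[str]:
    """Return the term followed by labels reachable through the chosen relation types."""
    entry = thesaurus.get(term.lower(), {})
    expanded = [term]
    for relation in relations:
        for label in entry.get(relation, []):
            if label not in expanded:
                expanded.append(label)
    return expanded


def to_or_query(terms: Sequence[str]) -> str:
    """Join the terms as quoted phrases in a boolean OR query string."""
    return " OR ".join(f'"{t}"' for t in terms)


if __name__ == "__main__":
    terms = expand_query("barley", THESAURUS, relations=("ALT", "BROADER"))
    print(to_or_query(terms))
```

Choosing which relation types to follow trades recall against precision: ALT labels usually keep the meaning, while BROADER or RELATED labels widen the search.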
Semantic Expansion Search with SQE2.0 
Integrating and Linking CASDD
• AGRIS Search Results (CASDD Box)
– The box displays the first five CASDD search
results
– Records include title, author, keywords,
submission date, and abstract.
– Get more related records
– Get more (detail information)
Integrating and Linking CASDD
• Detail information (single record information)
– Title (ZH/EN), Keywords (ZH/EN), Authors,
Submission Date, Abstract (ZH/EN), CAT keywords,
AGROVOC keywords, Journal, ISSN
• More Related Records
– Display more related records
– Browsing records with pagination
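Showing the first five records in a box and then browsing the rest page by page comes down to slicing a result list. A minimal sketch, assuming 1-based page numbers and a page size of five (the function names are illustrative):

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def paginate(items: Sequence[T], page: int, page_size: int = 5) -> List[T]:
    """Return the items on the given 1-based page."""
    if page < 1 or page_size < 1:
        raise ValueError("page and page_size must be positive")
    start = (page - 1) * page_size
    return list(items[start:start + page_size])


def page_count(total: int, page_size: int = 5) -> int:
    """Number of pages needed for `total` items (ceiling division)."""
    return -(-total // page_size)


if __name__ == "__main__":
    results = [f"record-{i}" for i in range(1, 13)]
    print(paginate(results, 1))       # the records shown in the box
    print(page_count(len(results)))   # pages available under "more related records"
```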
Linking CGRIS Germplasm Resources
• Germplasm Mashup
– Information on the first five CGRIS Germplasm records
– get more… (detail information)
• Navigating to CGRIS Website
– CGRIS website
Linking CASDD Records with Box
http://agris.fao.org/agris-search/searchIndex.do?query=barley&x=-430&y=-58
Detail Info of a CASDD Record
More Related Records From CASDD
CGRIS Germplasm Mashup
Thank you for listening!
