SlideShare a Scribd company logo
Open Genomic Data Web Dr Jun Zhao Image Bioinformatics Research Group Department of Zoology University of Oxford 6 August 2009 GMOD Meeting Europe
Web of Data: “ may more accurately be described as  a web of things in the world, described   by data on the Web.”
Linked Data Design Issues
Resource Description Framework (RDF) rdf:type chado:pub fbgn:FBgn11367 so:Gene fbgn:FBgn11367 pubmed:PMID12
Resource Description Framework (RDF) rdf:type chado:pub fbgn:FBgn11367 so:Gene
SPARQL queries PREFIX chado: <http://purl.org/net/chado/schema> PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX xs: <http://www.w3.org/2001/XML_Schema#> SELECT ?flybaseID  WHERE { ?feature rdf:type chado:Feature ; chado:name “schuy”^^xs:string ; chado:uniquename ?flybaseID . } SELECT ?feature.uniquename AS flybaseID FROM feature WHERE feature.name = “schuy” SPARQL SQL
SPARQL protocol GET /query/flybase?query=[URL encoded query] HTTP/1.1   Host: openflydata.org   Accept: application/sparql-results+json   POST /query/flybase HTTP/1.1   Host: openflydata.org   Accept: application/sparql-results+json   Content-Type: application/x-www-form-urlencoded   Content-Length: 456   query=[URL encoded query] HTTP  GET HTTP POST
open interoperable
 
Two Exemplar Applications OpenFlyData.org
Connect TCM with Western Medicine
OpenFlyData: mRNA gene expression study Microarray analysis How much of a given transcript (mRNA) is present in a sample
In a quantitative way
Lack of spatial information  RNA  in situ  hybridization Reveal both spatial and temporal aspects of gene expression during the development
But not quantitative
Barriers for accessing these data Data are scattered at different web sites
Searches have to be repeated, different search interfaces, different use of terminology
Limited (if any) programmatic access to data … hard work to answer questions that span data sources
OpenFlyData.org demonstration Three gene express cross-database search applications Search by gene, gene expression mashup: [ go ]
Search gene expression by gene batch [ go ]
Search gene expression by tissue expression profile [ go ]
System architecture SPARQL endpoint Web browser FlyUI application FlyUI widget HTTP Client side  SPARQL server (SPARQLite, Tomcat, Apache)‏ RDF cache (Jena TDB) ‏ FlyBase BDGP FlyTED FlyAtlas AffyMetrix Server side
Creating RDF from data sources D2RQ mapping FlyBase and BDGP, native relational databases
Conservative mapping, with minimum interpretation OAI2SPARQL Harvesting N3 RDF metadata via the OAI-PMH protocol, built-in support by Eprints
Further from ESWC2008 paper Custom Python program FlyAtlas
Generating N3 from spreadsheet table

More Related Content

PPTX
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
PPTX
Eureka Research Workbench: A Semantic Approach to an Open Source Electroni...
PPTX
SAFE: Policy Aware SPARQL Query Federation Over RDF Data Cubes
PPTX
Linked data 101: Getting Caught in the Semantic Web
PPT
2010 03 Lodoxf Openflydata
ODP
2010 06 ipaw_prv
PPTX
Supporting Dataset Descriptions in the Life Sciences
PPTX
The HCLS Community Profile: Describing Datasets, Versions, and Distributions
Tutorial: Describing Datasets with the Health Care and Life Sciences Communit...
Eureka Research Workbench: A Semantic Approach to an Open Source Electroni...
SAFE: Policy Aware SPARQL Query Federation Over RDF Data Cubes
Linked data 101: Getting Caught in the Semantic Web
2010 03 Lodoxf Openflydata
2010 06 ipaw_prv
Supporting Dataset Descriptions in the Life Sciences
The HCLS Community Profile: Describing Datasets, Versions, and Distributions

What's hot (19)

PPT
2009 Dils Flyweb
PDF
20160818 Semantics and Linkage of Archived Catalogs
PPT
Friday talk 11.02.2011
PDF
Freedom for bibliographic references: OpenCitations arise
PDF
20161004 “Open Data Web” – A Linked Open Data Repository Built with CKAN
PPT
BHL Tech Overview for BHL-Europe
PDF
Relations for Reusing (R4R) in A Shared Context: An Exploration on Research P...
PDF
How to clean data less through Linked (Open Data) approach?
PPTX
Hack U Barcelona 2011
PDF
The Role of Metadata in Reproducible Computational Research
PPTX
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
PDF
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
PDF
Yosemite part-4 webinar-final
PPTX
247th ACS Meeting: The Eureka Research Workbench
PPTX
An Identifier Scheme for the Digitising Scotland Project
PPTX
Expanding the content categories at JaLC
PPTX
PPTX
Using Linked Data to Mine RDF from Wikipedia's Tables
PDF
HRGRN: enabling graph search and integrative analysis of Arabidopsis signalin...
2009 Dils Flyweb
20160818 Semantics and Linkage of Archived Catalogs
Friday talk 11.02.2011
Freedom for bibliographic references: OpenCitations arise
20161004 “Open Data Web” – A Linked Open Data Repository Built with CKAN
BHL Tech Overview for BHL-Europe
Relations for Reusing (R4R) in A Shared Context: An Exploration on Research P...
How to clean data less through Linked (Open Data) approach?
Hack U Barcelona 2011
The Role of Metadata in Reproducible Computational Research
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
Yosemite part-4 webinar-final
247th ACS Meeting: The Eureka Research Workbench
An Identifier Scheme for the Digitising Scotland Project
Expanding the content categories at JaLC
Using Linked Data to Mine RDF from Wikipedia's Tables
HRGRN: enabling graph search and integrative analysis of Arabidopsis signalin...
Ad

Viewers also liked (12)

PPTX
Query-generation-for-provo-data-201406
PDF
2012 04-ldow-prov
PDF
2012 05-swpm-provo
PDF
2010 10 provxg_datagovuk
PPT
Talk_linked_data_for_hcls_at_iswc2009
ODP
2011 03-provenance-workshop-edingurgh
PPT
Horeca in zwaar weer? Kansen zien en benutten!
PDF
2010 05 edinburgh
PPT
2009 09 Lod London
PDF
WordPress Meetup Bandung - December 2014
PPTX
Benefits of Blogging
PDF
2010 06 rdf_next
Query-generation-for-provo-data-201406
2012 04-ldow-prov
2012 05-swpm-provo
2010 10 provxg_datagovuk
Talk_linked_data_for_hcls_at_iswc2009
2011 03-provenance-workshop-edingurgh
Horeca in zwaar weer? Kansen zien en benutten!
2010 05 edinburgh
2009 09 Lod London
WordPress Meetup Bandung - December 2014
Benefits of Blogging
2010 06 rdf_next
Ad

Similar to 2009 0807 Lod Gmod (20)

PPT
2008 11 13 Hcls Call
PDF
Use of open_linked_data_in_bioinformatics
PDF
Bio2RDF presentation at Combine 2012
PDF
Powering Scientific Discovery with the Semantic Web (VanBUG 2014)
PDF
Bio2RDF @ W3C HCLS2009
PPTX
The Progress on Sagace and Data Integration
PDF
Bio2RDF @ DILS 2008
PPTX
E.Gombocz: Semantics in a Box (SemTech 2013-04-30)
PPT
Finding knowledge, data and answers on the Semantic Web
PPTX
Linked APIs for Life Sciences Tutorial at SWAT4LS 3011
PPTX
Exposing EO Linked (meta-)Data from OpenSearch Catalogue
PDF
Connecting the dots: drug information and Linked Data
PPTX
Building a Network of Interoperable and Independently Produced Linked and Ope...
PPTX
Applied semantic technology and linked data
PPTX
BioPAX Models and Pathways
PPTX
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
PPTX
Publishing and Consuming FAIR Data A Case in the Agri-Food Domain
PDF
Producing, publishing and consuming linked data - CSHALS 2013
PPTX
Bio2RDF and Beyond!
PDF
BioSD Tutorial 2014 Editition
2008 11 13 Hcls Call
Use of open_linked_data_in_bioinformatics
Bio2RDF presentation at Combine 2012
Powering Scientific Discovery with the Semantic Web (VanBUG 2014)
Bio2RDF @ W3C HCLS2009
The Progress on Sagace and Data Integration
Bio2RDF @ DILS 2008
E.Gombocz: Semantics in a Box (SemTech 2013-04-30)
Finding knowledge, data and answers on the Semantic Web
Linked APIs for Life Sciences Tutorial at SWAT4LS 3011
Exposing EO Linked (meta-)Data from OpenSearch Catalogue
Connecting the dots: drug information and Linked Data
Building a Network of Interoperable and Independently Produced Linked and Ope...
Applied semantic technology and linked data
BioPAX Models and Pathways
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Publishing and Consuming FAIR Data A Case in the Agri-Food Domain
Producing, publishing and consuming linked data - CSHALS 2013
Bio2RDF and Beyond!
BioSD Tutorial 2014 Editition

More from Jun Zhao (7)

PDF
Www sociam-2016-policy-reviews
PDF
2011 03-provenance-workshop-edingurgh
PDF
2010 09 opm_tutorial_02-jun-opmv
PPT
2010 09 opm_tutorial_01-jun-usecase-datagovuk
PPT
myExperiment and AIDA
PPT
2008 Jun Zhao Eswc
PDF
2008 04 22 Jun Zhao Ldow
Www sociam-2016-policy-reviews
2011 03-provenance-workshop-edingurgh
2010 09 opm_tutorial_02-jun-opmv
2010 09 opm_tutorial_01-jun-usecase-datagovuk
myExperiment and AIDA
2008 Jun Zhao Eswc
2008 04 22 Jun Zhao Ldow

Recently uploaded (20)

PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
cuic standard and advanced reporting.pdf
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Electronic commerce courselecture one. Pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Modernizing your data center with Dell and AMD
PPTX
Big Data Technologies - Introduction.pptx
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
A Presentation on Artificial Intelligence
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Machine learning based COVID-19 study performance prediction
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPT
Teaching material agriculture food technology
PPTX
Cloud computing and distributed systems.
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
cuic standard and advanced reporting.pdf
Understanding_Digital_Forensics_Presentation.pptx
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Electronic commerce courselecture one. Pdf
NewMind AI Weekly Chronicles - August'25 Week I
Modernizing your data center with Dell and AMD
Big Data Technologies - Introduction.pptx
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Review of recent advances in non-invasive hemoglobin estimation
Per capita expenditure prediction using model stacking based on satellite ima...
Advanced methodologies resolving dimensionality complications for autism neur...
A Presentation on Artificial Intelligence
Diabetes mellitus diagnosis method based random forest with bat algorithm
Dropbox Q2 2025 Financial Results & Investor Presentation
Spectral efficient network and resource selection model in 5G networks
Machine learning based COVID-19 study performance prediction
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Teaching material agriculture food technology
Cloud computing and distributed systems.

2009 0807 Lod Gmod

Editor's Notes

  • #14: Note that the thumbnail images are retrieved from the original web sites
  • #15: FlyUI: a library of Javascript widgets as front ends to SPARQL data sources Built on Yahoo User Interface (YUI) library Widgets are composed in a browser to create the complete application Each widget provides: A Service that implements SPARQL queries A Model encapsulating SPARQL query results A Renderer
  • #16: Initially hoped to use D2R server&apos;s SPARQL query rewriting, but some queries would kill the server, so went for SPARQLite alternative Different techniques for generating RDF applied to different kinds of data source Resulting RDF is loaded into the Jena TDB triple store.
  • #21: Alzheimer’s herbs with side effects. Red: Alzheimer’s herbs. Blue: drugs with no side effects reported. Purple: drugswith reported side effects.