Linked Data for Health Care and Life Science Research
Jun Zhao, University of Oxford
Outline
  • What is Linked Data?
  • What do you need to make Linked Data?
  • What can you do with Linked Data?
Datasets: EntrezGene, UniProt, KEGG Pathway, STITCH, DrugBank, SIDER
Example URIs:
  • http://purl.org/commons/record/ncbi_gene/3772180
  • http://purl.org/commons/record/P19339/
What are the differences?
  • These are not data warehouses
      - Individual stores, individual SPARQL access points
      - Easier to maintain and to update
  • They take advantage of the Web
      - Using the Web as the platform
      - Using URIs to identify and link entities
      - Building a Web-scale knowledge base
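To make "individual stores, individual SPARQL access points" concrete, the sketch below builds one SPARQL Protocol GET request per store for the same query. The endpoint URLs and the query are hypothetical placeholders, not the datasets' real access points, and nothing is sent over the network.

```python
from urllib.parse import urlencode


def sparql_get_url(endpoint, query):
    """Build a SPARQL Protocol GET URL: the query travels in the 'query' parameter."""
    return f"{endpoint}?{urlencode({'query': query})}"


# Hypothetical endpoints, one per independently maintained store.
ENDPOINTS = {
    "drugbank": "http://example.org/drugbank/sparql",
    "sider": "http://example.org/sider/sparql",
}

# A deliberately generic query used only for illustration.
QUERY = "SELECT ?s WHERE { ?s ?p ?o } LIMIT 5"

urls = {name: sparql_get_url(endpoint, QUERY) for name, endpoint in ENDPOINTS.items()}

for name, url in urls.items():
    print(name, url)
```

Each store can be updated or replaced on its own schedule; a client only needs the endpoint address and the shared URIs.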
How to make linked data?
  • Publish data as RDF
  • Assign unique identifiers to data entities
  • Use HTTP URIs so that people can look up those names
  • Include links to other data resources so that they can discover more things
  • Provide SPARQL endpoints so that data can be accessed and queried
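The first four steps above can be sketched in a few lines: an entity gets an HTTP URI, is described with RDF triples, and is linked to a resource in another dataset. The `example.org` URI is a hypothetical placeholder, the choice of `owl:sameAs` for the link is an assumption, and the N-Triples writer is simplified (it escapes only backslashes and double quotes).

```python
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def uri(value):
    """Wrap an IRI in angle brackets, as N-Triples requires."""
    return f"<{value}>"


def literal(value):
    """Quote a plain string literal (simplified escaping)."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def to_ntriples(triples):
    """Serialize (subject, predicate, object) terms, one triple per line."""
    return "\n".join(f"{s} {p} {o} ." for s, p, o in triples)


# Hypothetical HTTP URI minted for a local data entity.
drug = "http://example.org/drug/varenicline"

triples = [
    (uri(drug), uri(RDFS_LABEL), literal("Varenicline")),
    # Outgoing link to the DrugBank resource used later in the talk.
    (uri(drug), uri(OWL_SAME_AS),
     uri("http://www4.wiwiss.fu-berlin.de/drugbank/resource/drugs/DB01273")),
]

ntriples_doc = to_ntriples(triples)
print(ntriples_doc)
```

In practice a publication tool such as D2R Server generates these triples from the source database; the sketch only shows the shape of the output.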
How to make linked data? (cont.)
  • Linked data publication tools
      - D2R Server
      - Triplify
      - Pubby
      - Virtuoso Sponger
  • Transformation scripts are widely shared and openly accessible
  • Automatic link creation tools
      - Silk (see the presentation on Thursday at 2 pm)
Linked Open Drug Data (LODD)
  • A task force of the W3C Health Care and Life Sciences Interest Group, started in October 2008
  • Enrich the Web of Data by publishing drug-related data as Linked Data
  • Investigate the benefits of LODD for drug discovery and biomedical research
  • About 12 active participants, including researchers and pharmaceutical companies
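------------------------------
| Dataset   | Outgoing links |
==============================
| LinkedCT  |        220,569 |
| DrugBank  |         59,661 |
| DailyMed  |         38,220 |
| RDF-TCM   |          3,438 |
| Diseasome |         31,065 |
| SIDER     |         19,281 |
------------------------------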
Dataset (publishing tool; triples): content
  • LinkedCT (D2R Server; 7,036,000 triples): derived from ClinicalTrials.gov; more than 60,000 trials conducted in the US and other countries
  • DrugBank (D2R Server; 767,000 triples): nearly 5,000 FDA-approved small molecule and biotech drugs
  • DailyMed (D2R Server; 164,300 triples): published by the National Library of Medicine (NLM); high-quality packaging information on 4,300 marketed drugs
  • RDF-TCM (Pubby; 117,600 triples): 850 herbs, herb-gene and herb-disease associations
  • Diseasome (D2R Server; 91,200 triples): a network of disorders and disorder genes, obtained from Online Mendelian Inheritance in Man (OMIM)
  • SIDER (D2R Server; 192,500 triples): information on 930 marketed drugs and 1,700 related side effects
  • Total: about 8,400,000 triples
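As a quick arithmetic check, the per-dataset triple counts on this slide can be summed and compared with the stated total. The sketch uses only the figures given above; the variable names are illustrative.

```python
# Triple counts per dataset, as listed on the slide.
triple_counts = {
    "LinkedCT": 7_036_000,
    "DrugBank": 767_000,
    "DailyMed": 164_300,
    "RDF-TCM": 117_600,
    "Diseasome": 91_200,
    "SIDER": 192_500,
}

total = sum(triple_counts.values())
rounded_total = round(total, -5)  # round to the nearest 100,000

print(f"exact sum: {total:,}")            # exact sum: 8,368,600
print(f"rounded:   {rounded_total:,}")    # rounded:   8,400,000
```

The exact sum is 8,368,600, so the 8,400,000 on the slide is a rounded figure.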
Create linked data
  • Heterogeneous source data
      - Relational database dumps, tab-delimited data, …
  • Used D2R Server and OpenLink Virtuoso to publish linked data
  • Used Silk and LinQuer to create links
  • We got a long way without data integration or consensus on the semantics
  • The difficulties
      - Understanding the semantics of the source data
      - Heterogeneous semantics between source data
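Link creation can be illustrated with a deliberately simple stand-in: match entities from two datasets whose labels are equal after normalization and emit an `owl:sameAs` link for each match. This is not how Silk or LinQuer work internally (they support configurable similarity measures); the URIs and labels below are hypothetical.

```python
def normalize(label):
    """Lower-case a label and collapse surrounding or repeated whitespace."""
    return " ".join(label.lower().split())


def create_links(source, target):
    """Return (source_uri, 'owl:sameAs', target_uri) for labels that match exactly
    after normalization. Both arguments map URI -> label."""
    index = {}
    for target_uri, label in target.items():
        index.setdefault(normalize(label), []).append(target_uri)

    links = []
    for source_uri, label in source.items():
        for target_uri in index.get(normalize(label), []):
            links.append((source_uri, "owl:sameAs", target_uri))
    return links


# Hypothetical entities from two independently published datasets.
dataset_a = {
    "http://example.org/a/1": "Varenicline",
    "http://example.org/a/2": "Aspirin",
}
dataset_b = {
    "http://example.org/b/x": "  varenicline ",
    "http://example.org/b/y": "Ibuprofen",
}

links = create_links(dataset_a, dataset_b)
print(links)
```

Exact matching misses synonyms and spelling variants, which is one face of the "heterogeneous semantics" difficulty named on the slide.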
Which alternative medicines treat Epilepsy, a possible disease target of Varenicline?
SELECT DISTINCT ?diseaseLabel ?altMedicineLabel
WHERE {
  <http://www4.wiwiss.fu-berlin.de/drugbank/resource/drugs/DB01273>
      drugbank:possibleDiseaseTarget ?disease .
  ?disease owl:sameAs ?sameDisease .
  ?altMedicine tcm:treatment ?sameDisease .
  ?altMedicine rdf:type tcm:Medicine .
  ?sameDisease rdfs:label ?diseaseLabel .
  ?altMedicine rdfs:label ?altMedicineLabel .
}

------------------------------------------
| diseaseLabel | altMedicineLabel        |
==========================================
| "Epilepsy"   | "Ginkgo biloba"         |
| "Epilepsy"   | "Cynanchum otophyllum"  |
| "Epilepsy"   | "Piper longum"          |
| "Epilepsy"   | "Datura stramonium"     |
| "Epilepsy"   | "Uncaria rhynchophylla" |
| "Epilepsy"   | "Cannabis sativa"       |
| "Epilepsy"   | "Gastrodia elata"       |
------------------------------------------

Query 6 datasets as if they were one: SQUIN.org
Thanks to Olaf Hartig
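To show how the query's triple patterns chain across datasets, the sketch below evaluates the same join by hand over a small in-memory list of triples. The `ex:` identifiers are hypothetical stand-ins for the real URIs, only two of the result rows are modelled, and this is a plain nested-loop join, not SQUIN's link-traversal execution.

```python
def objects(triples, subject, predicate):
    """All objects o such that (subject, predicate, o) is in the data."""
    return [o for s, p, o in triples if s == subject and p == predicate]


def subjects(triples, predicate, obj):
    """All subjects s such that (s, predicate, obj) is in the data."""
    return [s for s, p, o in triples if p == predicate and o == obj]


def alternative_medicines(triples, drug):
    """Follow drug -> disease -> same disease -> medicine, then fetch labels."""
    rows = set()
    for disease in objects(triples, drug, "drugbank:possibleDiseaseTarget"):
        for same_disease in objects(triples, disease, "owl:sameAs"):
            for medicine in subjects(triples, "tcm:treatment", same_disease):
                if "tcm:Medicine" not in objects(triples, medicine, "rdf:type"):
                    continue
                for disease_label in objects(triples, same_disease, "rdfs:label"):
                    for medicine_label in objects(triples, medicine, "rdfs:label"):
                        rows.add((disease_label, medicine_label))
    return sorted(rows)  # the set gives DISTINCT, sorting gives a stable order


# Toy data with hypothetical identifiers, spanning three "datasets".
DRUG = "ex:drugbank/DB01273"
TOY_TRIPLES = [
    (DRUG, "drugbank:possibleDiseaseTarget", "ex:diseasome/epilepsy"),
    ("ex:diseasome/epilepsy", "owl:sameAs", "ex:tcm/disease/Epilepsy"),
    ("ex:tcm/disease/Epilepsy", "rdfs:label", "Epilepsy"),
    ("ex:tcm/medicine/Ginkgo_biloba", "tcm:treatment", "ex:tcm/disease/Epilepsy"),
    ("ex:tcm/medicine/Ginkgo_biloba", "rdf:type", "tcm:Medicine"),
    ("ex:tcm/medicine/Ginkgo_biloba", "rdfs:label", "Ginkgo biloba"),
    ("ex:tcm/medicine/Piper_longum", "tcm:treatment", "ex:tcm/disease/Epilepsy"),
    ("ex:tcm/medicine/Piper_longum", "rdf:type", "tcm:Medicine"),
    ("ex:tcm/medicine/Piper_longum", "rdfs:label", "Piper longum"),
]

rows = alternative_medicines(TOY_TRIPLES, DRUG)
for disease_label, medicine_label in rows:
    print(disease_label, "|", medicine_label)
```

The `owl:sameAs` hop is what lets the join cross from one dataset's disease identifier to another's; without such links the two halves of the query would never meet.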
Are there any Raccoons in India?
Relation Finder: http://relfinder.dbpedia.org/
http://esw.w3.org/topic/HCLSIG/LODD/
Thank you!

Talk_linked_data_for_hcls_at_iswc2009


Editor's Notes

  • #4: TODO
  • #9: TODO: check the updated figure from Anja
  • #10: TODO: statistics about the number of triples from Anja’s doc