SlideShare a Scribd company logo
Powering Scientific Discovery with the Semantic Web
1
Michel Dumontier, Ph.D.
Associate Professor of Medicine (Biomedical Informatics)
Stanford University
@micheldumontier::VanBUG:13-03-2014
2 @micheldumontier::VanBUG:13-03-2014HTTP://XKCD.COM/242/
Science
@micheldumontier::VanBUG:13-03-20143
We use the biomedical literature to gather evidence
4 @micheldumontier::VanBUG:13-03-2014
we query literature-curated databases
(problems: access, format, identifiers & linking)
5 @micheldumontier::VanBUG:13-03-2014
we use software to analyze, predict and evaluate
(problems: OS, versioning, input/output formats)
6 @micheldumontier::VanBUG:13-03-2014
we develop fairly sophisticated workflows to
gather support for our hypotheses
7 @micheldumontier::VanBUG:13-03-2014
Wouldn’t it be great if we could just find the evidence
required to support or dispute a scientific hypothesis using
the most up-to-date and relevant data, tools and scientific
knowledge?
8 @micheldumontier::VanBUG:13-03-2014
madness!
1. Build a massive network of interconnected data
and software using web standards
– Bio2RDF + SADI
2. Generate and test scientific hypotheses
– prepare data (tactical formalization)
– discover associations (ontology-based enrichment)
– gather evidence (HyQue)
3. Contribute back to the global knowledge graph
@micheldumontier::VanBUG:13-03-20149
The Semantic Web
is the new global web of knowledge
10 @micheldumontier::VanBUG:13-03-2014
standards for publishing, sharing and querying
facts, expert knowledge and services
scalable approach for the discovery
of independently formulated
and distributed knowledge
Resource Description Framework
• It’s a language to represent knowledge
– Logic-based formalism -> automated reasoning
– graph-like properties -> data analysis
• Good for
– Describing in terms of type, attributes, relations
– Integrating data from different sources
– Sharing the data (W3C standard)
– Reusing what is available, developing what you need,
and contributing back to the web of data.
@micheldumontier::VanBUG:13-03-201411
we’re building an incredible network of linked data
12 @micheldumontier::VanBUG:13-03-2014Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch
@micheldumontier::VanBUG:13-03-2014
Linked Data for the Life Sciences
13
Bio2RDF is an open source project to unify the
representation and interlinking of biological data using RDF.
chemicals/drugs/formulations,
genomes/genes/proteins, domains
Interactions, complexes & pathways
animal models and phenotypes
Disease, genetic markers, treatments
Terminologies & publications
• Release 2 (Jan 2013): 1B+ interlinked statements
from 19 high value datasets
• dataset description, provenance & statistics
• Partnerships with EBI, NCBI, DBCLS, NCBO,
OpenPHACTS, and commercial tool providers
@micheldumontier::VanBUG:13-03-2014
drugbank:DB00586
drugbank_vocabulary:Drug
rdf:type
drugbank_target:290
drugbank_vocabulary:Target
rdf:type
drugbank_vocabulary:targets
rdfs:label
Prostaglandin G/H synthase 2
[drugbank_target:290]
rdfs:label
Diclofenac [drugbank:DB00586]
14
The linked data network expands
with every reference
@micheldumontier::VanBUG:13-03-2014
drugbank:DB00586
pharmgkb_vocabulary:Drug
rdf:type
rdfs:label
diclofenac [drugbank:DB00586]
pharmgkb:PA449293
drugbank_vocabulary:Drug
pharmgkb_vocabulary:xref
diclofenac [pharmgkb:PA449293]
rdfs:label
DrugBank
PharmGKB
15
@micheldumontier::VanBUG:13-03-201416
@micheldumontier::VanBUG:13-03-201417
@micheldumontier::VanBUG:13-03-201418
Bio2RDF R2 connectivity
@micheldumontier::VanBUG:13-03-201419
Graph summaries in query formulation
@micheldumontier::VanBUG:13-03-2014
PREFIX drugbank_vocabulary: <http://bio2rdf.org/drugbank_vocabulary:>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
SELECT ?ddi ?d1name ?d2name
WHERE {
?ddi a drugbank_vocabulary:Drug-Drug-Interaction .
?d1 drugbank_vocabulary:ddi-interactor-in ?ddi .
?d1 rdfs:label ?d1name .
?d2 drugbank_vocabulary:ddi-interactor-in ?ddi .
?d2 rdfs:label ?d2name.
FILTER (?d1 != ?d2)
}
20
You can use the SPARQLed query
assistant with updated endpoints
@micheldumontier::VanBUG:13-03-2014
http://sindicetech.com/sindice-suite/sparqled/
graph: http://sindicetech.com/analytics21
Use virtuoso’s built in faceted
browser to construct increasingly
complex queries with little effort
@micheldumontier::VanBUG:13-03-201422
Federated Queries over Independent
SPARQL EndPoints
Get all protein catabolic processes (and more specific) in biomodels
SELECT ?go ?label count(distinct ?x)
WHERE {
service <http://bioportal.bio2rdf.org/sparql> {
?go rdfs:label ?label .
?go rdfs:subClassOf ?tgo
?tgo rdfs:label ?tlabel .
FILTER regex(?tlabel, "^protein catabolic process")
}
service <http://biomodels.bio2rdf.org/sparql> {
?x <http://bio2rdf.org/biopax_vocabulary:identical-to> ?go .
?x a <http://www.biopax.org/release/biopax-level3.owl#BiochemicalReaction> .
}
}
@micheldumontier::VanBUG:13-03-201423
24 @micheldumontier::VanBUG:13-03-2014
Bio2RDF: 1M+ SPARQL queries per month
UniProt: 6.4M queries per month
EBI: 3.5M queries (Oct 2013)
Despite all the data, it’s still hard to find answers to questions
Because there are many ways to represent the same data
and each dataset represents it differently
@micheldumontier::VanBUG:13-03-201425
multiple formalizations of the same kind of
data do emerge, each with their own merit
@micheldumontier::VanBUG:13-03-201426
Massive Proliferation of Ontologies / Vocabularies
@micheldumontier::VanBUG:13-03-201427
Multi-Stakeholder Efforts to Standardize
Representations are Reasonable,
Long Term Strategies for Data Integration
@micheldumontier::VanBUG:13-03-201428
uniprot:P05067
uniprot:Protein
is a
sio:gene
is a is a
Semantic data integration, consistency checking and
query answering over Bio2RDF with the
Semanticscience Integrated Ontology (SIO)
dataset
ontology
Knowledge Base
@micheldumontier::VanBUG:13-03-2014
pharmgkb:PA30917
refseq:Protein
is a
is a
omim:189931
omim:Gene pharmgkb:Gene
Querying Bio2RDF Linked Open Data with a Global Schema. Alison Callahan, José Cruz-Toledo and
Michel Dumontier. Bio-ontologies 2012.
29
@micheldumontier::VanBUG:13-03-201430
SRIQ(D)
10700+ axioms
1300+ classes
201 object properties (inc. inverses)
1 datatype property
Bio2RDF and SIO powered SPARQL 1.1 federated query:
Find chemicals (from CTD) and proteins (from SGD) that
participate in the same process (from GOA)
SELECT ?chem, ?prot, ?proc
FROM <http://bio2rdf.org/ctd>
WHERE {
SERVICE <http://ctd.bio2rdf.org/sparql> {
?chemical a sio:chemical-entity.
?chemical rdfs:label ?chem.
?chemical sio:is-participant-in ?process.
?process rdfs:label ?proc.
FILTER regex (?process, "http://bio2rdf.org/go:")
}
SERVICE <http://sgd.bio2rdf.org/sparql> {
?protein a sio:protein .
?protein sio:is-participant-in ?process.
?protein rdfs:label ?prot .
}
} @micheldumontier::VanBUG:13-03-201431
tactical formalization
@micheldumontier::VanBUG:13-03-201432
Take what you need
and represent it in a way that directly serves your objective
STANDARDSUSER DRIVEN REPRESENTATION
identifying aberrant and pharmacological pathways
predicting drug targets using organism phenotypes
Biopax-pathway exploration
SPARQL-powered genome navigation
aberrant and pharmacological pathways
Q1. Can we identify pathways that are associated
with a particular disease or class of diseases?
Q2. Can we identify pathways are associated with
a particular drug or class of drugs?
drug
pathway
disease
@micheldumontier::VanBUG:13-03-201433
Identification of
drug and disease enriched pathways
• Approach
– Integrate 3 datasets
• DrugBank, PharmGKB and CTD
– Integrate 7 terminologies
• MeSH, ATC, ChEBI, UMLS, SNOMED, ICD, DO
– Formalize selected elements
– Identify significant associations using enrichment
analysis over the fully inferred knowledge base
Identifying aberrant pathways through integrated analysis of knowledge in pharmacogenomics.
Bioinformatics. 2012. @micheldumontier::VanBUG:13-03-201434
Formal knowledge
representation
as a strategy for
data integration
@micheldumontier::VanBUG:13-03-201435
Have you heard of OWL?
@micheldumontier::VanBUG:13-03-201436
mercaptopurine
[pharmgkb:PA450379]
mercaptopurine
[drugbank:DB01033]
purine-6-thiol
[CHEBI:2208]
Class Equivalence
mercaptopurine
[ATC:L01BB02]
Top Level Classes
(disjointness)
drug diseasegenepathway
property chains
Class subsumption
mercaptopurine
[mesh:D015122]
Reciprocal
Existentials
drug disease
pathway gene
Formalized as an OWL-EL ontology
650,000+ classes, 3.2M subClassOf axioms, 75,000
equivalentClass axioms
@micheldumontier::VanBUG:13-03-201437
Benefits: Enhanced Query Capability
– Use any mapped terminology to query a target resource.
– Use knowledge in target ontologies to formulate more
precise questions
• ask for drugs that are associated with diseases of the joint:
‘Chikungunya’ (do:0050012) is defined as a viral infectious disease
located in the ‘joint’ (fma:7490) and caused by a ‘Chikungunya
virus’ (taxon:37124).
– Learn relationships that are inferred by automated
reasoning.
• alcohol (ChEBI:30879) is associated with alcoholism (PA443309)
since alcoholism is directly associated with ethanol (CHEBI:16236)
• ‘parasitic infectious disease’ (do:0001398) associated with 129
drugs, 15 more than are directly linked.
@micheldumontier::VanBUG:13-03-201438
Knowledge Discovery through Data
Integration and Enrichment Analysis
• OntoFunc: Tool to discover significant associations between sets of objects
and ontology categories. enrichment of attribute among a selected set of
input items as compared to a reference set. hypergeometric or the
binomial distribution, Fisher's exact test, or a chi-square test.
• We found 22,653 disease-pathway associations, where for each pathway
we find genes that are linked to disease.
– Mood disorder (do:3324) associated with Zidovudine Pathway
(pharmgkb:PA165859361). Zidovudine is for treating HIV/AIDS. Side
effects include fatigue, headache, myalgia, malaise and anorexia
• We found 13,826 pathway-chemical associations
– Clopidogrel (chebi:37941) associated with Endothelin signaling
pathway (pharmgkb:PA164728163). Endothelins are proteins that
constrict blood vessels and raise blood pressure. Clopidogrel inhibits
platelet aggregation and prolongs bleeding time.
@micheldumontier::VanBUG:13-03-201439
Tactical Formalization + Automated Reasoning
Offers Compelling Value for Certain Problems
We need to be smart about the goal, and how best to
achieve it. Tactical formalization is another tool in the
toolbox.
We’ve formalized data as OWL ontologies to verify, fix and
exploit Linked Data through expressive OWL reasoning
• To identify mistakes in human curated knowledge
• To identify conflicting meaning in terms
• To identify mistakes in the representation of RDF data
o incorrect use of relations
o incorrect assertion of identity (owl:sameAs)
Many other applications can be envisioned.
@micheldumontier::VanBUG:13-03-201440
PhenomeDrug
A computational approach to predict drug
targets, drug effects, and drug indications using
phenotypes
@micheldumontier::VanBUG:13-03-201441
Mouse model phenotypes provide information about human drug targets.
Hoehndorf R, Hiebert T, Hardy NW, Schofield PN, Gkoutos GV, Dumontier M.
Bioinformatics. 2013.
animal models provide insight for on target effects
• In the majority of 100 best selling drugs ($148B in US
alone), there is a direct correlation between knockout
phenotype and drug effect
• Gastroesophageal reflux
– Proton pump inhibitors (e.g Prilosec - $5B)
– KO of alpha or beta unit of H/K ATPase are achlorhydric (normal
pH) in stomach compared to wild type (pH 3-5) when exposed
to acidic solution
• Immunological Indications
– Anti-histamines (Claritin, Allegra, Zyrtec)
– KO of histamine H1 receptor leads to decreased responsiveness
of immune system
– Predicts on target effects : drowsiness, reduced anxiety
@micheldumontier::VanBUG:13-03-201442
Zambrowicz and Sands. Nat Rev Drug Disc. 2003.
Identifying drug targets
from mouse knock-out phenotypes
@micheldumontier::VanBUG:13-03-201443
drug
gene
phenotypes effects
human gene
non-functional
gene model
ortholog
similar
inhibits
Main idea: if a drug’s phenotypes matches the phenotypes of a
null model, this suggests that the drug is an inhibitor of the gene
Terminological Interoperability
(we must compare apples with apples)
Mouse
Phenotypes
Drug effects
(mappings from UMLS to DO, NBO, MP)
Mammalian
Phenotype
OntologyPhenomeNet
PhenomeDrug
@micheldumontier::VanBUG:13-03-2014
Semantic Similarity
@micheldumontier::VanBUG:13-03-201445
Given a drug effect profile D and a mouse model M, we
compute the semantic similarity as an information weighted
Jaccard metric.
The similarity measure used is non-symmetrical and
determines the amount of information about a drug effect
profile D that is covered by a set of mouse model
phenotypes M.
Loss of function models predict
targets of inhibitor drugs
• 14,682 drugs; 7,255 mouse genotypes
• Validation against known and predicted inhibitor-target pairs
– 0.76 ROC AUC for human targets (DrugBank)
– 0.81 ROC AUC for mouse targets (STITCH)
• diclofenac (STITCH:000003032)
– NSAID used to treat pain, osteoarthritis and rheumatoid arthritis
– Drug effects include liver inflammation (hepatitis), swelling of liver
(hepatomegaly), redness of skin (erythema)
– 49% explained by PPARg knockout
• peroxisome proliferator activated receptor gamma (PPARg) regulates metabolism,
proliferation, inflammation and differentiation,
• Diclofenac is a known inhibitor
– 46% explained by COX-2 knockout
• Diclofenac is a known inhibitor
@micheldumontier::VanBUG:13-03-2014
Using the Semantic Web to Gather
Evidence for Scientific Hypotheses
What evidence supports or disputes that TKIs are cardiotoxic?
@micheldumontier::VanBUG:13-03-201447
• Tyrosine Kinase Inhibitors (TKI)
– Imatinib, Sorafenib, Sunitinib, Dasatinib, Nilotinib, Lapatinib
– Used to treat cancer
– Linked to cardiotoxicity.
• FDA launched drug safety program to detect toxicity
– Need to integrate data and ontologies (Abernethy, CPT 2011)
– Abernethy (2013) suggest using public data in genetics,
pharmacology, toxicology, systems biology, to
predict/validate adverse events
• What evidence could we gather to give credence that
TKI’s causes non-QT cardiotoxicity?
@micheldumontier::VanBUG:13-03-201448
FDA Use Case:
TKI non-QT Cardiotoxicity
@micheldumontier::VanBUG:13-03-201449
Jane P.F. Bai and Darrell R. Abernethy. Systems Pharmacology to Predict Drug Toxicity: Integration Across Levels of
Biological Organization. Annu. Rev. Pharmacol. Toxicol. 2013.53:451-473
• The goal of HyQue is retrieve and evaluate evidence
that supports/disputes a hypothesis
– hypotheses are described as a set of events
• e.g. binding, inhibition, phenotypic effect
– events are associated with types of evidence
• a query is written to retrieve data
• a weight is assigned to provide significance
• Hypotheses are written by people who seek answers
• data retrieval rules are written by people who know the
data and how it should be interpreted 
@micheldumontier::VanBUG:13-03-201450
HyQue
1. HyQue: Evaluating hypotheses using Semantic Web technologies. J Biomed Semantics. 2011 May 17;2 Suppl 2:S3.
2. Evaluating scientific hypotheses using the SPARQL Inferencing Notation. Extended Semantic Web Conference (ESWC
2012). Heraklion, Crete. May 27-31, 2012.
HyQue: A Semantic Web Application
@micheldumontier::VanBUG:1
51
Software
OntologiesData
Hypothesis Evaluation
What evidence might we gather?
• clinical: Are there cardiotoxic effects associated with the drug?
– Literature (studies) [curated db]
– Product labels (studies) [r3:sider]
– Clinical trials (studies) [r3:clinicaltrials]
– Adverse event reports [r2:pharmgkb/onesides]
– Electronic health records (observations)
• pre-clinical associations:
– genotype-phenotype (null/disease models) [r2:mgi, r2:sgd; r3:wormbase]
– in vitro assays (IC50) [r3:chembl]
– drug targets [r2:drugbank; r2:ctd; r3:stitch]
– drug-gene expression [r3:gxa]
– pathways [r2:kegg; r3:reactome]
– Drug-pathway, disease-pathway enrichments [aberrant pathways]
– Chemical properties [r2:pubchem; r2.drugbank]
– Toxicology [r1.toxkb/cebs]
@micheldumontier::VanBUG:13-03-201452
Data retrieval is done with SPARQL
@micheldumontier::VanBUG:13-03-201453
Data Evaluation is done with SPIN
rules
@micheldumontier::VanBUG:13-03-201454
@micheldumontier::VanBUG:13-03-201455
@micheldumontier::VanBUG:13-03-201456
http://bio2rdf.org/drugbank:DB01268
@micheldumontier::VanBUG:13-03-201457
@micheldumontier::VanBUG:13-03-201458
@micheldumontier::VanBUG:13-03-201459
In Summary
• This talk was about making sense of the
structured data we already have
• RDF-based Linked Open Data acts as a substrate
for query answering and task-based formalization
in OWL
• Discovery through the generation of testable
hypotheses in the target domain.
• Using Linked Data to gather evidence regarding
scientific hypotheses
• The big challenge lies in establishing discovery
pipelines with user-generated scientific data.
@micheldumontier::VanBUG:13-03-201460
Acknowledgements
Bio2RDF Release 2:
Allison Callahan, Jose Cruz-Toledo, Peter Ansell
Aberrant Pathways: Robert Hoehndorf, Georgios Gkoutos
PhenomeDrug: Tanya Hiebert, Robert Hoehndorf, Georgios
Gkoutos, Paul Schofield
TKI Cardiotoxicity: Alison Callahan, Tania Hiebert, Beatriz
Lujan, Sira Sarntivijai (FDA)
@micheldumontier::VanBUG:13-03-201461
dumontierlab.com
michel.dumontier@stanford.edu
Website: http://dumontierlab.com
Presentations: http://slideshare.com/micheldumontier
62 @micheldumontier::VanBUG:13-03-2014

More Related Content

PPTX
Generating Biomedical Hypotheses Using Semantic Web Technologies
PDF
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
PPTX
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
PPTX
2016 bmdid-mappings
PDF
Link Analysis of Life Sciences Linked Data
PPTX
Exploring Chemical and Biological Knowledge Spaces with PubChem
PDF
Nucl. Acids Res.-2014-Howe-nar-gku1244
PDF
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
Generating Biomedical Hypotheses Using Semantic Web Technologies
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
2016 bmdid-mappings
Link Analysis of Life Sciences Linked Data
Exploring Chemical and Biological Knowledge Spaces with PubChem
Nucl. Acids Res.-2014-Howe-nar-gku1244
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...

What's hot (20)

PDF
Howe et al. - 2015 - BioAssay Research Database (BARD) chemical biolog
PPTX
Model Organism Linked Data
PPTX
DOCX
Natalya Csatari_Library Database and Website Search
PDF
Open Access and Property Rights on a Collision Course with Scholars
PPTX
PubChem and Its Applications for Drug Discovery
PPTX
PubChem: a public chemical information resource for big data chemistry
PPTX
PubChem as a resource for chemical information training
PPTX
Toxicological information in PubChem
PDF
PubChem for chemical information literacy training
PPTX
Mashing Up Drug Discovery
PPTX
Searching for chemical information using PubChem
PPTX
Guide to Malaria Pharmacology, GEMM 2019
PPTX
GuideToImmunopharmacology_SIF_Nov2019
PPTX
GtoPDB_ELIXIR_UK_AllHands_update_Dec2019
PDF
PubChem for drug discovery in the age of big data and artificial intelligence
PPTX
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
PDF
dkNET Poster Experimental Biology 2019
PDF
DisGeNET: a discovery platform to support translational research and drug dis...
Howe et al. - 2015 - BioAssay Research Database (BARD) chemical biolog
Model Organism Linked Data
Natalya Csatari_Library Database and Website Search
Open Access and Property Rights on a Collision Course with Scholars
PubChem and Its Applications for Drug Discovery
PubChem: a public chemical information resource for big data chemistry
PubChem as a resource for chemical information training
Toxicological information in PubChem
PubChem for chemical information literacy training
Mashing Up Drug Discovery
Searching for chemical information using PubChem
Guide to Malaria Pharmacology, GEMM 2019
GuideToImmunopharmacology_SIF_Nov2019
GtoPDB_ELIXIR_UK_AllHands_update_Dec2019
PubChem for drug discovery in the age of big data and artificial intelligence
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
dkNET Poster Experimental Biology 2019
DisGeNET: a discovery platform to support translational research and drug dis...
Ad

Viewers also liked (20)

PPT
Lourenza
PPT
Blerta Interpreter 1
PPT
Sentimenduak
PPTX
Squizz presentation
PPT
Classical Series
KEY
IT for Nursing - 5
PPSX
Innovation Benefits Realization for Industrial Research (Part-3)
PPT
Web 2.0, Dutch Railways & Teststation Leiden
PPT
Survival of the Quickest
PPT
Web Configurator
PDF
Ishii presentation
PPTX
CloudCoffee
PPT
Technical Communication Lab Projects
PPSX
Innovation Benefits Realization for Industrial Research (Part-2)
PPT
PDF
Force Majeure: a Time Bomb
PPT
Guerrilla Readers - Portfolio
PPT
Titian Presentation
PPT
Android Bootcamp Santa Fe GTUG
PPT
Dfi Portfolio
Lourenza
Blerta Interpreter 1
Sentimenduak
Squizz presentation
Classical Series
IT for Nursing - 5
Innovation Benefits Realization for Industrial Research (Part-3)
Web 2.0, Dutch Railways & Teststation Leiden
Survival of the Quickest
Web Configurator
Ishii presentation
CloudCoffee
Technical Communication Lab Projects
Innovation Benefits Realization for Industrial Research (Part-2)
Force Majeure: a Time Bomb
Guerrilla Readers - Portfolio
Titian Presentation
Android Bootcamp Santa Fe GTUG
Dfi Portfolio
Ad

Similar to Powering Scientific Discovery with the Semantic Web (VanBUG 2014) (20)

PPTX
Building a Network of Interoperable and Independently Produced Linked and Ope...
PDF
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
PPT
TB Mobile: Appifying data on antituberculosis molecule targets
PPTX
Pistoia Alliance USA Conference 2016
PDF
Methodologies Of Multiomics Data Integration And Data Mining 1st Edition Kang...
PPTX
MseqDR consortium: a grass-roots effort to establish a global resource aimed ...
PDF
Clustering In Bioinformatics And Drug Discovery John D Maccuish
PPTX
Bioinformatics
PPTX
GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...
PPTX
Cancer Research Meets Data Science — What Can We Do Together?
PPTX
Introduction to Gene Mining Part A: BLASTn-off!
PDF
Introduction to Bioinformatics.
PDF
Methodologies of Multi-Omics Data Integration and Data Mining 1st Edition Kan...
PDF
Open science and medical evidence generation - Kees van Bochove - The Hyve
PDF
Integrating Omics Data 1st Edition George C Tseng Debashis Ghosh
PDF
Integrating Omics Data 1st Edition George C Tseng Debashis Ghosh
PPT
2011-10-11 Open PHACTS at BioIT World Europe
PDF
Informatics In Proteomics 1st Edition Sudhir Srivastava
PPT
Data Mining and Big Data Analytics in Pharma
Building a Network of Interoperable and Independently Produced Linked and Ope...
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.ca
TB Mobile: Appifying data on antituberculosis molecule targets
Pistoia Alliance USA Conference 2016
Methodologies Of Multiomics Data Integration And Data Mining 1st Edition Kang...
MseqDR consortium: a grass-roots effort to establish a global resource aimed ...
Clustering In Bioinformatics And Drug Discovery John D Maccuish
Bioinformatics
GenomeTrakr: Whole-Genome Sequencing for Food Safety and A New Way Forward in...
Cancer Research Meets Data Science — What Can We Do Together?
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Bioinformatics.
Methodologies of Multi-Omics Data Integration and Data Mining 1st Edition Kan...
Open science and medical evidence generation - Kees van Bochove - The Hyve
Integrating Omics Data 1st Edition George C Tseng Debashis Ghosh
Integrating Omics Data 1st Edition George C Tseng Debashis Ghosh
2011-10-11 Open PHACTS at BioIT World Europe
Informatics In Proteomics 1st Edition Sudhir Srivastava
Data Mining and Big Data Analytics in Pharma

More from Michel Dumontier (20)

PPTX
Generating (useful) synthetic data for medical research and AI application
PDF
FAIR & AI Ready KGs for Explainable Predictions.pdf
PPTX
FAIR & AI Ready KGs for Explainable Predictions
PPTX
A metadata standard for Knowledge Graphs
PPTX
Data-Driven Discovery Science with FAIR Knowledge Graphs
PDF
Evaluating FAIRness
PPTX
The Role of the FAIR Guiding Principles for an effective Learning Health System
PPTX
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
PPTX
The role of the FAIR Guiding Principles in a Learning Health System
PPTX
Acclerating biomedical discovery with an internet of FAIR data and services -...
PPTX
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
PPTX
Are we FAIR yet? And will it be worth it?
PPTX
The Future of FAIR Data: An international social, legal and technological inf...
PDF
Keynote at the 2018 Maastricht University Dinner
PPTX
The future of science and business - a UM Star Lecture
PPTX
Are we FAIR yet?
PPTX
Developing and assessing FAIR digital resources
PPTX
Advancing Biomedical Knowledge Reuse with FAIR
PPTX
A Framework to develop the FAIR Metrics
PPTX
FAIR principles and metrics for evaluation
Generating (useful) synthetic data for medical research and AI application
FAIR & AI Ready KGs for Explainable Predictions.pdf
FAIR & AI Ready KGs for Explainable Predictions
A metadata standard for Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge Graphs
Evaluating FAIRness
The Role of the FAIR Guiding Principles for an effective Learning Health System
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
The role of the FAIR Guiding Principles in a Learning Health System
Acclerating biomedical discovery with an internet of FAIR data and services -...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Are we FAIR yet? And will it be worth it?
The Future of FAIR Data: An international social, legal and technological inf...
Keynote at the 2018 Maastricht University Dinner
The future of science and business - a UM Star Lecture
Are we FAIR yet?
Developing and assessing FAIR digital resources
Advancing Biomedical Knowledge Reuse with FAIR
A Framework to develop the FAIR Metrics
FAIR principles and metrics for evaluation

Recently uploaded (20)

PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Approach and Philosophy of On baking technology
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PPTX
Big Data Technologies - Introduction.pptx
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Electronic commerce courselecture one. Pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Unlocking AI with Model Context Protocol (MCP)
Dropbox Q2 2025 Financial Results & Investor Presentation
Digital-Transformation-Roadmap-for-Companies.pptx
Approach and Philosophy of On baking technology
Review of recent advances in non-invasive hemoglobin estimation
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Big Data Technologies - Introduction.pptx
The Rise and Fall of 3GPP – Time for a Sabbatical?
Electronic commerce courselecture one. Pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
The AUB Centre for AI in Media Proposal.docx
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Advanced methodologies resolving dimensionality complications for autism neur...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...

Powering Scientific Discovery with the Semantic Web (VanBUG 2014)

  • 1. Powering Scientific Discovery with the Semantic Web 1 Michel Dumontier, Ph.D. Associate Professor of Medicine (Biomedical Informatics) Stanford University @micheldumontier::VanBUG:13-03-2014
  • 4. We use the biomedical literature to gather evidence 4 @micheldumontier::VanBUG:13-03-2014
  • 5. we query literature-curated databases (problems: access, format, identifiers & linking) 5 @micheldumontier::VanBUG:13-03-2014
  • 6. we use software to analyze, predict and evaluate (problems: OS, versioning, input/output formats) 6 @micheldumontier::VanBUG:13-03-2014
  • 7. we develop fairly sophisticated workflows to gather support for our hypotheses 7 @micheldumontier::VanBUG:13-03-2014
  • 8. Wouldn’t it be great if we could just find the evidence required to support or dispute a scientific hypothesis using the most up-to-date and relevant data, tools and scientific knowledge? 8 @micheldumontier::VanBUG:13-03-2014
  • 9. madness! 1. Build a massive network of interconnected data and software using web standards – Bio2RDF + SADI 2. Generate and test scientific hypotheses – prepare data (tactical formalization) – discover associations (ontology-based enrichment) – gather evidence (HyQue) 3. Contribute back to the global knowledge graph @micheldumontier::VanBUG:13-03-20149
  • 10. The Semantic Web is the new global web of knowledge 10 @micheldumontier::VanBUG:13-03-2014 standards for publishing, sharing and querying facts, expert knowledge and services scalable approach for the discovery of independently formulated and distributed knowledge
  • 11. Resource Description Framework • It’s a language to represent knowledge – Logic-based formalism -> automated reasoning – graph-like properties -> data analysis • Good for – Describing in terms of type, attributes, relations – Integrating data from different sources – Sharing the data (W3C standard) – Reusing what is available, developing what you need, and contributing back to the web of data. @micheldumontier::VanBUG:13-03-201411
  • 12. we’re building an incredible network of linked data 12 @micheldumontier::VanBUG:13-03-2014Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch
  • 13. @micheldumontier::VanBUG:13-03-2014 Linked Data for the Life Sciences 13 Bio2RDF is an open source project to unify the representation and interlinking of biological data using RDF. chemicals/drugs/formulations, genomes/genes/proteins, domains Interactions, complexes & pathways animal models and phenotypes Disease, genetic markers, treatments Terminologies & publications • Release 2 (Jan 2013): 1B+ interlinked statements from 19 high value datasets • dataset description, provenance & statistics • Partnerships with EBI, NCBI, DBCLS, NCBO, OpenPHACTS, and commercial tool providers
  • 15. The linked data network expands with every reference @micheldumontier::VanBUG:13-03-2014 drugbank:DB00586 pharmgkb_vocabulary:Drug rdf:type rdfs:label diclofenac [drugbank:DB00586] pharmgkb:PA449293 drugbank_vocabulary:Drug pharmgkb_vocabulary:xref diclofenac [pharmgkb:PA449293] rdfs:label DrugBank PharmGKB 15
  • 20. Graph summaries in query formulation @micheldumontier::VanBUG:13-03-2014 PREFIX drugbank_vocabulary: <http://bio2rdf.org/drugbank_vocabulary:> PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#> SELECT ?ddi ?d1name ?d2name WHERE { ?ddi a drugbank_vocabulary:Drug-Drug-Interaction . ?d1 drugbank_vocabulary:ddi-interactor-in ?ddi . ?d1 rdfs:label ?d1name . ?d2 drugbank_vocabulary:ddi-interactor-in ?ddi . ?d2 rdfs:label ?d2name. FILTER (?d1 != ?d2) } 20
  • 21. You can use the SPARQLed query assistant with updated endpoints @micheldumontier::VanBUG:13-03-2014 http://sindicetech.com/sindice-suite/sparqled/ graph: http://sindicetech.com/analytics21
  • 22. Use virtuoso’s built in faceted browser to construct increasingly complex queries with little effort @micheldumontier::VanBUG:13-03-201422
  • 23. Federated Queries over Independent SPARQL EndPoints Get all protein catabolic processes (and more specific) in biomodels SELECT ?go ?label count(distinct ?x) WHERE { service <http://bioportal.bio2rdf.org/sparql> { ?go rdfs:label ?label . ?go rdfs:subClassOf ?tgo ?tgo rdfs:label ?tlabel . FILTER regex(?tlabel, "^protein catabolic process") } service <http://biomodels.bio2rdf.org/sparql> { ?x <http://bio2rdf.org/biopax_vocabulary:identical-to> ?go . ?x a <http://www.biopax.org/release/biopax-level3.owl#BiochemicalReaction> . } } @micheldumontier::VanBUG:13-03-201423
  • 24. 24 @micheldumontier::VanBUG:13-03-2014 Bio2RDF: 1M+ SPARQL queries per month UniProt: 6.4M queries per month EBI: 3.5M queries (Oct 2013)
  • 25. Despite all the data, it’s still hard to find answers to questions Because there are many ways to represent the same data and each dataset represents it differently @micheldumontier::VanBUG:13-03-201425
  • 26. multiple formalizations of the same kind of data do emerge, each with their own merit @micheldumontier::VanBUG:13-03-201426
  • 27. Massive Proliferation of Ontologies / Vocabularies @micheldumontier::VanBUG:13-03-201427
  • 28. Multi-Stakeholder Efforts to Standardize Representations are Reasonable, Long Term Strategies for Data Integration @micheldumontier::VanBUG:13-03-201428
  • 29. uniprot:P05067 uniprot:Protein is a sio:gene is a is a Semantic data integration, consistency checking and query answering over Bio2RDF with the Semanticscience Integrated Ontology (SIO) dataset ontology Knowledge Base @micheldumontier::VanBUG:13-03-2014 pharmgkb:PA30917 refseq:Protein is a is a omim:189931 omim:Gene pharmgkb:Gene Querying Bio2RDF Linked Open Data with a Global Schema. Alison Callahan, José Cruz-Toledo and Michel Dumontier. Bio-ontologies 2012. 29
  • 30. @micheldumontier::VanBUG:13-03-201430 SRIQ(D) 10700+ axioms 1300+ classes 201 object properties (inc. inverses) 1 datatype property
  • 31. Bio2RDF and SIO powered SPARQL 1.1 federated query: Find chemicals (from CTD) and proteins (from SGD) that participate in the same process (from GOA) SELECT ?chem, ?prot, ?proc FROM <http://bio2rdf.org/ctd> WHERE { SERVICE <http://ctd.bio2rdf.org/sparql> { ?chemical a sio:chemical-entity. ?chemical rdfs:label ?chem. ?chemical sio:is-participant-in ?process. ?process rdfs:label ?proc. FILTER regex (?process, "http://bio2rdf.org/go:") } SERVICE <http://sgd.bio2rdf.org/sparql> { ?protein a sio:protein . ?protein sio:is-participant-in ?process. ?protein rdfs:label ?prot . } } @micheldumontier::VanBUG:13-03-201431
  • 32. tactical formalization @micheldumontier::VanBUG:13-03-201432 Take what you need and represent it in a way that directly serves your objective STANDARDSUSER DRIVEN REPRESENTATION identifying aberrant and pharmacological pathways predicting drug targets using organism phenotypes Biopax-pathway exploration SPARQL-powered genome navigation
  • 33. aberrant and pharmacological pathways Q1. Can we identify pathways that are associated with a particular disease or class of diseases? Q2. Can we identify pathways are associated with a particular drug or class of drugs? drug pathway disease @micheldumontier::VanBUG:13-03-201433
  • 34. Identification of drug and disease enriched pathways • Approach – Integrate 3 datasets • DrugBank, PharmGKB and CTD – Integrate 7 terminologies • MeSH, ATC, ChEBI, UMLS, SNOMED, ICD, DO – Formalize selected elements – Identify significant associations using enrichment analysis over the fully inferred knowledge base Identifying aberrant pathways through integrated analysis of knowledge in pharmacogenomics. Bioinformatics. 2012. @micheldumontier::VanBUG:13-03-201434
  • 35. Formal knowledge representation as a strategy for data integration @micheldumontier::VanBUG:13-03-201435
  • 36. Have you heard of OWL? @micheldumontier::VanBUG:13-03-201436
  • 37. mercaptopurine [pharmgkb:PA450379] mercaptopurine [drugbank:DB01033] purine-6-thiol [CHEBI:2208] Class Equivalence mercaptopurine [ATC:L01BB02] Top Level Classes (disjointness) drug diseasegenepathway property chains Class subsumption mercaptopurine [mesh:D015122] Reciprocal Existentials drug disease pathway gene Formalized as an OWL-EL ontology 650,000+ classes, 3.2M subClassOf axioms, 75,000 equivalentClass axioms @micheldumontier::VanBUG:13-03-201437
  • 38. Benefits: Enhanced Query Capability – Use any mapped terminology to query a target resource. – Use knowledge in target ontologies to formulate more precise questions • ask for drugs that are associated with diseases of the joint: ‘Chikungunya’ (do:0050012) is defined as a viral infectious disease located in the ‘joint’ (fma:7490) and caused by a ‘Chikungunya virus’ (taxon:37124). – Learn relationships that are inferred by automated reasoning. • alcohol (ChEBI:30879) is associated with alcoholism (PA443309) since alcoholism is directly associated with ethanol (CHEBI:16236) • ‘parasitic infectious disease’ (do:0001398) associated with 129 drugs, 15 more than are directly linked. @micheldumontier::VanBUG:13-03-201438
  • 39. Knowledge Discovery through Data Integration and Enrichment Analysis • OntoFunc: Tool to discover significant associations between sets of objects and ontology categories. enrichment of attribute among a selected set of input items as compared to a reference set. hypergeometric or the binomial distribution, Fisher's exact test, or a chi-square test. • We found 22,653 disease-pathway associations, where for each pathway we find genes that are linked to disease. – Mood disorder (do:3324) associated with Zidovudine Pathway (pharmgkb:PA165859361). Zidovudine is for treating HIV/AIDS. Side effects include fatigue, headache, myalgia, malaise and anorexia • We found 13,826 pathway-chemical associations – Clopidogrel (chebi:37941) associated with Endothelin signaling pathway (pharmgkb:PA164728163). Endothelins are proteins that constrict blood vessels and raise blood pressure. Clopidogrel inhibits platelet aggregation and prolongs bleeding time. @micheldumontier::VanBUG:13-03-201439
  • 40. Tactical Formalization + Automated Reasoning Offers Compelling Value for Certain Problems We need to be smart about the goal, and how best to achieve it. Tactical formalization is another tool in the toolbox. We’ve formalized data as OWL ontologies to verify, fix and exploit Linked Data through expressive OWL reasoning • To identify mistakes in human curated knowledge • To identify conflicting meaning in terms • To identify mistakes in the representation of RDF data o incorrect use of relations o incorrect assertion of identity (owl:sameAs) Many other applications can be envisioned. @micheldumontier::VanBUG:13-03-201440
  • 41. PhenomeDrug A computational approach to predict drug targets, drug effects, and drug indications using phenotypes @micheldumontier::VanBUG:13-03-201441 Mouse model phenotypes provide information about human drug targets. Hoehndorf R, Hiebert T, Hardy NW, Schofield PN, Gkoutos GV, Dumontier M. Bioinformatics. 2013.
  • 42. animal models provide insight for on target effects • In the majority of 100 best selling drugs ($148B in US alone), there is a direct correlation between knockout phenotype and drug effect • Gastroesophageal reflux – Proton pump inhibitors (e.g Prilosec - $5B) – KO of alpha or beta unit of H/K ATPase are achlorhydric (normal pH) in stomach compared to wild type (pH 3-5) when exposed to acidic solution • Immunological Indications – Anti-histamines (Claritin, Allegra, Zyrtec) – KO of histamine H1 receptor leads to decreased responsiveness of immune system – Predicts on target effects : drowsiness, reduced anxiety @micheldumontier::VanBUG:13-03-201442 Zambrowicz and Sands. Nat Rev Drug Disc. 2003.
  • 43. Identifying drug targets from mouse knock-out phenotypes @micheldumontier::VanBUG:13-03-201443 drug gene phenotypes effects human gene non-functional gene model ortholog similar inhibits Main idea: if a drug’s phenotypes matches the phenotypes of a null model, this suggests that the drug is an inhibitor of the gene
  • 44. Terminological Interoperability (we must compare apples with apples) Mouse Phenotypes Drug effects (mappings from UMLS to DO, NBO, MP) Mammalian Phenotype OntologyPhenomeNet PhenomeDrug @micheldumontier::VanBUG:13-03-2014
  • 45. Semantic Similarity @micheldumontier::VanBUG:13-03-201445 Given a drug effect profile D and a mouse model M, we compute the semantic similarity as an information weighted Jaccard metric. The similarity measure used is non-symmetrical and determines the amount of information about a drug effect profile D that is covered by a set of mouse model phenotypes M.
  • 46. Loss of function models predict targets of inhibitor drugs • 14,682 drugs; 7,255 mouse genotypes • Validation against known and predicted inhibitor-target pairs – 0.76 ROC AUC for human targets (DrugBank) – 0.81 ROC AUC for mouse targets (STITCH) • diclofenac (STITCH:000003032) – NSAID used to treat pain, osteoarthritis and rheumatoid arthritis – Drug effects include liver inflammation (hepatitis), swelling of liver (hepatomegaly), redness of skin (erythema) – 49% explained by PPARg knockout • peroxisome proliferator activated receptor gamma (PPARg) regulates metabolism, proliferation, inflammation and differentiation, • Diclofenac is a known inhibitor – 46% explained by COX-2 knockout • Diclofenac is a known inhibitor @micheldumontier::VanBUG:13-03-2014
  • 47. Using the Semantic Web to Gather Evidence for Scientific Hypotheses What evidence supports or disputes that TKIs are cardiotoxic? @micheldumontier::VanBUG:13-03-201447
  • 48. • Tyrosine Kinase Inhibitors (TKI) – Imatinib, Sorafenib, Sunitinib, Dasatinib, Nilotinib, Lapatinib – Used to treat cancer – Linked to cardiotoxicity. • FDA launched drug safety program to detect toxicity – Need to integrate data and ontologies (Abernethy, CPT 2011) – Abernethy (2013) suggest using public data in genetics, pharmacology, toxicology, systems biology, to predict/validate adverse events • What evidence could we gather to give credence that TKI’s causes non-QT cardiotoxicity? @micheldumontier::VanBUG:13-03-201448 FDA Use Case: TKI non-QT Cardiotoxicity
  • 49. @micheldumontier::VanBUG:13-03-201449 Jane P.F. Bai and Darrell R. Abernethy. Systems Pharmacology to Predict Drug Toxicity: Integration Across Levels of Biological Organization. Annu. Rev. Pharmacol. Toxicol. 2013.53:451-473
  • 50. • The goal of HyQue is retrieve and evaluate evidence that supports/disputes a hypothesis – hypotheses are described as a set of events • e.g. binding, inhibition, phenotypic effect – events are associated with types of evidence • a query is written to retrieve data • a weight is assigned to provide significance • Hypotheses are written by people who seek answers • data retrieval rules are written by people who know the data and how it should be interpreted  @micheldumontier::VanBUG:13-03-201450 HyQue 1. HyQue: Evaluating hypotheses using Semantic Web technologies. J Biomed Semantics. 2011 May 17;2 Suppl 2:S3. 2. Evaluating scientific hypotheses using the SPARQL Inferencing Notation. Extended Semantic Web Conference (ESWC 2012). Heraklion, Crete. May 27-31, 2012.
  • 51. HyQue: A Semantic Web Application @micheldumontier::VanBUG:1 51 Software OntologiesData Hypothesis Evaluation
  • 52. What evidence might we gather? • clinical: Are there cardiotoxic effects associated with the drug? – Literature (studies) [curated db] – Product labels (studies) [r3:sider] – Clinical trials (studies) [r3:clinicaltrials] – Adverse event reports [r2:pharmgkb/onesides] – Electronic health records (observations) • pre-clinical associations: – genotype-phenotype (null/disease models) [r2:mgi, r2:sgd; r3:wormbase] – in vitro assays (IC50) [r3:chembl] – drug targets [r2:drugbank; r2:ctd; r3:stitch] – drug-gene expression [r3:gxa] – pathways [r2:kegg; r3:reactome] – Drug-pathway, disease-pathway enrichments [aberrant pathways] – Chemical properties [r2:pubchem; r2.drugbank] – Toxicology [r1.toxkb/cebs] @micheldumontier::VanBUG:13-03-201452
  • 53. Data retrieval is done with SPARQL @micheldumontier::VanBUG:13-03-201453
  • 54. Data Evaluation is done with SPIN rules @micheldumontier::VanBUG:13-03-201454
  • 60. In Summary • This talk was about making sense of the structured data we already have • RDF-based Linked Open Data acts as a substrate for query answering and task-based formalization in OWL • Discovery through the generation of testable hypotheses in the target domain. • Using Linked Data to gather evidence regarding scientific hypotheses • The big challenge lies in establishing discovery pipelines with user-generated scientific data. @micheldumontier::VanBUG:13-03-201460
  • 61. Acknowledgements Bio2RDF Release 2: Allison Callahan, Jose Cruz-Toledo, Peter Ansell Aberrant Pathways: Robert Hoehndorf, Georgios Gkoutos PhenomeDrug: Tanya Hiebert, Robert Hoehndorf, Georgios Gkoutos, Paul Schofield TKI Cardiotoxicity: Alison Callahan, Tania Hiebert, Beatriz Lujan, Sira Sarntivijai (FDA) @micheldumontier::VanBUG:13-03-201461