SlideShare a Scribd company logo
Investigating Term Reuse and Overlap
in Biomedical Ontologies
International Conference on Biomedical Ontology
Lisbon, 27th -30th July 2015
MAU LI K R. K AM D AR , TANI A TUDORACHE A N D MARK A . MUS E N
Are we there yet?
C0011849Diabetes
Mellitus
Diabetes
Mellitus
Unified Medical Language System (UMLS)
SNOMEDCT ICD9CM
C0011849Diabetes
Mellitus
Diabetes
Mellitus
Unified Medical Language System (UMLS)
Open Biomedical Ontologies (OBO) Foundry
SNOMEDCT ICD9CM
Binding to RNA
(GRO#BindingToRNA)
GO:0003723
IRI xref
RNA Binding
(GO:0003723)
Gene Expression
Ontology (GEXO)
Gene Regulation
Ontology (GEXO)
Gene Ontology (GO)
Ghazvinian, Amir, et al. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
OBO Reuse vs Overlap in 2010
Ghazvinian, Amir, et al. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
OBO Reuse vs Overlap in 2010
Same IRI
Ghazvinian, Amir, et al. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
OBO Reuse vs Overlap in 2010
Same IRI
Intent for
Reuse
Ghazvinian, Amir, et al. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
OBO Reuse vs Overlap in 2010
Xref
mapping
Same IRI
Intent for
Reuse
Ghazvinian, Amir, et al. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
OBO Reuse vs Overlap in 2010
September 2009
Ghazvinian, Amir, et al. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
OBO Reuse vs Overlap in 2010
September 2010
Key Findings
Key Findings
 ~3% Term Reuse
 Only popular or upper-
level ontologies reused
 14.4% Term Overlap
Key Findings
 ~3% Term Reuse
 Only popular or upper-
level ontologies reused
 14.4% Term Overlap
 Semantically-similar
terms reused together
 Similarity metric for a
Recommender system
BioPortal Import Plugin
DOG4DAG
Ontofox Web tool
Neurological Disease Ontology
Neurological Disease Ontology
OBI
Reuse of an Ontology
Neurological Disease Ontology
Reuse of Terms
OGMS
Neurological Disease Ontology
NDO
Key Findings
 ~3% Term Reuse
 Only popular or upper-
level ontologies reused
 14.4% Term Overlap
 Semantically-similar
terms reused together
 Similarity metric for a
Recommender system
BioPortal
N-triples dump
Biomedical
Ontologies
Terms, Labels,
xrefs, CUIs
Xref ReuseIRI Reuse CUI Reuse
Clustering Determine
Source Ontology
Term Overlap
Analysis
509 ontologies
377 ontologies
Remove ontology views
5,718,276 class terms
Label
normalization
Source-Target
Ontology pairs
>35% reuse
for ontology
reuse
14.4% Naïve Term Overlap!
• Normalized String Matching on
Term Labels
14.4%
(823621)
156/377 ontologies reuse no terms from other ontologies!
<5% of Terms reused from other Ontologies!
>
IRI Reuse
156/377 ontologies reuse no terms from other ontologies!
<5% of Terms reused from other Ontologies!
>
IRI Reuse
156/377 ontologies reuse no terms from other ontologies!
<5% of Terms reused from other Ontologies!
>
IRI Reuse
315/377 ontologies xref link to no terms from other ontologies!
<5% of Terms reused from other Ontologies!
>
Xref Reuse
263/377 ontologies have no terms reused by other ontologies!
Reuse from a small set of ontologies only!
>
IRI Reuse
286/377 ontologies have no terms xref linked by other ontologies!
Reuse from a small set of ontologies only!
>
Xref Reuse
0-5% of total terms reused explicitly or using
xref, with >150 ontologies showing 0% reuse.
Average Term Reuse ~ 3%
Reuse from a small set of ontologies only with
terms from >250 ontologies never reused
>100% term reuse from some ontologies! Why?
0
10
20
30
40
50
60
70
80
90
100 BFO
GO
IAO
OBI
PATO
CHEBI
CL
NCBITAXON
UO
SO
UBERON
CARO
NCIT
FMA
MP
SNOMEDCT
NumberofOntologiesReusingTerms(#)
Ontologies
>100% terms reused from some ontologies!
xref Reuse (No.
of Ontologies
IRI Reuse (No. of
Ontologies)
0
10
20
30
40
50
60
70
80
90
100 1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
NumberofOntologiesReusingTerms(#)
Ontologies
>100% terms reused from some ontologies!
% of Terms
reused IRIs
% of Terms
reused xref
BFO:10
1/39
… Reuse from a small set of popular or upper-
level ontologies only with terms from >250
ontologies never reused
>100% terms reused w.r.t current version of the
BFO, PATO, CARO, UO, SO ontologies!
Needs rigorous analysis through term overlap …
1
10
100
1000
10000
100000
1000000
ICD10PCS
HCPCS
NCBITAXON
LOINC
MESH
HL7
ICD10CM
OMIM
RXNORM
CPT
PDQ
MEDDRA
ICD9CM
NDDF
ICPC
ICPC2P
MDDB
NDFRT
SNOMEDCT
VANDF
CRISP
RCD
MEDLINEPLUS
SNMI
COSTART
WHO-ART
Procedural Terminologies do not share CUIs!
CUIs shared
0 Terminologies
CUI Reuse
NumberofTerms(LogScale)
1
10
100
1000
10000
100000
1000000
ICD10PCS
HCPCS
NCBITAXON
LOINC
MESH
HL7
ICD10CM
OMIM
RXNORM
CPT
PDQ
MEDDRA
ICD9CM
NDDF
ICPC
ICPC2P
MDDB
NDFRT
SNOMEDCT
VANDF
CRISP
RCD
MEDLINEPLUS
SNMI
COSTART
WHO-ART
Procedural Terminologies do not share CUIs!
CUIs shared
1-5 Terminologies
CUI Reuse
NumberofTerms(LogScale)
1
10
100
1000
10000
100000
1000000
ICD10PCS
HCPCS
NCBITAXON
LOINC
MESH
HL7
ICD10CM
OMIM
RXNORM
CPT
PDQ
MEDDRA
ICD9CM
NDDF
ICPC
ICPC2P
MDDB
NDFRT
SNOMEDCT
VANDF
CRISP
RCD
MEDLINEPLUS
SNMI
COSTART
WHO-ART
Procedural Terminologies do not share CUIs!
CUIs shared
6-10 Terminologies
CUI Reuse
NumberofTerms(LogScale)
1
10
100
1000
10000
100000
1000000
ICD10PCS
HCPCS
NCBITAXON
LOINC
MESH
HL7
ICD10CM
OMIM
RXNORM
CPT
PDQ
MEDDRA
ICD9CM
NDDF
ICPC
ICPC2P
MDDB
NDFRT
SNOMEDCT
VANDF
CRISP
RCD
MEDLINEPLUS
SNMI
COSTART
WHO-ART
Procedural Terminologies do not share CUIs!
CUIs shared
11-15 Terminologies
CUI Reuse
NumberofTerms(LogScale)
1
10
100
1000
10000
100000
1000000
ICD10PCS
HCPCS
NCBITAXON
LOINC
MESH
HL7
ICD10CM
OMIM
RXNORM
CPT
PDQ
MEDDRA
ICD9CM
NDDF
ICPC
ICPC2P
MDDB
NDFRT
SNOMEDCT
VANDF
CRISP
RCD
MEDLINEPLUS
SNMI
COSTART
WHO-ART
Procedural Terminologies do not share CUIs!
CUIs shared
16-20 Terminologies
CUI Reuse
NumberofTerms(LogScale)
1
10
100
1000
10000
100000
1000000
ICD10PCS
HCPCS
NCBITAXON
LOINC
MESH
HL7
ICD10CM
OMIM
RXNORM
CPT
PDQ
MEDDRA
ICD9CM
NDDF
ICPC
ICPC2P
MDDB
NDFRT
SNOMEDCT
VANDF
CRISP
RCD
MEDLINEPLUS
SNMI
COSTART
WHO-ART
Procedural Terminologies do not share CUIs!
CUIs sharedCUI Reuse
NumberofTerms(LogScale)
1
10
100
1000
10000
100000
1000000
ICD10PCS
HCPCS
NCBITAXON
LOINC
MESH
HL7
ICD10CM
OMIM
RXNORM
CPT
PDQ
MEDDRA
ICD9CM
NDDF
ICPC
ICPC2P
MDDB
NDFRT
SNOMEDCT
VANDF
CRISP
RCD
MEDLINEPLUS
SNMI
COSTART
WHO-ART
Procedural Terminologies do not share CUIs!
CUIs sharedCUI Reuse
NumberofTerms(LogScale)
Minimum sharing of CUIs, especially across
UMLS Procedural Terminologies
- ICD10PCS, HCPCS and CPT
Several unique terms introduced as we migrate
from ICD9CM -> ICD10CM, leading to decrease
in Term reuse.
Should there actually be Term Reuse?
Overlap decreases using correct representations!
14.4%
(823621)
• Normalized String Matching on Term Labels
13.2%
(752,176)
• Removing Explicitly Reused Terms
10.8%
(617509)
• Removing Terms Mapped to the same UMLS CUI
1.6%
(93,650)
• Removing almost-similar terms (same identifier
and source ontology but different representation)
Average 3% Term reuse across ontologies using
any method, yet a 14.4% naïve Term overlap!
Term overlap decreases substantially on
removing almost similar terms …
Examples for almost similar terms?
Version 1.0/Version1.1
Subcellular Anatomy Ontology (SAO)
Suggested Ontology for Pharmacogenomics (SOPHARM)
Intent
Different
Versions
BFO
NCIT
Different
Notations
FMA
Different
Namespaces
MESH
SNOMEDCT
Ontology Engineers show an intent for reuse!
Intent
Different
Versions
BFO
NCIT
Different
Notations
FMA
Different
Namespaces
MESH
SNOMEDCT
NCIT:C53037/NCIT:Cerebral_Vein
Cigarette Smoke Exposure (CSEO)
Sage Bionetworks Synapse (SYN)
Ontology Engineers show an intent for reuse!
OBO:FMA_31396
OBO:owlapi/fma#FMA_31396
OBO:owl/FMA#FMA_31396
OBO:fma#Cartilage_of_inferior_surface …
Ontology Engineers show an intent for reuse!
Intent
Different
Versions
BFO
NCIT
Different
Notations
FMA
Different
Namespaces
MESH
SNOMEDCT
http://purl.bioontology.org/ontology/MESH
http://phenomebrowser.net/ontologies/mesh/mesh.owl
Intent
Different
Versions
BFO
NCIT
Different
Notations
FMA
Different
Namespaces
MESH
SNOMEDCT
Ontology Engineers show an intent for reuse!
Intent
Different
Versions
BFO
NCIT
Different
Notations
FMA
Different
Namespaces
MESH
SNOMEDCT
http://ihtsdo.org/snomedct/
http://purl.bioontology.org/ontology/SNOMEDCT
Ontology Engineers show an intent for reuse!
Different versions, notations, namespaces
• >100% Reuse of few source ontologies
• Increase in Term Overlap
Incorrect representations without mappings do
not provide advantages of Term Reuse!
Key Findings
 ~3% Term Reuse
 Only popular or upper-
level ontologies reused
 14.4% Term Overlap
 Semantically-similar
terms reused together
 Similarity metric for a
Recommender system
Onto 1 Onto 2 Onto 3 Onto 4 Onto 5 Onto 6 Onto 7
Term 1 1 1 1 0 0 0 0
Term 2 0 0 0 1 1 0 0
Term 3 0 0 0 0 0 1 1
Term 4 1 1 0 0 1 0 0
Term 5 1 1 1 0 0 0 1
Term 6 0 0 0 1 1 1 0
Term 7 0 0 1 0 1 0 0
Term-
Ontology
Matrix
K-modes
Clustering
Term-Term
Affinity
Matrix
Spectral
Clustering
Understanding how Term Reuse Occurs
Term-
Ontology
Matrix
K-modes
Clustering
Term-Term
Affinity
Matrix
Spectral
Clustering
Understanding how Term Reuse Occurs
Term-
Ontology
Matrix
K-modes
Clustering
Term-Term
Affinity
Matrix
Spectral
Clustering
Understanding how Term Reuse Occurs
• Weighted Similarity Score between Term pairs
– Shared Ontologies
– Jaccard Semantic Similarity Score
– CUI Hierarchy from UMLS Metathesaurus
Semantically-similar terms are reused together!
Semantic Similarity < 0.9
Cluster Size
Semantic Similarity > 0.9
Semantically-similar terms are reused together!
Semantic Similarity > 0.9
Semantically-similar terms are reused together!
Semantic Similarity > 0.9
Semantic-similar terms (Parent-child or siblings)
are reused together …
Similarity Metric and BioPortal can be used to
provide recommendations to ontology
developers through a Web Protégé plugin!
Challenges to Term Reuse
• Substantial term overlap but less than 5% reuse.
• Lexically-similar terms may represent different concepts (e.g.,
anatomical concepts between ZFA and XAO).
• Lexically-different terms may represent same concepts (e.g.
myocardium and cardiac muscle)
• Same terms use different IRI representations, and without explicit
CUI or xref mappings.
• Lack of guidelines and semi-automated tools.
Future Work: WebProtégé Plugin
Term reuse recommendations using
Item-based Collaborative Filtering method.
Two-fold (A Posteriori and User-Centered) Evaluation
GO:0033036
GO:0008104
GO:1902432 GO:1903260
GO:0061472
GO:0090174
GO:0071850
GO:0044770
GO:0044839
GO:0045786
GO:0007050
GO:0044843 GO:1902969 GO:0036226
- Still far from achieving ideal term reuse, beyond upper
level and popular ontologies
- Newer ontologies added in BioPortal
- Without strict guidelines and semi-automated tools,
we will deviate more away …
The Road Ahead …
Acknowledgments
Musen Lab, Stanford
BMI PhD Program, Stanford
US NIH Grants
GM086587
GM103316
maulikrk@stanford.edu
http://stanford.edu/~maulikrk/data/OntologyReuse

More Related Content

PDF
Unifying ontology services for functional genomic annotations
PDF
Ontology-based data access and semantic mining with Aber-OWL
PPT
Evaluating web authority
PDF
Page0052
PPTX
Insurance Issues for IAQA Members – What You Really Need To Know To Protect Y...
PPTX
All about me
PDF
Sémninaire psyché et cerveau 12 décembre (1)
KEY
iPads in School Libraries TCEA Presentation
Unifying ontology services for functional genomic annotations
Ontology-based data access and semantic mining with Aber-OWL
Evaluating web authority
Page0052
Insurance Issues for IAQA Members – What You Really Need To Know To Protect Y...
All about me
Sémninaire psyché et cerveau 12 décembre (1)
iPads in School Libraries TCEA Presentation

Viewers also liked (10)

PPTX
PÕLVNEMISEST TULENEV ÜLALPIDAMISKOHUSTUS JA ELATIS LAPSELE EHK ALIMENDID
PDF
Master in Finance
PDF
OLSR setup
PPTX
Hands only CPR - maha hammmady
PPTX
バス列の現状(2016ORFバージョン)
PPTX
Open Science and ORCID (in Japanese)
PDF
Excelのどうでもよいtipsの紹介
PPTX
Towards Knowledge-Enabled Society
PDF
【UDC2015】アイデア 061 tenbin
PDF
"What Is RUS?" - Requisite Unifying Structure (RUS) - Requisite Technology (2...
PÕLVNEMISEST TULENEV ÜLALPIDAMISKOHUSTUS JA ELATIS LAPSELE EHK ALIMENDID
Master in Finance
OLSR setup
Hands only CPR - maha hammmady
バス列の現状(2016ORFバージョン)
Open Science and ORCID (in Japanese)
Excelのどうでもよいtipsの紹介
Towards Knowledge-Enabled Society
【UDC2015】アイデア 061 tenbin
"What Is RUS?" - Requisite Unifying Structure (RUS) - Requisite Technology (2...
Ad

Similar to Investigating Term Reuse and Overlap in Biomedical Ontologies (20)

PDF
BMI 201 - Investigating Term Reuse and Overlap in Biomedical Ontologies
PDF
Sense and Similarity: making sense of similarity for ontologies
PPT
Ontology Mapping - Out Of The Babel Tower
PDF
My ontology is better than yours! Building and evaluating ontologies for inte...
PDF
Reuse of Ontology Mappings
PDF
Semantic decomposition of ontologies for creation of flexible biomedical conc...
PDF
Overview of CPR Ontology
PPTX
NCBO haendel talk 2013
PPTX
Kboom phenoday-2016
PPTX
Semantics as a service at EMBL-EBI
PPT
PPTX
FAIR data requires FAIR ontologies, how do we do?
PDF
Powering Biomedical Artificial Intelligence with a Holistic Knowledge Graph (...
PPTX
Enhancing the Quality of ImmPort Data
PDF
Evaluating Semantic Similarity between Biomedical Concepts/Classes through S...
PPTX
schema.org and biomedical ontologies
PDF
Tutorial: “How to use ontology repositories and ontology–based services”
PPT
Driving Deep Semantics in Middleware and Networks: What, why and how?
PPT
Formal Ontology Meets Industry: Best Practices
PPTX
Ontologies: Necessary, but not sufficient
BMI 201 - Investigating Term Reuse and Overlap in Biomedical Ontologies
Sense and Similarity: making sense of similarity for ontologies
Ontology Mapping - Out Of The Babel Tower
My ontology is better than yours! Building and evaluating ontologies for inte...
Reuse of Ontology Mappings
Semantic decomposition of ontologies for creation of flexible biomedical conc...
Overview of CPR Ontology
NCBO haendel talk 2013
Kboom phenoday-2016
Semantics as a service at EMBL-EBI
FAIR data requires FAIR ontologies, how do we do?
Powering Biomedical Artificial Intelligence with a Holistic Knowledge Graph (...
Enhancing the Quality of ImmPort Data
Evaluating Semantic Similarity between Biomedical Concepts/Classes through S...
schema.org and biomedical ontologies
Tutorial: “How to use ontology repositories and ontology–based services”
Driving Deep Semantics in Middleware and Networks: What, why and how?
Formal Ontology Meets Industry: Best Practices
Ontologies: Necessary, but not sufficient
Ad

More from Maulik Kamdar (16)

PDF
Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...
PDF
Text Snippets to Corroborate Medical Relations: An Unsupervised Approach usin...
PDF
Invited Talk at NASA Ames Research Center
PDF
Mechanism-Based Pharmacovigilance Over the Life-Sciences Linked-Open-Data Cloud
PDF
Analyzing User Interactions with Biomedical Ontologies: A Visual Perspective
PDF
BiOnIC: A Catalog of User Interactions with Biomedical Ontologies
PDF
Preproposal Talk
PPTX
Graph Analytics in Pharmacology over the Web of Life Sciences Linked Open Data
PDF
BMI Research in Progress - Thursday talk
PPTX
PRISM: A data-driven platform for monitoring mental health
PPTX
Integrating Wearables and User Interaction Patterns to Monitor Mental Health
PDF
Current advances to bridge the usability-expressivity gap in biomedical seman...
PPT
GenomeSnip: Fragmenting the Genomic Wheel to augment discovery in cancer rese...
PPT
Isolation and characterization of an extracellular antifungal protein from an...
PDF
ReVeaLD: A user-driven domain-specific interactive search platform for biomed...
PDF
ReVeaLD: A User-driven Domain Specific Interactive Search Platform for Biomed...
Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...
Text Snippets to Corroborate Medical Relations: An Unsupervised Approach usin...
Invited Talk at NASA Ames Research Center
Mechanism-Based Pharmacovigilance Over the Life-Sciences Linked-Open-Data Cloud
Analyzing User Interactions with Biomedical Ontologies: A Visual Perspective
BiOnIC: A Catalog of User Interactions with Biomedical Ontologies
Preproposal Talk
Graph Analytics in Pharmacology over the Web of Life Sciences Linked Open Data
BMI Research in Progress - Thursday talk
PRISM: A data-driven platform for monitoring mental health
Integrating Wearables and User Interaction Patterns to Monitor Mental Health
Current advances to bridge the usability-expressivity gap in biomedical seman...
GenomeSnip: Fragmenting the Genomic Wheel to augment discovery in cancer rese...
Isolation and characterization of an extracellular antifungal protein from an...
ReVeaLD: A user-driven domain-specific interactive search platform for biomed...
ReVeaLD: A User-driven Domain Specific Interactive Search Platform for Biomed...

Recently uploaded (20)

PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
Supervised vs unsupervised machine learning algorithms
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPT
ISS -ESG Data flows What is ESG and HowHow
PDF
Foundation of Data Science unit number two notes
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PDF
Business Analytics and business intelligence.pdf
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PDF
annual-report-2024-2025 original latest.
PDF
Lecture1 pattern recognition............
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
Mega Projects Data Mega Projects Data
PDF
Clinical guidelines as a resource for EBP(1).pdf
climate analysis of Dhaka ,Banglades.pptx
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Supervised vs unsupervised machine learning algorithms
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
ISS -ESG Data flows What is ESG and HowHow
Foundation of Data Science unit number two notes
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
Business Analytics and business intelligence.pdf
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
oil_refinery_comprehensive_20250804084928 (1).pptx
STUDY DESIGN details- Lt Col Maksud (21).pptx
annual-report-2024-2025 original latest.
Lecture1 pattern recognition............
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Data_Analytics_and_PowerBI_Presentation.pptx
Mega Projects Data Mega Projects Data
Clinical guidelines as a resource for EBP(1).pdf

Investigating Term Reuse and Overlap in Biomedical Ontologies

Editor's Notes

  • #3: To support the interoperability, the Unified Medical Language System (UMLS) uses the notion of a Concept Unique Identifier (CUI) to map terms with similar meaning in different terminologies
  • #4: To support the interoperability, the Unified Medical Language System (UMLS) uses the notion of a Concept Unique Identifier (CUI) to map terms with similar meaning in different terminologies
  • #5: Ghazvinian, Amir, Natalya Fridman Noy, and Mark A. Musen. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
  • #6: Ghazvinian, Amir, Natalya Fridman Noy, and Mark A. Musen. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
  • #7: Ghazvinian, Amir, Natalya Fridman Noy, and Mark A. Musen. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
  • #8: Ghazvinian, Amir, Natalya Fridman Noy, and Mark A. Musen. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
  • #9: Ghazvinian, Amir, Natalya Fridman Noy, and Mark A. Musen. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
  • #10: Ghazvinian, Amir, Natalya Fridman Noy, and Mark A. Musen. "How orthogonal are the OBO Foundry ontologies?." J. Biomedical Semantics 2.S-2 (2011): S2.
  • #11: Contributions: A set of descriptive statistics describing the level of reuse in biomedical ontologies stored in BioPortal, An interactive visualization technique for displaying the reuse dependencies among biomedical ontologies A clustering method to help identify patterns of reuse using semantic similarity between the terms A discussion on the state and challenges of reuse in biomedical ontologies and development of a semi-automated tool enabling reuse
  • #12: Contributions: A set of descriptive statistics describing the level of reuse in biomedical ontologies stored in BioPortal, An interactive visualization technique for displaying the reuse dependencies among biomedical ontologies A clustering method to help identify patterns of reuse using semantic similarity between the terms A discussion on the state and challenges of reuse in biomedical ontologies and development of a semi-automated tool enabling reuse
  • #13: Contributions: A set of descriptive statistics describing the level of reuse in biomedical ontologies stored in BioPortal, An interactive visualization technique for displaying the reuse dependencies among biomedical ontologies A clustering method to help identify patterns of reuse using semantic similarity between the terms A discussion on the state and challenges of reuse in biomedical ontologies and development of a semi-automated tool enabling reuse
  • #14: Dresden Ontology Generator for Directed Acyclic Graphs MIREOT Principles
  • #15: Dresden Ontology Generator for Directed Acyclic Graphs MIREOT Principles
  • #16: Dresden Ontology Generator for Directed Acyclic Graphs MIREOT Principles
  • #21: Contributions: A set of descriptive statistics describing the level of reuse in biomedical ontologies stored in BioPortal, An interactive visualization technique for displaying the reuse dependencies among biomedical ontologies A clustering method to help identify patterns of reuse using semantic similarity between the terms A discussion on the state and challenges of reuse in biomedical ontologies and development of a semi-automated tool enabling reuse
  • #24: 175,347 terms (3.1%) were explicitly shared using the same IRIs. Source ontology for all but 37 terms, whose ontologies were not present in BioPortal (e.g., owl:Thing and time#datetimedescription). After removing the imported ontology terms (term reuse > 35% threshold), only 59,618 terms (1.1%) were actually reused We found a total of 4,370,350 xref axioms across all the BioPortal ontologies. After extracting xrefs, which assert equivalence between BioPortal ontology terms, we found 171,069 ‘outlinking’ terms (3.9%) xref-linked to 386,442 `inlinking' terms (8.84%)
  • #25: 175,347 terms (3.1%) were explicitly shared using the same IRIs. Source ontology for all but 37 terms, whose ontologies were not present in BioPortal (e.g., owl:Thing and time#datetimedescription). After removing the imported ontology terms (term reuse > 35% threshold), only 59,618 terms (1.1%) were actually reused We found a total of 4,370,350 xref axioms across all the BioPortal ontologies. After extracting xrefs, which assert equivalence between BioPortal ontology terms, we found 171,069 ‘outlinking’ terms (3.9%) xref-linked to 386,442 `inlinking' terms (8.84%)
  • #26: 175,347 terms (3.1%) were explicitly shared using the same IRIs. Source ontology for all but 37 terms, whose ontologies were not present in BioPortal (e.g., owl:Thing and time#datetimedescription). After removing the imported ontology terms (term reuse > 35% threshold), only 59,618 terms (1.1%) were actually reused We found a total of 4,370,350 xref axioms across all the BioPortal ontologies. After extracting xrefs, which assert equivalence between BioPortal ontology terms, we found 171,069 ‘outlinking’ terms (3.9%) xref-linked to 386,442 `inlinking' terms (8.84%)
  • #27: 175,347 terms (3.1%) were explicitly shared using the same IRIs. Source ontology for all but 37 terms, whose ontologies were not present in BioPortal (e.g., owl:Thing and time#datetimedescription). After removing the imported ontology terms (term reuse > 35% threshold), only 59,618 terms (1.1%) were actually reused We found a total of 4,370,350 xref axioms across all the BioPortal ontologies. After extracting xrefs, which assert equivalence between BioPortal ontology terms, we found 171,069 ‘outlinking’ terms (3.9%) xref-linked to 386,442 `inlinking' terms (8.84%)
  • #31: BFO, PATO(Phenotypic Quality Ontology) CARO (Core anatomy reference ontology), UO (Units of Measurement) and SO (Sequence and Cell Feature Types ontology)
  • #32: BFO, PATO(Phenotypic Quality Ontology) CARO (Core anatomy reference ontology), UO (Units of Measurement) and SO (Sequence and Cell Feature Types ontology)
  • #34: Healthcare Common Procedure Coding System (HCPCS) Current Procedural Terminology
  • #35: Healthcare Common Procedure Coding System (HCPCS) Current Procedural Terminology
  • #36: Healthcare Common Procedure Coding System (HCPCS) Current Procedural Terminology
  • #37: Healthcare Common Procedure Coding System (HCPCS) Current Procedural Terminology
  • #38: Healthcare Common Procedure Coding System (HCPCS) Current Procedural Terminology
  • #39: Healthcare Common Procedure Coding System (HCPCS) Current Procedural Terminology
  • #40: Healthcare Common Procedure Coding System (HCPCS) Current Procedural Terminology
  • #42: Executing normalised string matching on the term labels, we found a term overlap of 823,621 shared term labels (14.4%). Removing explicitly-reused terms, list reduced to 752,176 labels (13.2%). Removing terms mapped to the same UMLS CUI, list reduced to 617,509 labels (10.8%). On extracting the resource identifier from each term IRI, we removed terms with almost similar term IRIs (same identifier and source ontology, but a different or incorrect representation) List reduced to 93,650 term labels (1.6%). The last step does not represent actual reuse between ontologies, but rather that ontology developers showed an intention to reuse terms, but used different and sometimes incorrect term representations (discussed below)
  • #44: SO (
  • #45: SO (
  • #46: SO (
  • #47: SO (
  • #48: SO (
  • #50: Contributions: A set of descriptive statistics describing the level of reuse in biomedical ontologies stored in BioPortal, An interactive visualization technique for displaying the reuse dependencies among biomedical ontologies A clustering method to help identify patterns of reuse using semantic similarity between the terms A discussion on the state and challenges of reuse in biomedical ontologies and development of a semi-automated tool enabling reuse
  • #51: Term-ontology matrix. The rows contain the explicitly-reused terms and the columns contain the ontology in which the term appears. Sparse K-means algorithm with the Gap-Estimate method (K=6) For each pair of terms in each cluster, we compute similarity scores. Use spectral clustering method with the term-term affinity matrix.
  • #52: Term-ontology matrix. The rows contain the explicitly-reused terms and the columns contain the ontology in which the term appears. Sparse K-means algorithm with the Gap-Estimate method (K=6) For each pair of terms in each cluster, we compute similarity scores. Use spectral clustering method with the term-term affinity matrix.
  • #53: Term-ontology matrix. The rows contain the explicitly-reused terms and the columns contain the ontology in which the term appears. Sparse K-means algorithm with the Gap-Estimate method (K=6) For each pair of terms in each cluster, we compute similarity scores. Use spectral clustering method with the term-term affinity matrix.
  • #58: Reuse dependencies could guide term reuse based on the structure of ontologies in related domains. Identifying reuse patterns and providing personalized recommendations could help increase term reuse.
  • #59: Item-based Collaborative Filtering Method (used by Amazon) to provide term reuse recommendations to users through a Web Protégé Plugin, and also allow automated updating. Two-fold Evaluation a posteriori: check if the term-reuse recommendations match those actually reused by users, as analyzed from the logs user-centered: monitoring term reuse when developers build an ontology combining existing ontologies, and surveys