Experiment Markup Language: A Combined Markup Language and Ontology to Represent Science
Stuart J. Chalk
Department of Chemistry
University of North Florida
schalk@unf.edu
2014 Spring ACS Meeting – CINF Paper 19
 Digital Representation of Science
 Electronic Notebooks
 The Eureka Research Workbench
 Experiment Markup Language
 ExptML Schema and Files
 Semantic Data and Ontologies
 File Storage
 Eureka Interface
 Web Interface
 Conclusion
Outline
 Most research on digital science is focused on the data
 Standards exist for the digital representation of:
   Data: individual measurements, time series, spectra
   Molecules
   Chemical reactions
 Context is important!
 Context can be added ad hoc
 It needs to be added systematically to be searchable
 We need a digital representation of the scientific process
Digital Representation of Science
 Conceptualized in 2006
 Need a way to store:
   Research activities
   Laboratory resources
   Data
 Need to capture the workflow of scientists – not define it
 Writing in a lab notebook is equivalent to blogging…
 …but the context of the entries is important and varies
 Many data types, so how to capture information?
 Experiment Markup Language (ExptML)
Eureka Research Workbench
 A specification (written in XML) that describes the different types of information recorded during the scientific process (http://exptml.sourceforge.net)
Experiment Markup Language (ExptML)
 Annotation
 Api
 Calculation
 Chemical
 Citation
 Customer
 Data
 Dataset
 Definition
 Element
 Equipment
 Event
 Experiment
 Group
 Message
 Project
 Protocol
 Quote
 Report
 Result
 Sample
 Solution
 Space
 Specimen
 Substance
 Task
 Template
 Timeline
 User
 Vendor
ExptML Chemical Schema
ExptML Chemical (Instance)
 To allow ExptML to capture a scientific workflow, an ontology is needed to represent the structure
 The ontology needs to be:
   Flexible – able to be used in a wide variety of areas
   Logical – the links make sense in the context of science
   Searchable – so we can find research done in a similar way
   Comprehensive! This is the BIG problem
 Many existing ontologies
Linking ExptML Files
 In computer science, an ontology “formally represents knowledge as a set of concepts within a domain, and the relationships between those concepts. It can be used to model a domain and support reasoning about concepts.”*
 In essence, an ontology allows us to define relationships between concepts and assertions about them
 For samples represented in ExptML we define:
   isSample (assertion)
   hasSample (relationship)
   isSampleOf (relationship)
ExptML Ontology
*https://en.wikipedia.org/wiki/Ontology_(information_science)
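The sample assertion and relationships listed above can be sketched as subject-predicate-object triples. This is a minimal illustration rather than the ExptML implementation: the identifiers `exptml:exp1` and `exptml:smp1`, the helper `inverse_triples`, and the assumption that `hasSample` and `isSampleOf` are inverses of each other are all hypothetical.

```python
# Hypothetical identifiers; the slides give no concrete sample or experiment ids.
EXPERIMENT = "exptml:exp1"
SAMPLE = "exptml:smp1"

# Assumption: hasSample and isSampleOf are inverses of each other.
INVERSES = {"hasSample": "isSampleOf", "isSampleOf": "hasSample"}


def inverse_triples(triples):
    """Return the triples implied by the inverse-property table."""
    implied = set()
    for subject, predicate, obj in triples:
        if predicate in INVERSES:
            implied.add((obj, INVERSES[predicate], subject))
    return implied


# One assertion (isSample) and one relationship (hasSample).
asserted = {
    (SAMPLE, "isSample", True),
    (EXPERIMENT, "hasSample", SAMPLE),
}
graph = asserted | inverse_triples(asserted)
```

Stating only `hasSample` is enough here; the `isSampleOf` triple is derived, which is the kind of reasoning an ontology is meant to support.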
ExptML Ontology
 XML is nice for storing, archiving and transmitting information…
 …but it is not so easy to use in software
 Many XML readers exist, but each has its own syntax
 The following can be cumbersome to deal with in software:
   File size (XML is verbose)
   Namespaces
   Data types (e.g. string, decimal, etc.)
 So the solution is…
Developments in ExptML
 JSONize it!
 Compact string representation of arrays of data
 Used in AJAX requests in web browsers
JavaScript Object Notation (JSON)
{
  "exptmlid": "exptml:ann1",
  "anntype": "comment",
  "text": "Had to wait for the biochemistry lab to finish using the spectrophotometer before I could get on it. The standards sat around for 1 hr 30 minutes before I could run them.",
  "date": "2011-11-25T11:05:17-04:00"
}
<annotation id="exptml_ann1" xmlns="urn:exptml:schema:draft:0.4"
    xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
    xsi:schemaLocation="urn:exptml:schema:draft:0.4 http://exptml.sourceforge.net/files/schema/exptml_annotation.xsd"
    version="0.4">
  <anntype>comment</anntype>
  <text>Had to wait for the biochemistry lab to finish using the spectrophotometer before I could get on it. The standards sat around for 1 hr 30 minutes before I could run them.</text>
  <date>2011-11-25T11:05:17-04:00</date>
</annotation>
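As a rough sketch of how the XML annotation above could be turned into its JSON counterpart, the following uses only the Python standard library. The function name `annotation_to_dict` is hypothetical, the annotation text is shortened, the `xsi:schemaLocation` attribute is omitted for brevity, and the rule that the XML id `exptml_ann1` corresponds to the JSON id `exptml:ann1` is an assumption drawn from the two examples.

```python
import json
import xml.etree.ElementTree as ET

# Namespace taken from the annotation example above.
EXPTML_NS = "urn:exptml:schema:draft:0.4"

ANNOTATION_XML = """<annotation id="exptml_ann1" xmlns="urn:exptml:schema:draft:0.4" version="0.4">
  <anntype>comment</anntype>
  <text>The standards sat around for 1 hr 30 minutes before I could run them.</text>
  <date>2011-11-25T11:05:17-04:00</date>
</annotation>"""


def annotation_to_dict(xml_text):
    """Flatten an ExptML annotation element into a JSON-ready dict."""
    root = ET.fromstring(xml_text)
    prefix = "{" + EXPTML_NS + "}"
    # Assumption: "exptml_ann1" in XML corresponds to "exptml:ann1" in JSON.
    record = {"exptmlid": root.attrib["id"].replace("_", ":", 1)}
    for child in root:
        # ElementTree reports tags as "{namespace}name"; strip the namespace.
        name = child.tag[len(prefix):] if child.tag.startswith(prefix) else child.tag
        record[name] = (child.text or "").strip()
    return record


annotation = annotation_to_dict(ANNOTATION_XML)
annotation_json = json.dumps(annotation, indent=2)
```

The namespace stripping is the step that makes XML awkward to consume; once the record is a plain dict, `json.dumps` and `json.loads` need no schema or namespace handling.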
 JSON-based Serialization for Linked Data
 Current W3C Recommendation*
 Allows us to define a specification for the JSON data
 “@context” is roughly equivalent to an XML Schema
JSON-LD
*http://www.w3.org/TR/json-ld
{
  "@context": {
    "exptmlid": "http://www.w3.org/2001/XMLSchema#string",
    "anntype": "http://www.w3.org/2001/XMLSchema#string",
    "text": "http://www.w3.org/2001/XMLSchema#string",
    "date": "http://www.w3.org/2001/XMLSchema#dateTime"
  }
}
JSON-LD
{
  "@context": {
    "exptmlid": "http://www.w3.org/2001/XMLSchema#string",
    "anntype": "http://www.w3.org/2001/XMLSchema#string",
    "text": "http://www.w3.org/2001/XMLSchema#string",
    "date": "http://www.w3.org/2001/XMLSchema#dateTime"
  },
  "exptmlid": "exptml:ann1",
  "anntype": "comment",
  "text": "Had to wait for the biochemistry lab to finish using the spectrophotometer before I could get on it. The standards sat around for 1 hr 30 minutes before I could run them.",
  "date": "2011-11-25T11:05:17-04:00"
}
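To illustrate how software might use the inline "@context" above as a datatype map, here is a minimal sketch in Python. It follows the slides' simplified reading of the context (term mapped directly to an XML Schema datatype); a conforming JSON-LD processor interprets contexts as term-to-IRI mappings and would not behave this way. The `CONVERTERS` table and the `typed_values` helper are hypothetical, and the annotation text is shortened.

```python
from datetime import datetime

XSD = "http://www.w3.org/2001/XMLSchema#"

document = {
    "@context": {
        "exptmlid": XSD + "string",
        "anntype": XSD + "string",
        "text": XSD + "string",
        "date": XSD + "dateTime",
    },
    "exptmlid": "exptml:ann1",
    "anntype": "comment",
    "text": "The standards sat around for 1 hr 30 minutes before I could run them.",
    "date": "2011-11-25T11:05:17-04:00",
}

# Hypothetical converters keyed by datatype IRI; only the two types used above.
CONVERTERS = {
    XSD + "string": str,
    XSD + "dateTime": datetime.fromisoformat,
}


def typed_values(doc):
    """Convert each value using the datatype its key is mapped to in @context."""
    context = doc["@context"]
    typed = {}
    for key, value in doc.items():
        if key == "@context":
            continue
        if key not in context:
            raise KeyError(f"term {key!r} is not declared in @context")
        typed[key] = CONVERTERS[context[key]](value)
    return typed


values = typed_values(document)
```

After this step `values["date"]` is a timezone-aware `datetime` rather than a string, which addresses the data-type concern raised for XML without a separate schema file.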
 @id represents an Internationalized Resource Identifier (IRI)
 The IRI identifies a node and allows this data to be linked
JSON-LD
{
  "@context": "http://exptld.org/annotation.jsonld",
  "@id": "https://eureka.coas.unf.edu/exptml:ann1",
  "anntype": "comment",
  "text": "Had to wait for the biochemistry lab to finish using the spectrophotometer before I could get on it. The standards sat around for 1 hr 30 minutes before I could run them.",
  "date": "2011-11-25T11:05:17-04:00"
}
 Currently, the ontology defines only generic relationships
 It should be expanded to provide additional context
Developments in the Ontology
<rdf:Property rdf:about="http://exptml.sourceforge.net/exptml_ontology.owl#hasSolution">
  <rdfs:label>has solution</rdfs:label>
  <rdfs:comment>Indicates that an experiment makes use of a particular solution</rdfs:comment>
  <rdfs:subPropertyOf rdf:resource="http://exptml.sourceforge.net/exptml_ontology.owl#rels"/>
</rdf:Property>
<rdf:Property rdf:about="http://exptml.sourceforge.net/exptml_ontology.owl#hasBuffer">
  <rdfs:label>has buffer</rdfs:label>
  <rdfs:comment>Indicates that an experiment makes use of a buffer (solution)</rdfs:comment>
  <rdfs:subPropertyOf rdf:resource="http://exptml.sourceforge.net/exptml_ontology.owl#hasSolution"/>
</rdf:Property>
<rdf:Property rdf:about="http://exptml.sourceforge.net/exptml_ontology.owl#hasReagent">
  <rdfs:label>has reagent</rdfs:label>
  <rdfs:comment>Indicates that an experiment makes use of a reagent (solution)</rdfs:comment>
  <rdfs:subPropertyOf rdf:resource="http://exptml.sourceforge.net/exptml_ontology.owl#hasSolution"/>
</rdf:Property>
<rdf:Property rdf:about="http://exptml.sourceforge.net/exptml_ontology.owl#hasCalibrationStandard">
  <rdfs:label>has calibration standard</rdfs:label>
  <rdfs:comment>Indicates that an experiment makes use of a calibration standard</rdfs:comment>
  <rdfs:subPropertyOf rdf:resource="http://exptml.sourceforge.net/exptml_ontology.owl#hasSolution"/>
</rdf:Property>
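The practical value of the sub-property hierarchy above is that a search for the generic relationship can also find the more specific ones. The sketch below hand-copies the four rdfs:subPropertyOf statements into a Python dict and walks them; the helper names `super_properties` and `matches` are hypothetical, and a real system would more likely load the OWL file with an RDF library and use a reasoner.

```python
BASE = "http://exptml.sourceforge.net/exptml_ontology.owl#"

# child property -> parent property, copied from the rdfs:subPropertyOf statements above.
SUB_PROPERTY_OF = {
    BASE + "hasSolution": BASE + "rels",
    BASE + "hasBuffer": BASE + "hasSolution",
    BASE + "hasReagent": BASE + "hasSolution",
    BASE + "hasCalibrationStandard": BASE + "hasSolution",
}


def super_properties(prop):
    """Return every ancestor of prop, nearest first, following subPropertyOf links."""
    ancestors = []
    seen = {prop}
    while prop in SUB_PROPERTY_OF:
        prop = SUB_PROPERTY_OF[prop]
        if prop in seen:  # guard against accidental cycles
            break
        seen.add(prop)
        ancestors.append(prop)
    return ancestors


def matches(query_prop, stated_prop):
    """True if a statement using stated_prop also answers a query for query_prop."""
    return stated_prop == query_prop or query_prop in super_properties(stated_prop)
```

With this, `matches(BASE + "hasSolution", BASE + "hasBuffer")` is true while the reverse is false: every buffer is a solution, but not every solution is a buffer.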
 BIG Problem!
 Context is specific to the science and the scientist
 How many sub-properties of “hasSolution” are needed?
 Additional context is domain-specific so…
 …we need to integrate other related ontologies
 Map “hasSolution” to predicates in other ontologies
 Use VIVO to choose the ‘best’ domain-specific ontology
 Aggregate science ontologies? – requires software/time
 Evaluate ElasticSearch (http://www.elasticsearch.org)
Expand the Ontology
 JSON-LD is a concrete RDF syntax!*
 JSON-LD can be converted to triples
Combine ML and Ontology?
*http://www.w3.org/TR/json-ld/#relationship-to-rdf
{
  "@context": "http://exptld.org/annotation.jsonld",
  "@id": "https://eureka.coas.unf.edu/exptml:ann1",
  "anntype": "comment",
  "text": "Had to wait for the biochemistry lab to finish using the spectrophotometer before I could get on it. The standards sat around for 1 hr 30 minutes before I could run them.",
  "date": "2011-11-25T11:05:17-04:00",
  "hasUser": [
    { "@id": "https://eureka.coas.unf.edu/exptml:usr1" },
    { "@id": "https://eureka.coas.unf.edu/exptml:usr11" }
  ],
  "hasExperiment": { "@id": "https://eureka.coas.unf.edu/exptml:exp1" }
}
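The claim that JSON-LD can be converted to triples can be sketched for the node above as follows. This is a simplified stand-in for a real JSON-LD-to-RDF conversion: the remote context file is not shown in the slides, so the sketch assumes every term resolves against the ExptML ontology namespace (the `VOCAB` constant), the helper `to_triples` is hypothetical, and the "text" property is left out to keep the example short.

```python
# Assumption: terms resolve against the ExptML ontology namespace; the real
# mapping lives in the remote context file, which is not shown in the slides.
VOCAB = "http://exptml.sourceforge.net/exptml_ontology.owl#"

node = {
    "@context": "http://exptld.org/annotation.jsonld",
    "@id": "https://eureka.coas.unf.edu/exptml:ann1",
    "anntype": "comment",
    "date": "2011-11-25T11:05:17-04:00",
    "hasUser": [
        {"@id": "https://eureka.coas.unf.edu/exptml:usr1"},
        {"@id": "https://eureka.coas.unf.edu/exptml:usr11"},
    ],
    "hasExperiment": {"@id": "https://eureka.coas.unf.edu/exptml:exp1"},
}


def to_triples(doc):
    """Flatten one JSON-LD node into (subject, predicate, object) tuples."""
    subject = doc["@id"]
    triples = []
    for term, value in doc.items():
        if term.startswith("@"):
            continue  # keywords such as @context and @id are not predicates
        values = value if isinstance(value, list) else [value]
        for item in values:
            # A dict holding @id is a link to another node; anything else is a literal.
            obj = item["@id"] if isinstance(item, dict) else item
            triples.append((subject, VOCAB + term, obj))
    return triples


triples = to_triples(node)
```

Each entry in the "hasUser" array becomes its own triple, so the single annotation node yields five statements that share the annotation IRI as subject.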
 Nice start - allows for conceptual evaluation of the approach
 Needs work – “science cannot be described by one alone”
 TODO
   Integrate and aggregate existing ontologies
   Work with ELN developers, e.g. LabTrove and elnItemManifest*
   Encourage ontology development in areas where gaps exist, e.g. Chemical Analysis
   Contribute to standards development, e.g. Research Data Alliance (RDA) – http://rd-alliance.org
Conclusion
* “First steps towards semantic descriptions of electronic laboratory notebook records”, S J Coles, J G Frey, C L Bird, R J Whitby and A E Day, J. Cheminformatics, 2013, 5:52, http://dx.doi.org/10.1186/1758-2946-5-52
References
 Eureka – http://sourceforge.net/projects/eureka
 Fedora-Commons – http://fedora-commons.org
 XML – http://www.w3.org/standards/xml
 ExptML – http://exptml.sourceforge.net/
 JSON-LD – http://www.w3.org/TR/json-ld
 UnitsML – http://unitsml.nist.gov/
 RDF – http://www.w3.org/RDF/
 CIR – http://cactus.nci.nih.gov/chemical/structure
 RDA – http://rd-alliance.org
 http://www.nytimes.com/2013/08/13/science/how-to-share-scientific-data.html
247th ACS Meeting: Experiment Markup Language (ExptML)
