SlideShare a Scribd company logo
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz




Realising the Potential of Algal Biomass
Production through Semantic Web and
              Linked data
        The LEAPS Framework

               Monika Solanki
       Knowledge Based Engineering Lab
        Birmingham City University, UK
                   Joint work with
                Johannes Skarka
       Karlsruhe Institute of Technology, ITAS
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Outline

1   Motivation

2   Modelling Algal Biomass Knowledge

3   Lifting XML datasets to Linked data

4   System Architecture

5   Querying Linked Algal Biomass Data

6   Conclusion and Future work
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz




      Motivation
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Algal biomass as biofuels
   Extensive research* is being undertaken in the search and
   production of naturally viable and sustainable energy
   sources.
   The idea that algae biomass based biofuels could serve as
   an alternative to fossil fuels has been embraced by
   councils across the globe.
   Major companies, government bodies and dedicated non
   profit organisations* are getting involved.
   The domain is a rich source of data/information/knowledge.


                        *http://www.algalbiomass.org/
                     *http://www.eaba-association.eu/
                              *http://www.enalgae.eu/
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Algal biomass as biofuels: Observations

   No systematic analysis of the algae biomass potential for
   North-Western Europe.
   Most of the knowledge buried in various formats of images,
   spreadsheets, proprietary data sources and grey literature.
   Lack of a knowledge level infrastructure that is equipped
   with the capabilities to provide semantic grounding to the
   datasets for algal biomass.
   Low levels of motivation among stakeholders, for datasets
   to be interlinked, shared and reused within the biomass
   community.
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


LEAPS: A Potential Solution
Linked Entities for Algal Plant Sites
    motivate the use of Semantic Web technologies and LOD
    for the algal biomass domain.
    laying out a set of ontological requirements for knowledge
    representation that support the publication of algal
    biomass data.
    elaborating on how algal biomass datasets are transformed
    to their corresponding RDF model representation.
    interlinking the generated RDF datasets along spatial
    dimensions with other datasets on the Web of data.
    visualising the linked datasets via an end user LOD REST
    Web service.
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


EnAlgae: Energetic Algae
  Aims to reduce CO2 emissions and dependency on
  unsustainable energy sources in North West Europe.
  4 Year Strategic initiative of Interreg IVb NWE programme.

  19 partners and 14 Observers across 7 EU states.

  Coordinated set of activities focussing on sharing best
  practice, developing effective stakeholder engagement and
  encouraging transnational cooperation.



                                       http://www.enalgae.eu/
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


EnAlgae: Some of the objectives
   Accelerate development of sustainable technologies for
   Biomass production.
   Create a network of pilot scale algal facilities across NWE
   in order to address the current lack of verifiable information
   on algal productivity.
   Maintain an up to date inventory in which pilots collect and
   share data in a standardised manner.
   Combine information across the entire algal bioenergy
   delivery chain into a comprehensive and user friendly
   Decision Support System for practitioners, policy makers
   and investors


                                         http://www.enalgae.eu/
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


SW and Linked data for Algal Biomass

  Algal biomass data manifests itself across several facets.
  The value/supply chain ranges from cultivation of algae to
  production of biofuels and other products.
  Cultivation, harvesting, processing and fuel production
  further involves several intermediate processes.
  Every stage in the algal supply chain is governed by
  requirements, regulatory policies and strategies.
  Each of the facets consumes and produces a large volume
  of unstructured data and information.
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Algal Supply Chain
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


SW, Linked data and the Algal Supply Chain
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Competency questions for stage 1 datasets
Data driven
    Which are the algal operation sites with CO2 sources that
    have CO2 emissions less than 130000 kgs, where total
    costs of supplying CO2 is lower then 5000 GBP per ton of
    CO2 , areal yield is greater than 30 tons per hectare and
    which are located within the NUTS region “UKM61”?
    Supplement the data with supporting information about the
    region.
    Which are the top ten algal operation sites with the lowest
    impact on global warming potential?
    For a given algal operation site which are the first five most
    cost effective combinations of light, water, nutrients and
    CO2 sources?
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz




Modelling Algal Biomass
      Knowledge
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Ontological requirements
Ontologies needed to represent
    Spatiality: location of possible algae cultivation sites,
    location of the sources of consumables (CO2 , nutrients
    and water).
    Geometries: area of the cultivation site - extents,
    polygons, linear and ring arrays.
    Units and Measurements: conventional measurement
    units such as Kgs for quantities and hectares for area,
    bespoke units of measurements, i.e., Kgs/hectare or
    Kgs/annum.
    Territorial units for statistics: core concepts of the NUTS
    system.
    Domain specific knowledge: algae cultivation sites, CO2
    sources, pipelines.
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Ontologies for Algal Biomass: Reuse
   Spatial Data: WGS84, spatial relations, Geonames,
   NeoGeo
   Geometries: WGS84, extended NeoGeo.
   Units and Measurements: extended QUDT


                       http://www.w3.org/2003/01/geo/wgs84_pos
           http://www.ordnancesurvey.co.uk/oswebsite/ontology/
                                          spatialrelations.owl
          http://www.geonames.org/ontology/ontology_v2.2.1.rdf
                                  http://geovocab.org/geometry
                     http://qudt.org/1.1/vocab/dimensionalunit
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Ontologies for Algal Biomass: Reuse
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Ontologies for Algal Biomass: Domain
knowledge
  Ontologies for modelling spatial knowledge, units and
  measurements were reused.
  Discovering vocabularies conceptualising the domain
  knowledge for algal biomass was non trivial.
  Concepts and relationships for algal biomass had to be
  defined from ground-up in accordance to the principles of
  ontology development
  The design was very strongly guided by feedback from
  questionnaires made available to the stakeholders,
  interviews with domain experts, providers of raw datasets
  and grey literature from the algal biomass and biofuels
  domain.
Ontologies for Algal Biomass: Domain
knowledge




                 Ontologies available at http:/purl.org/biomass/ontologies
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Designing URIs for Algal Biomass Data
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz




Lifting XML datasets to
      Linked data
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Lifting XML datasets to Linked data
Raw data
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Lifting XML datasets to Linked data
First step
   The first part of the data processing and the potential
   calculation are performed in a GIS-based model which was
   developed for this purpose using ArcGIS.
   Raw datasets with various origins and formats -
   transformed using bespoke computational algorithms to an
   ArchGIS specific XML format.
       brings uniformity in the format of representation of the
       datasets and in the process of transformation.
       important computations that are part of the final datasets
       are performed.
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Lifting XML datasets to Linked data
Second step
   The original data sources had several limitations and a
   one-to-one transformation was not possible.
       The XML data sources related the biomass production sites
       and the CO2 sources via the pipeline dataset.
       In order to query for all sources that supplied CO2 to a
       specific site, the query would have to be made via the
       pipeline dataset.
       The site, source and NUTS identifiers in the datasets were
       string literals rather than URIs.
   A bespoke parser that exploits XPath to selectively query
   the XML datasets and generate linked data was
   implemented.
   It utilises a complex underlying data structure to facilitate
   the transformation.
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Lifting XML datasets to Linked data
   Four datasets were transformed and stored in distributed
   triple store repositories.
   The NUTS regions dataset in RDF was available but there
   was no SPARQL endpoint or service to query the dataset.
   We retrieved the dataset dump and curated it in our local
   triple store as a separate repository.
   The transformed datasets interlinked resources defining
   sites, CO2 sources, pipelines, regions and NUTS data.
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Lifting XML datasets to Linked data
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz




System Architecture
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


System Architecture
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Architecture: Main components
                                    Parsing modules: lifting the data
                                    from their original formats to RDF.
                                    Ontologies.
                                    Linking engine: producing the linked
                                    data representation of the datasets.
                                    Triple store: OWLIM SE 5.0.
                                    REST Web services.
                                    SPARQL endpoints.
                                    Web Interface.
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Querying Linked Algal Biomass Data
    Most queries over the datasets are based on retrieving
    knowledge centered around location information.
    The queries are federated across the various repositories
    holding the linked data.
    Representative Query:
Which are the algal operation sites with CO2 sources that have
CO2 emissions less than 130000 kgs, where total costs of
supplying CO2 is lower then 5000 GBP per ton of CO2 , areal
yield is greater than 30 tons per hectare and which are located
within the NUTS region “UKM61”? Supplement the data with
supporting information about the region.
Typical Query
WHERE {
    SERVICE <http://localhost/repositories/biomass>
    { ?site a site:OperationSite;
      site:inNUTSRegion ?region;
      geo:location ?loc. ?loc
      geo:lat ?lat.
      ?loc geo:long ?long.
      ?site site:hasSiteID ?siteID;
      site:hasArealYield ?z.
      ?z qudt:quantityValue ?y.
      ?y qudt:numericValue ?arealYield.
      ?y qudt:unit ?unit.
  }
  SERVICE <http://localhost/repositories/co2source>
  { ?source a co2:CO2Source;
     co2:hasSourceID ?sourceID;
     co2:hasCO2Emission ?emission.
     ?emission qudt:quantityValue ?emissionQty.
     ?emissionQty qudt:numericValue ?emissionValue.
  }

                                              continued...
Typical Query
    SERVICE <http://localhost/repositories/pipeline>
    { ?pipe a pipe:Pipeline;
      pipe:hasSiteID ?siteID;
      pipe:hasSourceID ?sourceID;
      pipe:hasTotalCO2Cost ?cost.
      ?cost qudt:quantityValue ?qty.
      ?qty qudt:numericValue ?totalCO2CostValue.
      ?qty qudt:unit ?totalCO2CostUnit.
    }
    SERVICE <http://localhost/repositories/region>
    { regionID a ramon:NUTSRegion;
      owl:sameAs ?related
    }
    FILTER((?emissionValue < 130000)
          && (contains(str(?region), "UKM61"))
          && (?arealYield > 30)
          && (?totalCO2CostValue < 5000))
}
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz




Related Efforts,
Conclusions and
  Future Work
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Related efforts
   AquaFuels*: a taxonomy of algal strains available as PDF.
   BioEnerGIS *: a GIS based Decision support tool,
   BIOPOLE, for biomass plants feeding district heating
   systems.
   BioKDF *: Bioenergy knowledge discovery framework from
   the U.S. department of Energy.
   Reegle *: various energy related datasets as linked open
   data and a SPARQL endpoint to access the datasets.


                                   *http://www.aquafuels.eu/
                                  *http://www.bioenergis.eu/
                                  *https://bioenergykdf.net/
                                    *http://data.reegle.info
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Conclusions
  Investigations into using algal biomass as an alternative
  source of fuel is gaining widespread momentum.
  The Algal biomass community currently does not employ
  any knowledge representation techniques to formalise and
  structure valuable knowledge harnessed through their
  operations.
  As research in the sector progresses, a wealth of
  information will be available that could be exploited by
  domain specific applications.
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Summary
The LEAPS framework exploits SW and LD for the algal
biomass community,
    enabling the screening of data for promising individual
    plant sites and provides base data for more detailed
    planning purposes.
    proposing a set of domain specific ontologies for algal
    plant sites, CO2 and pipelines to be shared and extended
    by the community.
    defining a linked data publishing architecture that
    transforms raw data in disparate formats to a uniform XML
    representation.
    using a set of well established and domain specific
    ontologies as metadata to transform it further into linked
    data.
    providing various data access options such as a SPARQL
    endpoint, an interactive Google map interface and a REST
    API for making the data accessible to stakeholders.
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz


Future Work
  Several other datasets need to be integrated once they
  become available.
  One of the core datasets - algal strains from Algaebase*.
  Multifaceted visualisation of the integrated datasets to
  facilitate the uptake of the framework by stakeholders.
  Rule based reasoning to model and inference domain
  specific constraints.


                                  *http://www.algaebase.org/
monika.solanki@bcu.ac.uk   I-Semantics 2012, 5th September 2012, Graz




Many Thanks!!!

More Related Content

PDF
LEAPS: A Semantic Web and Linked data framework for the Algal Biomass Domain
PDF
From Biomass to Energy via Semantic Web and Linked data
PDF
Linked Data for Improved Vaccine Information Systems
PDF
2017 06-01-eswc2017-ug
PPTX
OpenAIRE provide dashboard #OpenAIREweek2020
PPTX
Introduction to Big data
PPTX
The BlueBRIDGE Project - Pasquale Pagano
PPTX
Ontologies and Linked Open Data in the LifeWatch Greece Research Infrastructure
LEAPS: A Semantic Web and Linked data framework for the Algal Biomass Domain
From Biomass to Energy via Semantic Web and Linked data
Linked Data for Improved Vaccine Information Systems
2017 06-01-eswc2017-ug
OpenAIRE provide dashboard #OpenAIREweek2020
Introduction to Big data
The BlueBRIDGE Project - Pasquale Pagano
Ontologies and Linked Open Data in the LifeWatch Greece Research Infrastructure

What's hot (20)

PDF
SETAC Rome Non-Target Screening For Chemical Discovery
PDF
20200901 ECCB M. Kutmon
PDF
DMCM2018 Community Resources Connecting Chemistry and Toxicity Knowledge
PPTX
OpenAIRE in the European Open Science Cloud (EOSC)
PPTX
Cloud for Research and Innovation - UK USA HPC workshop, Oxford, July 205
PPTX
Building on the Atlas (of Living Australia)
PPT
Databases
PPTX
20191119_The OpenAIRE Research Graph
PPTX
Building the FAIR Research Commons: A Data Driven Society of Scientists
PDF
agriopenlink @Precision Dairy Farming 2015 (Rochester, MN)
PPT
PHIDIAS - Boosting the use of cloud services for marine data management, serv...
PPT
Gianluca Correndo, Simon Crowle, Juri Papay and Michael Boniface | Enhancing ...
PPTX
Sebastian Bader | Semantic Technologies for Assisted Decision-Making in Indus...
PDF
Phidias: Steps forward in detection and identification of anomalous atmospher...
PDF
Long-term data curation, aka data preservation - EUDAT Summer School (Marjan ...
PPTX
An examination of data quality on QSAR Modeling in regards to the environment...
PPTX
Jack Verhoosel | Semantics in Dairy Farming: towards a Common Dairy Ontology
PPTX
e-Research & the art of linking Astrophysics to Deforestation
PPTX
Delivering The Benefits of Chemical-Biological Integration in Computational T...
PPTX
The BlueBRIDGE Project - Pasquale Pagano
SETAC Rome Non-Target Screening For Chemical Discovery
20200901 ECCB M. Kutmon
DMCM2018 Community Resources Connecting Chemistry and Toxicity Knowledge
OpenAIRE in the European Open Science Cloud (EOSC)
Cloud for Research and Innovation - UK USA HPC workshop, Oxford, July 205
Building on the Atlas (of Living Australia)
Databases
20191119_The OpenAIRE Research Graph
Building the FAIR Research Commons: A Data Driven Society of Scientists
agriopenlink @Precision Dairy Farming 2015 (Rochester, MN)
PHIDIAS - Boosting the use of cloud services for marine data management, serv...
Gianluca Correndo, Simon Crowle, Juri Papay and Michael Boniface | Enhancing ...
Sebastian Bader | Semantic Technologies for Assisted Decision-Making in Indus...
Phidias: Steps forward in detection and identification of anomalous atmospher...
Long-term data curation, aka data preservation - EUDAT Summer School (Marjan ...
An examination of data quality on QSAR Modeling in regards to the environment...
Jack Verhoosel | Semantics in Dairy Farming: towards a Common Dairy Ontology
e-Research & the art of linking Astrophysics to Deforestation
Delivering The Benefits of Chemical-Biological Integration in Computational T...
The BlueBRIDGE Project - Pasquale Pagano
Ad

Viewers also liked (20)

PDF
Building Ontologies for Algal Biomass Operations 2012
PDF
Open Knowledge Repositories: Enablers of Data Integration across Business Col...
PDF
Representing Supply Chain Events on the Web of Data
PPTX
FIspace Meat Information Provenance Trial Open Call
PPTX
How Traceability Creates Profitability
PDF
Infographic - Food and Beverage Barcode Labeling
PDF
Extending Tables with Data from over a Million Websites
PPTX
The Role of Technology in Food Processing Compliance and Traceability
PDF
Querying Linked Data and Büchi automata
PDF
The potential role of open data in supply chain integration
PDF
The Internet of Lettuces: Legibility, Data and Alternative Food Networks
PPTX
The curious case of Blockchain Technology
PDF
Linked data driven EPCIS Event-based Traceability across Supply chain busine...
PDF
Detecting EPCIS exceptions in linked traceability streams across supply cha...
PDF
Consuming Linked data in Supply Chains: Enabling data visibility via Linked P...
PPTX
Global Supply Chain Innovation Summit, Shanghai
PDF
Semantic web and Linked Data
PDF
Linked data driven EPCIS Event based Traceability across Supply chain busine...
PDF
Linking transformations in EPCIS governing supply chain business processes
PDF
How Blockchain Can Help Retailers Fight Fraud, Boost Margins and Build Brands
Building Ontologies for Algal Biomass Operations 2012
Open Knowledge Repositories: Enablers of Data Integration across Business Col...
Representing Supply Chain Events on the Web of Data
FIspace Meat Information Provenance Trial Open Call
How Traceability Creates Profitability
Infographic - Food and Beverage Barcode Labeling
Extending Tables with Data from over a Million Websites
The Role of Technology in Food Processing Compliance and Traceability
Querying Linked Data and Büchi automata
The potential role of open data in supply chain integration
The Internet of Lettuces: Legibility, Data and Alternative Food Networks
The curious case of Blockchain Technology
Linked data driven EPCIS Event-based Traceability across Supply chain busine...
Detecting EPCIS exceptions in linked traceability streams across supply cha...
Consuming Linked data in Supply Chains: Enabling data visibility via Linked P...
Global Supply Chain Innovation Summit, Shanghai
Semantic web and Linked Data
Linked data driven EPCIS Event based Traceability across Supply chain busine...
Linking transformations in EPCIS governing supply chain business processes
How Blockchain Can Help Retailers Fight Fraud, Boost Margins and Build Brands
Ad

Similar to Realising the Potential of Algal Biomass Production through Semantic Web and Linked data (20)

PDF
15 energy from_biomass_p_paul
PDF
Download full ebook of Biorefineries An Introduction instant download pdf
PDF
Biomass Power For Energy and Sustainable Development
 
PDF
biorefinery-2.pdf
PPT
Developing a network of content providers: The case of Organic.Edunet
PDF
D7.3 Web-based solutions for enviroGRIDS publication database
KEY
Algae generation for biofuel using an enclosed system
PDF
Biofuel biomass 2021 proceeding book
PDF
Basic Research Advancement For Algal Biofuels Production Neha Srivastava
PDF
Pal gov.tutorial4.session5.lab ontologytools
PPTX
Agro-Know & the European agricultural research information ecosystem
KEY
Linking Open, Big Data Using Semantic Web Technologies - An Introduction
PPT
Reorienting open repositories to the challenges of the Semantic Web: Experien...
PDF
Agrorama, Green Hackathon, Dec 16 2012
PPT
Νetworking content repositories to provide meaningful services to users
PDF
2010 1028 platt and levine sbc_spc_openforum_102810 final
PPTX
STI 2014 Schomaker & Noyons - methodological study to structure liquid biofue...
PDF
Bioenergy from wood
PDF
Harnessing Energy from Algae - TERI Energy Security Insights 2011
PPT
Aggregating Best-in-Class Green Open Educational Resources
15 energy from_biomass_p_paul
Download full ebook of Biorefineries An Introduction instant download pdf
Biomass Power For Energy and Sustainable Development
 
biorefinery-2.pdf
Developing a network of content providers: The case of Organic.Edunet
D7.3 Web-based solutions for enviroGRIDS publication database
Algae generation for biofuel using an enclosed system
Biofuel biomass 2021 proceeding book
Basic Research Advancement For Algal Biofuels Production Neha Srivastava
Pal gov.tutorial4.session5.lab ontologytools
Agro-Know & the European agricultural research information ecosystem
Linking Open, Big Data Using Semantic Web Technologies - An Introduction
Reorienting open repositories to the challenges of the Semantic Web: Experien...
Agrorama, Green Hackathon, Dec 16 2012
Νetworking content repositories to provide meaningful services to users
2010 1028 platt and levine sbc_spc_openforum_102810 final
STI 2014 Schomaker & Noyons - methodological study to structure liquid biofue...
Bioenergy from wood
Harnessing Energy from Algae - TERI Energy Security Insights 2011
Aggregating Best-in-Class Green Open Educational Resources

More from Monika Solanki (16)

PDF
Monika solanki-agrisemantics2021
PDF
What's in a field?
PDF
Enabling combined Software and Data engineering at Web-scale
PDF
Interoperability for smart appliances in the IoT world
PDF
Towards maintainable constraint validation and repair for taxonomies: The Poo...
PDF
Diversity2015
PDF
Design Intent Ontology presented at WOP2015
PDF
Ekaw2014
PDF
EPCIS Event-Based Traceability in Pharmaceutical Supply Chains via Automated ...
PDF
Reactor Pattern
PDF
Conformance To Standards: A content ontology design pattern
PDF
SEA: A Framework for Interactive Querying, Visualisation and Statistical Anal...
PDF
Pelagios 2011
PDF
Reconstructing the Chaine operatoire through Semantically Linked Open Data
PDF
Semantic web in Cultural Heritage and Archaeology
PDF
A Framework for transforming archaeological databases to ontological datasets
Monika solanki-agrisemantics2021
What's in a field?
Enabling combined Software and Data engineering at Web-scale
Interoperability for smart appliances in the IoT world
Towards maintainable constraint validation and repair for taxonomies: The Poo...
Diversity2015
Design Intent Ontology presented at WOP2015
Ekaw2014
EPCIS Event-Based Traceability in Pharmaceutical Supply Chains via Automated ...
Reactor Pattern
Conformance To Standards: A content ontology design pattern
SEA: A Framework for Interactive Querying, Visualisation and Statistical Anal...
Pelagios 2011
Reconstructing the Chaine operatoire through Semantically Linked Open Data
Semantic web in Cultural Heritage and Archaeology
A Framework for transforming archaeological databases to ontological datasets

Recently uploaded (20)

PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PDF
VCE English Exam - Section C Student Revision Booklet
PPTX
GDM (1) (1).pptx small presentation for students
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PDF
TR - Agricultural Crops Production NC III.pdf
PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
Computing-Curriculum for Schools in Ghana
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PDF
Microbial disease of the cardiovascular and lymphatic systems
PPTX
Institutional Correction lecture only . . .
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PDF
Basic Mud Logging Guide for educational purpose
PPTX
PPH.pptx obstetrics and gynecology in nursing
PDF
Insiders guide to clinical Medicine.pdf
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
2.FourierTransform-ShortQuestionswithAnswers.pdf
VCE English Exam - Section C Student Revision Booklet
GDM (1) (1).pptx small presentation for students
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
TR - Agricultural Crops Production NC III.pdf
O7-L3 Supply Chain Operations - ICLT Program
Computing-Curriculum for Schools in Ghana
Pharmacology of Heart Failure /Pharmacotherapy of CHF
102 student loan defaulters named and shamed – Is someone you know on the list?
Microbial disease of the cardiovascular and lymphatic systems
Institutional Correction lecture only . . .
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
Anesthesia in Laparoscopic Surgery in India
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
Basic Mud Logging Guide for educational purpose
PPH.pptx obstetrics and gynecology in nursing
Insiders guide to clinical Medicine.pdf
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student

Realising the Potential of Algal Biomass Production through Semantic Web and Linked data

  • 1. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Realising the Potential of Algal Biomass Production through Semantic Web and Linked data The LEAPS Framework Monika Solanki Knowledge Based Engineering Lab Birmingham City University, UK Joint work with Johannes Skarka Karlsruhe Institute of Technology, ITAS
  • 2. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Outline 1 Motivation 2 Modelling Algal Biomass Knowledge 3 Lifting XML datasets to Linked data 4 System Architecture 5 Querying Linked Algal Biomass Data 6 Conclusion and Future work
  • 3. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Motivation
  • 4. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Algal biomass as biofuels Extensive research* is being undertaken in the search and production of naturally viable and sustainable energy sources. The idea that algae biomass based biofuels could serve as an alternative to fossil fuels has been embraced by councils across the globe. Major companies, government bodies and dedicated non profit organisations* are getting involved. The domain is a rich source of data/information/knowledge. *http://www.algalbiomass.org/ *http://www.eaba-association.eu/ *http://www.enalgae.eu/
  • 5. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Algal biomass as biofuels: Observations No systematic analysis of the algae biomass potential for North-Western Europe. Most of the knowledge buried in various formats of images, spreadsheets, proprietary data sources and grey literature. Lack of a knowledge level infrastructure that is equipped with the capabilities to provide semantic grounding to the datasets for algal biomass. Low levels of motivation among stakeholders, for datasets to be interlinked, shared and reused within the biomass community.
  • 6. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz LEAPS: A Potential Solution Linked Entities for Algal Plant Sites motivate the use of Semantic Web technologies and LOD for the algal biomass domain. laying out a set of ontological requirements for knowledge representation that support the publication of algal biomass data. elaborating on how algal biomass datasets are transformed to their corresponding RDF model representation. interlinking the generated RDF datasets along spatial dimensions with other datasets on the Web of data. visualising the linked datasets via an end user LOD REST Web service.
  • 7. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz EnAlgae: Energetic Algae Aims to reduce CO2 emissions and dependency on unsustainable energy sources in North West Europe. 4 Year Strategic initiative of Interreg IVb NWE programme. 19 partners and 14 Observers across 7 EU states. Coordinated set of activities focussing on sharing best practice, developing effective stakeholder engagement and encouraging transnational cooperation. http://www.enalgae.eu/
  • 8. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz EnAlgae: Some of the objectives Accelerate development of sustainable technologies for Biomass production. Create a network of pilot scale algal facilities across NWE in order to address the current lack of verifiable information on algal productivity. Maintain an up to date inventory in which pilots collect and share data in a standardised manner. Combine information across the entire algal bioenergy delivery chain into a comprehensive and user friendly Decision Support System for practitioners, policy makers and investors http://www.enalgae.eu/
  • 9. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz SW and Linked data for Algal Biomass Algal biomass data manifests itself across several facets. The value/supply chain ranges from cultivation of algae to production of biofuels and other products. Cultivation, harvesting, processing and fuel production further involves several intermediate processes. Every stage in the algal supply chain is governed by requirements, regulatory policies and strategies. Each of the facets consumes and produces a large volume of unstructured data and information.
  • 10. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Algal Supply Chain
  • 11. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz SW, Linked data and the Algal Supply Chain
  • 12. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Competency questions for stage 1 datasets Data driven Which are the algal operation sites with CO2 sources that have CO2 emissions less than 130000 kgs, where total costs of supplying CO2 is lower then 5000 GBP per ton of CO2 , areal yield is greater than 30 tons per hectare and which are located within the NUTS region “UKM61”? Supplement the data with supporting information about the region. Which are the top ten algal operation sites with the lowest impact on global warming potential? For a given algal operation site which are the first five most cost effective combinations of light, water, nutrients and CO2 sources?
  • 13. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Modelling Algal Biomass Knowledge
  • 14. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Ontological requirements Ontologies needed to represent Spatiality: location of possible algae cultivation sites, location of the sources of consumables (CO2 , nutrients and water). Geometries: area of the cultivation site - extents, polygons, linear and ring arrays. Units and Measurements: conventional measurement units such as Kgs for quantities and hectares for area, bespoke units of measurements, i.e., Kgs/hectare or Kgs/annum. Territorial units for statistics: core concepts of the NUTS system. Domain specific knowledge: algae cultivation sites, CO2 sources, pipelines.
  • 15. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Ontologies for Algal Biomass: Reuse Spatial Data: WGS84, spatial relations, Geonames, NeoGeo Geometries: WGS84, extended NeoGeo. Units and Measurements: extended QUDT http://www.w3.org/2003/01/geo/wgs84_pos http://www.ordnancesurvey.co.uk/oswebsite/ontology/ spatialrelations.owl http://www.geonames.org/ontology/ontology_v2.2.1.rdf http://geovocab.org/geometry http://qudt.org/1.1/vocab/dimensionalunit
  • 16. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Ontologies for Algal Biomass: Reuse
  • 17. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Ontologies for Algal Biomass: Domain knowledge Ontologies for modelling spatial knowledge, units and measurements were reused. Discovering vocabularies conceptualising the domain knowledge for algal biomass was non trivial. Concepts and relationships for algal biomass had to be defined from ground-up in accordance to the principles of ontology development The design was very strongly guided by feedback from questionnaires made available to the stakeholders, interviews with domain experts, providers of raw datasets and grey literature from the algal biomass and biofuels domain.
  • 18. Ontologies for Algal Biomass: Domain knowledge Ontologies available at http:/purl.org/biomass/ontologies
  • 19. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Designing URIs for Algal Biomass Data
  • 20. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Lifting XML datasets to Linked data
  • 21. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Lifting XML datasets to Linked data Raw data
  • 22. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Lifting XML datasets to Linked data First step The first part of the data processing and the potential calculation are performed in a GIS-based model which was developed for this purpose using ArcGIS. Raw datasets with various origins and formats - transformed using bespoke computational algorithms to an ArchGIS specific XML format. brings uniformity in the format of representation of the datasets and in the process of transformation. important computations that are part of the final datasets are performed.
  • 23. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Lifting XML datasets to Linked data Second step The original data sources had several limitations and a one-to-one transformation was not possible. The XML data sources related the biomass production sites and the CO2 sources via the pipeline dataset. In order to query for all sources that supplied CO2 to a specific site, the query would have to be made via the pipeline dataset. The site, source and NUTS identifiers in the datasets were string literals rather than URIs. A bespoke parser that exploits XPath to selectively query the XML datasets and generate linked data was implemented. It utilises a complex underlying data structure to facilitate the transformation.
  • 24. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Lifting XML datasets to Linked data Four datasets were transformed and stored in distributed triple store repositories. The NUTS regions dataset in RDF was available but there was no SPARQL endpoint or service to query the dataset. We retrieved the dataset dump and curated it in our local triple store as a separate repository. The transformed datasets interlinked resources defining sites, CO2 sources, pipelines, regions and NUTS data.
  • 25. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Lifting XML datasets to Linked data
  • 26. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz System Architecture
  • 27. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz System Architecture
  • 28. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Architecture: Main components Parsing modules: lifting the data from their original formats to RDF. Ontologies. Linking engine: producing the linked data representation of the datasets. Triple store: OWLIM SE 5.0. REST Web services. SPARQL endpoints. Web Interface.
  • 29. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Querying Linked Algal Biomass Data Most queries over the datasets are based on retrieving knowledge centered around location information. The queries are federated across the various repositories holding the linked data. Representative Query: Which are the algal operation sites with CO2 sources that have CO2 emissions less than 130000 kgs, where total costs of supplying CO2 is lower then 5000 GBP per ton of CO2 , areal yield is greater than 30 tons per hectare and which are located within the NUTS region “UKM61”? Supplement the data with supporting information about the region.
  • 30. Typical Query WHERE { SERVICE <http://localhost/repositories/biomass> { ?site a site:OperationSite; site:inNUTSRegion ?region; geo:location ?loc. ?loc geo:lat ?lat. ?loc geo:long ?long. ?site site:hasSiteID ?siteID; site:hasArealYield ?z. ?z qudt:quantityValue ?y. ?y qudt:numericValue ?arealYield. ?y qudt:unit ?unit. } SERVICE <http://localhost/repositories/co2source> { ?source a co2:CO2Source; co2:hasSourceID ?sourceID; co2:hasCO2Emission ?emission. ?emission qudt:quantityValue ?emissionQty. ?emissionQty qudt:numericValue ?emissionValue. } continued...
  • 31. Typical Query SERVICE <http://localhost/repositories/pipeline> { ?pipe a pipe:Pipeline; pipe:hasSiteID ?siteID; pipe:hasSourceID ?sourceID; pipe:hasTotalCO2Cost ?cost. ?cost qudt:quantityValue ?qty. ?qty qudt:numericValue ?totalCO2CostValue. ?qty qudt:unit ?totalCO2CostUnit. } SERVICE <http://localhost/repositories/region> { regionID a ramon:NUTSRegion; owl:sameAs ?related } FILTER((?emissionValue < 130000) && (contains(str(?region), "UKM61")) && (?arealYield > 30) && (?totalCO2CostValue < 5000)) }
  • 32. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Related Efforts, Conclusions and Future Work
  • 33. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Related efforts AquaFuels*: a taxonomy of algal strains available as PDF. BioEnerGIS *: a GIS based Decision support tool, BIOPOLE, for biomass plants feeding district heating systems. BioKDF *: Bioenergy knowledge discovery framework from the U.S. department of Energy. Reegle *: various energy related datasets as linked open data and a SPARQL endpoint to access the datasets. *http://www.aquafuels.eu/ *http://www.bioenergis.eu/ *https://bioenergykdf.net/ *http://data.reegle.info
  • 34. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Conclusions Investigations into using algal biomass as an alternative source of fuel is gaining widespread momentum. The Algal biomass community currently does not employ any knowledge representation techniques to formalise and structure valuable knowledge harnessed through their operations. As research in the sector progresses, a wealth of information will be available that could be exploited by domain specific applications.
  • 35. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Summary The LEAPS framework exploits SW and LD for the algal biomass community, enabling the screening of data for promising individual plant sites and provides base data for more detailed planning purposes. proposing a set of domain specific ontologies for algal plant sites, CO2 and pipelines to be shared and extended by the community. defining a linked data publishing architecture that transforms raw data in disparate formats to a uniform XML representation. using a set of well established and domain specific ontologies as metadata to transform it further into linked data. providing various data access options such as a SPARQL endpoint, an interactive Google map interface and a REST API for making the data accessible to stakeholders.
  • 36. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Future Work Several other datasets need to be integrated once they become available. One of the core datasets - algal strains from Algaebase*. Multifaceted visualisation of the integrated datasets to facilitate the uptake of the framework by stakeholders. Rule based reasoning to model and inference domain specific constraints. *http://www.algaebase.org/
  • 37. monika.solanki@bcu.ac.uk I-Semantics 2012, 5th September 2012, Graz Many Thanks!!!