SlideShare a Scribd company logo
WOTS2E: A Search Engine for a
Semantic Web of Things
Unit for Reasoning, Querying and Stream Processing
Insight Centre for Data Analytics,
National University of Ireland, Galway
Andreas Kamilaris, Semih Yumusak, Ali Intizar
World Forum – IoT 2016
Reston, VA, USA- December 12-14, 2016
https://www.w3.org/WoT/images/iot.png
Web of Things
• Designed to connect
“things” to the Web
• A combination of
• Approaches
• Software Architectures
• Interfaces
https://www.w3.org/WoT/images/iot.png
• Increase Interoperability
among IoT platforms
• Mitigate Silo Architecture
• Avoid Multiple and Conflicting
Standards
• Global and Easy Discovery of
Devices
Why we need Web of Things?
• Few of the emerging WoT
platforms
• Sorcades
• ThingWorx
• SpitFire
• Evrythng
• Open.Sen.se
• WoTKit
• Auto WoT
• Xively
Web of Things Platforms
• Can we improve the discoverability of Web of Things?
• Can we use semantic technologies to improve device
discoverability?
• Are there any datasets produced by WoT devices available as
Open Data on the Web?
• Can we create a global and distributed index for search and
discovery of WoT devices?
Our Motivation
Discovery
• Machines needs to automatically discover devices/things and their
description
• Global repositories
• Indexing Things and their description
• Semantic Annotation to describe things
• SPARQL queries and data endpoints
• Discover devices on the fly (Late Binding)
Slide Source: ISWC 2016, Tutorial on Semantic Web Meets IoT/WoT (Soumya Kanti Datta)
Repository Based Discovery
Slide Source: ISWC 2016, Tutorial on Semantic Web Meets IoT/WoT (Soumya Kanti Datta)
Device Discovery Mechanisms
• Spatial Search
– BLE beacon based things
• Network Based Search
– mDNS, multicast CoAP
• Device Registration Directories
– CoRE resource directory, XMPP IoT directory, HyperCat
• Meta-Data Discovery
– CoRE Link Format
• Semantic Search and Discovery Techniqyes
– Offers high richness in search queries
• E.g. “search for all light bulbs in my house”
Slide Source: ISWC 2016, Tutorial on Semantic Web Meets IoT/WoT (Soumya Kanti Datta)
Semantic Search & Discovery: Key Challenges
• Optimal Data Source Discovery
• Streams are everywhere
• Multiple data streams can answer the same query
• Optimal data stream selection
• Catering for user-defined constraints and preferences
• On-Demand Stream Federation
• Automated composition of primitive data streams to
answer complex queries
• Adaptation
• Data source properties can change over time
• Make sure selected sources remain “optimal”
throughout life cycle of the query
Stream Discovery, Federation and Adaptation
• Stream Discovery
– Data interoperability:
• Semantic descriptions (ontologies and annotations)
– Interface interoperability:
• Streams as event services (service discovery)
• Stream Federation
– Efficient processing of complicated event logics
• Data Stream Management Systems
• Complex Event Processing
Semantic
Web
Service
Oriented
Architectures
DSMS and
CEP
Semantic Description
• A sensor service description is annotated as:
sdesc = (td, g, qd, Pd, FoId, fd)
type grounding QoS
Observed
Properties
Feature Of
Iterest
Pd → FoId
• Similarly, a sensor service request is annotated:
sr = (tr, Pr, FoIr, fr, pref, C)
type Requested
Properties
Feature of
Interest
Pd → FoId
no
grounding
NFP Constraint and
Preferences
Middle-ware for Stream Discovery and Federation
Semantic Annotation
ACEIS Core
Resource
Management
Application
Interface
Knowledge Base
QoI/QoS
Stream
Description
Data Mgmt,
Indexing,
Caching
User Input
Event Request
Data
Federation
Resource Discovery
Event Service Composer
Composition Plan
Subscription Manager
Query Transformer
Query Engine
Query
Results
Constraint
Validation
Constraint
Violation
Adaptation
Manager
Data Store
IoT Data
Stream
Social Data
Stream
Web of Things Discovery
• Optimal Data Source Discovery
• Web of Things Search Space is
Global
• Across the whole Web
• Indexing
• Geo-Spatial Mapping
• Movable Objects/Things
• Require Frequent Updates in
Indexes
Problem StatementLinked Open Data Cloud
WOTS2E: ArchitectureWOTS2E: Overview
• A Search engine to
discover semantic meta-
description of things
• Crawls the Web to
discover Linked Data
Sources
• Analyzes Linked Data
sources to identify
relevant WoT devices
WOTS2E: ArchitectureWOTS2E: Overview
• Maintain a registry of
devices for discovery
• Support application
request and provide
details to interact with
the devices.
WOTS2E: ArchitectureWOTS2E: Operations
• Discovery of Linked Data Endpoints
– Web crawlers that continuously scan the
web for discovery of Linked Data
endpoints, frequency of one scan per day
• Examination of Discovered Linked
Data Endpoints
– Query endpoints for IoT/WoT relevant
ontologies
– Explore popular dataset descriptions, such
as VoID, SPARQL-SD
• Analysis of Linked Data Endpoints
and WoT Device Discovery
– Through SPARQL queries, VoID/SPARQL-
SD files, use of open APIs
• Recording of WoT Devices and
Services Discovered
– Service type, location, time, features,
interaction types, restrictions etc.
WOTS2E: ArchitectureWOTS2E: Implementation/Analysis
Common Patterns
<meta name=”Keywords” content=”OpenLink Virtuoso Sparql”>
Virtuoso SPARQL Query Editor
OpenLink Software
<label for=”debug”>Strict checking of void variables</label>
<a href=”http://www.openlinksw.com/virtuoso” >
<a href=”/isparql”>iSPARQL</a>
<label for=”query”>Query text</label>
• SPARQL Endpoint Listed at Datahub.io
• Common Patterns Identification
• SPARQL endpoint discovery
WOTS2E: ArchitectureWOTS2E: Implementation/Analysis
• Discovered patterns are used
as an input to our web
crawlers, in order to search the
web for available SPARQL
endpoints.
• For web crawling, we used a
meta-crawling service called
SpEnD.
• SpEnD exploits the search
functionality available over
popular search engines to
accelerate the performance of
web crawling.
WOTS2E: ArchitectureWOTS2E: Implementation/Analysis
• To analyze the URLs retrieved, the Jena Framework was
used to send SPARQL queries to the candidate endpoints,
checking whether they are valid SPARQL endpoints or not.
• All valid SPARQL endpoints were examined whether they
contain information related to IoT/WoT, i.e. contained
relevant ontologies.
SELECT DISTINCT ?Concept WHERE {[] a ?Concept} LIMIT 100
SELECT * WHERE {{?s ?p ?o}
UNION {GRAPH ?g {?s ?p ?o}}
FILTER regex(?o, "/SSN"). FILTER isIRI(?o).
• After labeling some SPARQL endpoint as related to IoT/WoT, the next step was to
analyze it, discovering which devices/services are available through it:
VoID file adapted to
reveal information about
WoT devices and services
Extend SSN to include discovery and
description information through some
ontology that describes web services.
:ExampleDataset a void:Dataset;
void:subset :ExampleSensor .
:ExampleSensor a void:Dataset;
dcterms:title "WoT Example Sensor";
dcterms:description "http://../sens.wadl";
dcterms:contributor "Insight Centre";
dcterms:source "140.203.154.11"
WOTS2E: Implementation/Analysis
WOTS2E: ArchitectureWOTS2E: Implementation/Analysis
• Information about SPARQL endpoints, devices/services
discovered and sensor meta-data information was stored
as RDF triples on a Virtuoso RDF store, installed on
WOTS2E.
• Use of the IoT ontologyPrefix iot: <http://purl.org/IoT/iot#>
Prefix ssn: <http://purl.oclc.org/../ssn#>
SELECT ?sm ?device ?service
WHERE {
?sm a iot:smart_entity
?sm iot:has_part_device
?device ?device ssn:observes
?service ?service a iot:Temperature
}
WOTS2E: ArchitectureWOTS2E: Evaluation
• The SpEnD (meta-)crawling service ran for 24 hours
• Using the common patterns for SPARQL endpoints
• Relevant URLs from the Bing, Yahoo, Google, Baidu, and
Yandex search engines.
• Comparison of discovered endpoints with Datahub.
Active Inactive Total
WOTS2E 638 640 1278
Datahub 258 296 554
WOTS2E: ArchitectureWOTS2E: Evaluation
• From the discovered 638 active SPARQL endpoints, we
examined them one by one for relevance to IoT/WoT
Ontology Number of Endpoints
SSN 13
DBPedia 13
SmartBuilding 3
DogOnt 2
DUL 2
km4city 2
OpenEI 2
RDFS, SKOS 4
Fan Fpai, Fiemser, IoT,
PROV, SAREF
5 (once each ontology)
WOTS2E: ArchitectureWOTS2E: Evaluation
• IoT/WoT-specific triples from the endpoints
Ontology Number of Triples
SSN 1.433,248
DUL 182
km4city 56
Fiemser 50
OpenIoT 44
SmartBuilding 36
DogOnt 24
SAREF 4
Fan Fpai 2
Conclusion
• Semantic Search and Discovery is essential for Web of
Things
• Currently only a handful of available SPARQL endpoints
(46, 7.2%) seem to relate to IoT/WoT.
• Lack of meta data availability
• Need for standardization for discovery mechanisms
• Our method aims to suggest a solid proposal on how to
achieve discovery on SWoT seamlessly and with minimum
effort.
• WOTS2E can support applications looking for on the fly
discovery and integration of devices
Future Work
• Improve Search mechanism by designing good
vocabularies/ontologies and descriptions for IoT/WoT
devices, services and data.
• A user-friendly website of WOTS2E, to incrementally let
users to access the discovered lists of services in a well-
organized way.
• From meta-crawling to efficient (classical) web crawling.
• Contribute to the standardization efforts on the WoT (W3C
WoT IG, OGC Sensor Web Interface for IoT SWG),
promoting WOTS2E as a viable solution for a SWoT search
engine.

More Related Content

PPT
From Watson to Ontology Repositories - Ontolog OOR panel
PDF
Intro to Elasticsearch
ODP
Searching Relational Data with Elasticsearch
PDF
ModelDB: A System to Manage Machine Learning Models: Spark Summit East talk b...
PDF
Developing Apps: Exposing Your Data Through Araport
PDF
Webinar: Event Processing & Data Analytics with Lucidworks Fusion
PDF
Elasticsearch Tutorial | Getting Started with Elasticsearch | ELK Stack Train...
ODP
Elasticsearch for beginners
From Watson to Ontology Repositories - Ontolog OOR panel
Intro to Elasticsearch
Searching Relational Data with Elasticsearch
ModelDB: A System to Manage Machine Learning Models: Spark Summit East talk b...
Developing Apps: Exposing Your Data Through Araport
Webinar: Event Processing & Data Analytics with Lucidworks Fusion
Elasticsearch Tutorial | Getting Started with Elasticsearch | ELK Stack Train...
Elasticsearch for beginners

What's hot (20)

PPTX
Introduction to Elasticsearch with basics of Lucene
PDF
ElasticSearch: Distributed Multitenant NoSQL Datastore and Search Engine
PPTX
RELIANCE ROHub hackathon
PPTX
Elasticsearch
PPTX
ElasticSearch in Production: lessons learned
PDF
Webinar: Search and Recommenders
PDF
The WorldCat Search API
PDF
Lucene And Solr Document Classification
PDF
Roaring with elastic search sangam2018
PPT
Wis2011_presentation_Realtime_Events_on_LOD
PDF
HRGRN: enabling graph search and integrative analysis of Arabidopsis signalin...
PDF
Datasets and GATE Evaluation Framework for Benchmarking Wikipedia Based NER S...
PDF
Webinar: Fusion 2.3 Preview - Enhanced Features with Solr & Spark
PPTX
Elasticsearch Introduction
PDF
Elasticsearch Basics
PPTX
JBrowse within the Arabidopsis Information Portal - PAG XXIII
PPT
Search domain basics
PPTX
Building genomic data cyberinfrastructure with the online database software T...
PPTX
Solr 6.0 Graph Query Overview
PDF
Security Analytics using ELK stack
Introduction to Elasticsearch with basics of Lucene
ElasticSearch: Distributed Multitenant NoSQL Datastore and Search Engine
RELIANCE ROHub hackathon
Elasticsearch
ElasticSearch in Production: lessons learned
Webinar: Search and Recommenders
The WorldCat Search API
Lucene And Solr Document Classification
Roaring with elastic search sangam2018
Wis2011_presentation_Realtime_Events_on_LOD
HRGRN: enabling graph search and integrative analysis of Arabidopsis signalin...
Datasets and GATE Evaluation Framework for Benchmarking Wikipedia Based NER S...
Webinar: Fusion 2.3 Preview - Enhanced Features with Solr & Spark
Elasticsearch Introduction
Elasticsearch Basics
JBrowse within the Arabidopsis Information Portal - PAG XXIII
Search domain basics
Building genomic data cyberinfrastructure with the online database software T...
Solr 6.0 Graph Query Overview
Security Analytics using ELK stack
Ad

Viewers also liked (20)

PDF
Service Integration in the Web of Things
PPTX
WoT 2016 - Seventh International Workshop on the Web of Things
PPT
Towards the Web of Things: Web Mashups for the Real-World @ MEM 2009
PDF
A Distributional Approach for Terminological Semantic Search on the Linked Da...
PDF
Introduction to Swingly
PPT
Michael Caulfield: Developing a Connected Health Economy
PDF
Demo: Profiling & Exploration of Linked Open Data
PPTX
Search Engines After The Semanatic Web
PDF
PDF
Knowledge Patterns SSSW2016
PDF
Service Integration - A Web of Things Perspective
PPTX
SemTech 2011 Semantic Search tutorial
PDF
Adaptive Go-To-Market Plan for a Business DNA Search Engine: VisionaryD Software
PDF
VisionaryD's Business Model Canvas: Proposed Freemium, Multisided Business Mo...
PDF
PhD Dissertation Supporting tools for automated generation and visual editing...
PPTX
PDF
Towards an industrial Web of Things
PDF
Tutorial Knowledge Discovery
PDF
A component based architecture for the Web of Things
PDF
Web of Things presentation - Document Generation
 
Service Integration in the Web of Things
WoT 2016 - Seventh International Workshop on the Web of Things
Towards the Web of Things: Web Mashups for the Real-World @ MEM 2009
A Distributional Approach for Terminological Semantic Search on the Linked Da...
Introduction to Swingly
Michael Caulfield: Developing a Connected Health Economy
Demo: Profiling & Exploration of Linked Open Data
Search Engines After The Semanatic Web
Knowledge Patterns SSSW2016
Service Integration - A Web of Things Perspective
SemTech 2011 Semantic Search tutorial
Adaptive Go-To-Market Plan for a Business DNA Search Engine: VisionaryD Software
VisionaryD's Business Model Canvas: Proposed Freemium, Multisided Business Mo...
PhD Dissertation Supporting tools for automated generation and visual editing...
Towards an industrial Web of Things
Tutorial Knowledge Discovery
A component based architecture for the Web of Things
Web of Things presentation - Document Generation
 
Ad

Similar to WOTS2E: A Search Engine for a Semantic Web of Things (20)

PPTX
Do ”Web of Things Platforms” Truly Follow the Web of Things?
PPTX
Standard Provenance Reporting and Scientific Software Management in Virtual L...
PDF
Design patternsforiot
PDF
Advanced Automated Analytics Using OSS Tools, GA Tech FDA Conference 2016
PPTX
Informix - The Ideal Database for IoT
PDF
Serverless SQL
PDF
Introduction to Apache Geode (Cork, Ireland)
PPTX
Hypermedia for Machine APIs
PDF
Apache Geode Meetup, Cork, Ireland at CIT
PDF
FIWARE Global Summit - The Way Towards Interoperability between Web Of Things...
PDF
Ietf91 ad hoc-coap-lwm2m-ipso
PPTX
Data saturday malta - ADX Azure Data Explorer overview
PDF
Connecting to the internet of things (IoT)
PDF
IoT Stream: A Lightweight Ontology for Internet of Things Data Streams (GIoTS...
PDF
Bertenthal
PDF
Building Fast Applications for Streaming Data
PDF
Data Science with the Help of Metadata
PDF
DataStax and Esri: Geotemporal IoT Search and Analytics
PPTX
A machine learning and data science pipeline for real companies
PPTX
Web of Things to the edge
Do ”Web of Things Platforms” Truly Follow the Web of Things?
Standard Provenance Reporting and Scientific Software Management in Virtual L...
Design patternsforiot
Advanced Automated Analytics Using OSS Tools, GA Tech FDA Conference 2016
Informix - The Ideal Database for IoT
Serverless SQL
Introduction to Apache Geode (Cork, Ireland)
Hypermedia for Machine APIs
Apache Geode Meetup, Cork, Ireland at CIT
FIWARE Global Summit - The Way Towards Interoperability between Web Of Things...
Ietf91 ad hoc-coap-lwm2m-ipso
Data saturday malta - ADX Azure Data Explorer overview
Connecting to the internet of things (IoT)
IoT Stream: A Lightweight Ontology for Internet of Things Data Streams (GIoTS...
Bertenthal
Building Fast Applications for Streaming Data
Data Science with the Help of Metadata
DataStax and Esri: Geotemporal IoT Search and Analytics
A machine learning and data science pipeline for real companies
Web of Things to the edge

More from Andreas Kamilaris (20)

PDF
Experiences from the use of CovTracer: A contact tracing tool deployed in Cyp...
PPTX
Transferring manure from livestock farms to be used as fertilizer in crop fields
PPTX
Training deep learning models to count using synthetic images
PPTX
Geospatial Analysis and Internet of Things in Environmental Informatics
PPTX
A Review on the Application of Natural Computing in Environmental Informatics
PPTX
The evolution of pervasive computing towards a Web of Things
PPTX
AgriBigCAT: An Online Platform for Estimating the Impact of Livestock Agricul...
PPTX
Estimating the Environmental Impact of Agriculture by means of Geospatial and...
PPTX
Disaster Monitoring using Unmanned Aerial Vehicles and Deep Learning
PPTX
A Web of Things Based Eco-System for Urban Computing - Towards Smarter Cities
PPTX
Big data analysis and Integration of Geophysical information from the Catalan...
PPTX
Estimating the Impact of Agriculture on the Environment of Catalunya by means...
PPTX
Agri-IoT: A Semantic Framework for Internet of Things-enabled Smart Farming A...
PPTX
Enabling the physical world to the Internet and potential benefits for agricu...
PPTX
Privacy Concerns in Sharing Personal Consumption Data through Online Applicat...
PPTX
Social Electricity User Manual
PPTX
Social Electricity
PPTX
Social Electricity Online Platform (SEOP) EU Project Description
PPTX
How the Internet can motivate you to switch off the lights
PPTX
Good Practices in the Use of ICT Equipment for Electricity Savings at a Unive...
Experiences from the use of CovTracer: A contact tracing tool deployed in Cyp...
Transferring manure from livestock farms to be used as fertilizer in crop fields
Training deep learning models to count using synthetic images
Geospatial Analysis and Internet of Things in Environmental Informatics
A Review on the Application of Natural Computing in Environmental Informatics
The evolution of pervasive computing towards a Web of Things
AgriBigCAT: An Online Platform for Estimating the Impact of Livestock Agricul...
Estimating the Environmental Impact of Agriculture by means of Geospatial and...
Disaster Monitoring using Unmanned Aerial Vehicles and Deep Learning
A Web of Things Based Eco-System for Urban Computing - Towards Smarter Cities
Big data analysis and Integration of Geophysical information from the Catalan...
Estimating the Impact of Agriculture on the Environment of Catalunya by means...
Agri-IoT: A Semantic Framework for Internet of Things-enabled Smart Farming A...
Enabling the physical world to the Internet and potential benefits for agricu...
Privacy Concerns in Sharing Personal Consumption Data through Online Applicat...
Social Electricity User Manual
Social Electricity
Social Electricity Online Platform (SEOP) EU Project Description
How the Internet can motivate you to switch off the lights
Good Practices in the Use of ICT Equipment for Electricity Savings at a Unive...

Recently uploaded (20)

PDF
NewMind AI Weekly Chronicles - August'25 Week I
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Spectral efficient network and resource selection model in 5G networks
PPTX
Cloud computing and distributed systems.
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPT
Teaching material agriculture food technology
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Machine learning based COVID-19 study performance prediction
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
KodekX | Application Modernization Development
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
Big Data Technologies - Introduction.pptx
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
Understanding_Digital_Forensics_Presentation.pptx
NewMind AI Weekly Chronicles - August'25 Week I
The AUB Centre for AI in Media Proposal.docx
Spectral efficient network and resource selection model in 5G networks
Cloud computing and distributed systems.
Building Integrated photovoltaic BIPV_UPV.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Teaching material agriculture food technology
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Machine learning based COVID-19 study performance prediction
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Agricultural_Statistics_at_a_Glance_2022_0.pdf
KodekX | Application Modernization Development
Network Security Unit 5.pdf for BCA BBA.
Dropbox Q2 2025 Financial Results & Investor Presentation
Big Data Technologies - Introduction.pptx
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Understanding_Digital_Forensics_Presentation.pptx

WOTS2E: A Search Engine for a Semantic Web of Things

  • 1. WOTS2E: A Search Engine for a Semantic Web of Things Unit for Reasoning, Querying and Stream Processing Insight Centre for Data Analytics, National University of Ireland, Galway Andreas Kamilaris, Semih Yumusak, Ali Intizar World Forum – IoT 2016 Reston, VA, USA- December 12-14, 2016
  • 2. https://www.w3.org/WoT/images/iot.png Web of Things • Designed to connect “things” to the Web • A combination of • Approaches • Software Architectures • Interfaces
  • 3. https://www.w3.org/WoT/images/iot.png • Increase Interoperability among IoT platforms • Mitigate Silo Architecture • Avoid Multiple and Conflicting Standards • Global and Easy Discovery of Devices Why we need Web of Things?
  • 4. • Few of the emerging WoT platforms • Sorcades • ThingWorx • SpitFire • Evrythng • Open.Sen.se • WoTKit • Auto WoT • Xively Web of Things Platforms
  • 5. • Can we improve the discoverability of Web of Things? • Can we use semantic technologies to improve device discoverability? • Are there any datasets produced by WoT devices available as Open Data on the Web? • Can we create a global and distributed index for search and discovery of WoT devices? Our Motivation
  • 6. Discovery • Machines needs to automatically discover devices/things and their description • Global repositories • Indexing Things and their description • Semantic Annotation to describe things • SPARQL queries and data endpoints • Discover devices on the fly (Late Binding) Slide Source: ISWC 2016, Tutorial on Semantic Web Meets IoT/WoT (Soumya Kanti Datta)
  • 7. Repository Based Discovery Slide Source: ISWC 2016, Tutorial on Semantic Web Meets IoT/WoT (Soumya Kanti Datta)
  • 8. Device Discovery Mechanisms • Spatial Search – BLE beacon based things • Network Based Search – mDNS, multicast CoAP • Device Registration Directories – CoRE resource directory, XMPP IoT directory, HyperCat • Meta-Data Discovery – CoRE Link Format • Semantic Search and Discovery Techniqyes – Offers high richness in search queries • E.g. “search for all light bulbs in my house” Slide Source: ISWC 2016, Tutorial on Semantic Web Meets IoT/WoT (Soumya Kanti Datta)
  • 9. Semantic Search & Discovery: Key Challenges • Optimal Data Source Discovery • Streams are everywhere • Multiple data streams can answer the same query • Optimal data stream selection • Catering for user-defined constraints and preferences • On-Demand Stream Federation • Automated composition of primitive data streams to answer complex queries • Adaptation • Data source properties can change over time • Make sure selected sources remain “optimal” throughout life cycle of the query
  • 10. Stream Discovery, Federation and Adaptation • Stream Discovery – Data interoperability: • Semantic descriptions (ontologies and annotations) – Interface interoperability: • Streams as event services (service discovery) • Stream Federation – Efficient processing of complicated event logics • Data Stream Management Systems • Complex Event Processing Semantic Web Service Oriented Architectures DSMS and CEP
  • 11. Semantic Description • A sensor service description is annotated as: sdesc = (td, g, qd, Pd, FoId, fd) type grounding QoS Observed Properties Feature Of Iterest Pd → FoId • Similarly, a sensor service request is annotated: sr = (tr, Pr, FoIr, fr, pref, C) type Requested Properties Feature of Interest Pd → FoId no grounding NFP Constraint and Preferences
  • 12. Middle-ware for Stream Discovery and Federation Semantic Annotation ACEIS Core Resource Management Application Interface Knowledge Base QoI/QoS Stream Description Data Mgmt, Indexing, Caching User Input Event Request Data Federation Resource Discovery Event Service Composer Composition Plan Subscription Manager Query Transformer Query Engine Query Results Constraint Validation Constraint Violation Adaptation Manager Data Store IoT Data Stream Social Data Stream
  • 13. Web of Things Discovery • Optimal Data Source Discovery • Web of Things Search Space is Global • Across the whole Web • Indexing • Geo-Spatial Mapping • Movable Objects/Things • Require Frequent Updates in Indexes
  • 15. WOTS2E: ArchitectureWOTS2E: Overview • A Search engine to discover semantic meta- description of things • Crawls the Web to discover Linked Data Sources • Analyzes Linked Data sources to identify relevant WoT devices
  • 16. WOTS2E: ArchitectureWOTS2E: Overview • Maintain a registry of devices for discovery • Support application request and provide details to interact with the devices.
  • 17. WOTS2E: ArchitectureWOTS2E: Operations • Discovery of Linked Data Endpoints – Web crawlers that continuously scan the web for discovery of Linked Data endpoints, frequency of one scan per day • Examination of Discovered Linked Data Endpoints – Query endpoints for IoT/WoT relevant ontologies – Explore popular dataset descriptions, such as VoID, SPARQL-SD • Analysis of Linked Data Endpoints and WoT Device Discovery – Through SPARQL queries, VoID/SPARQL- SD files, use of open APIs • Recording of WoT Devices and Services Discovered – Service type, location, time, features, interaction types, restrictions etc.
  • 18. WOTS2E: ArchitectureWOTS2E: Implementation/Analysis Common Patterns <meta name=”Keywords” content=”OpenLink Virtuoso Sparql”> Virtuoso SPARQL Query Editor OpenLink Software <label for=”debug”>Strict checking of void variables</label> <a href=”http://www.openlinksw.com/virtuoso” > <a href=”/isparql”>iSPARQL</a> <label for=”query”>Query text</label> • SPARQL Endpoint Listed at Datahub.io • Common Patterns Identification • SPARQL endpoint discovery
  • 19. WOTS2E: ArchitectureWOTS2E: Implementation/Analysis • Discovered patterns are used as an input to our web crawlers, in order to search the web for available SPARQL endpoints. • For web crawling, we used a meta-crawling service called SpEnD. • SpEnD exploits the search functionality available over popular search engines to accelerate the performance of web crawling.
  • 20. WOTS2E: ArchitectureWOTS2E: Implementation/Analysis • To analyze the URLs retrieved, the Jena Framework was used to send SPARQL queries to the candidate endpoints, checking whether they are valid SPARQL endpoints or not. • All valid SPARQL endpoints were examined whether they contain information related to IoT/WoT, i.e. contained relevant ontologies. SELECT DISTINCT ?Concept WHERE {[] a ?Concept} LIMIT 100 SELECT * WHERE {{?s ?p ?o} UNION {GRAPH ?g {?s ?p ?o}} FILTER regex(?o, "/SSN"). FILTER isIRI(?o).
  • 21. • After labeling some SPARQL endpoint as related to IoT/WoT, the next step was to analyze it, discovering which devices/services are available through it: VoID file adapted to reveal information about WoT devices and services Extend SSN to include discovery and description information through some ontology that describes web services. :ExampleDataset a void:Dataset; void:subset :ExampleSensor . :ExampleSensor a void:Dataset; dcterms:title "WoT Example Sensor"; dcterms:description "http://../sens.wadl"; dcterms:contributor "Insight Centre"; dcterms:source "140.203.154.11" WOTS2E: Implementation/Analysis
  • 22. WOTS2E: ArchitectureWOTS2E: Implementation/Analysis • Information about SPARQL endpoints, devices/services discovered and sensor meta-data information was stored as RDF triples on a Virtuoso RDF store, installed on WOTS2E. • Use of the IoT ontologyPrefix iot: <http://purl.org/IoT/iot#> Prefix ssn: <http://purl.oclc.org/../ssn#> SELECT ?sm ?device ?service WHERE { ?sm a iot:smart_entity ?sm iot:has_part_device ?device ?device ssn:observes ?service ?service a iot:Temperature }
  • 23. WOTS2E: ArchitectureWOTS2E: Evaluation • The SpEnD (meta-)crawling service ran for 24 hours • Using the common patterns for SPARQL endpoints • Relevant URLs from the Bing, Yahoo, Google, Baidu, and Yandex search engines. • Comparison of discovered endpoints with Datahub. Active Inactive Total WOTS2E 638 640 1278 Datahub 258 296 554
  • 24. WOTS2E: ArchitectureWOTS2E: Evaluation • From the discovered 638 active SPARQL endpoints, we examined them one by one for relevance to IoT/WoT Ontology Number of Endpoints SSN 13 DBPedia 13 SmartBuilding 3 DogOnt 2 DUL 2 km4city 2 OpenEI 2 RDFS, SKOS 4 Fan Fpai, Fiemser, IoT, PROV, SAREF 5 (once each ontology)
  • 25. WOTS2E: ArchitectureWOTS2E: Evaluation • IoT/WoT-specific triples from the endpoints Ontology Number of Triples SSN 1.433,248 DUL 182 km4city 56 Fiemser 50 OpenIoT 44 SmartBuilding 36 DogOnt 24 SAREF 4 Fan Fpai 2
  • 26. Conclusion • Semantic Search and Discovery is essential for Web of Things • Currently only a handful of available SPARQL endpoints (46, 7.2%) seem to relate to IoT/WoT. • Lack of meta data availability • Need for standardization for discovery mechanisms • Our method aims to suggest a solid proposal on how to achieve discovery on SWoT seamlessly and with minimum effort. • WOTS2E can support applications looking for on the fly discovery and integration of devices
  • 27. Future Work • Improve Search mechanism by designing good vocabularies/ontologies and descriptions for IoT/WoT devices, services and data. • A user-friendly website of WOTS2E, to incrementally let users to access the discovered lists of services in a well- organized way. • From meta-crawling to efficient (classical) web crawling. • Contribute to the standardization efforts on the WoT (W3C WoT IG, OGC Sensor Web Interface for IoT SWG), promoting WOTS2E as a viable solution for a SWoT search engine.