SlideShare a Scribd company logo
How much semantic data on small devices?Mathieu d’Aquin, AndriyNikolov and Enrico MottaKnowledge Media Institute, The Open Univeristy, UKm.daquin@open.ac.uk@mdaquin
Semantic Data on Small Devices?
Benchmarking Semantic Data ToolsLUBM(1,0)103,397 triplesLarge Scale Benchmarks
Extracting sets of small-scale ontologiesClusters of ontologies having similar characteristics, except for size
Extracting sets of small-scale OntologiesCharacteristics of ontologiesSize (tiples): varies from very small scale to medium scaleRatio class/prop: allowing 50% varianceRatio class/inst.: allowing 50% varianceDL expressivity: Complexity of the language99 automatically created clustersManual selection of 10
Results
QueriesUsing real life ontologies need domain independent QueriesA set of 8 generic queries of varying complexity, and which results might depend on inferenceSelect all instances of all classesSelect all comments Select all labels and commentsSelect all labelsSelect all classes (RDFS/OWL/DAML)Select all properties by their domainSelect all RDFS classesSelect all properties applied to instances of all classes
Running the benchmarks – Triple StoresJena with TDB persistent storageRAs above + RDFS reasoningSesame with persistent storageRAs above + RDFS reasoningMulgara with default configuration
Running the benchmarks – DeviceAsus EEE PC 700 (2G)
Running the benchmarks - MeasuresLoading time: for each ontologies in an empty, re-initialized store.Disk Space: of the persistent store right after loading.Memory consumption: of the triple store process right after loading the ontology.Query time: for each ontology, averaged over the 8 queries.
Results – Loading time
Results – Loading timeR=R
Results – Disk Space
Results – Disk Space=<<RR
Results – Memory consumption
Results – Memory consumptionsRR=
Result – Query time
Result – Query time=<RR
Conclusion – on testsSesame performs best in almost all aspects, even when including reasoningReasoning has big impact on Jena TDB at query timeMulgara is clearly not adequate in a small-scale scenario
Conclusion – on small-scale benchmarkingValidates our assumption that small-scale benchmarks give different results than large-scale benchmarksPoints out the need for more work to tackle the small-scale scenariosResults are not always clear cut in every aspects: benchmarks as support to decide which tool to use, depending on the application constraints

More Related Content

PPTX
Jos van Sas - Testimonial Alcatel-Lucent Bell Labs
PPT
Using Linked Data in Learning Analytics tutorial - Introduction and basics of...
PPTX
Building the Open University's Web of Linked Data
PPTX
NeOn Tool Support for Building Ontologies By Reuse - ICBO 09
PPT
Referendum Oosterweelverbinding - Doe mee!
PPTX
Quick overview of the use of linked data in sociallearn
PPTX
Making sense of users' Web activities
PDF
OpenData in OpenEd (beyond OERs)
Jos van Sas - Testimonial Alcatel-Lucent Bell Labs
Using Linked Data in Learning Analytics tutorial - Introduction and basics of...
Building the Open University's Web of Linked Data
NeOn Tool Support for Building Ontologies By Reuse - ICBO 09
Referendum Oosterweelverbinding - Doe mee!
Quick overview of the use of linked data in sociallearn
Making sense of users' Web activities
OpenData in OpenEd (beyond OERs)

Similar to How much Semantic Data on Small Devices? (20)

ODP
Writting Better Software
PPT
MEME – An Integrated Tool For Advanced Computational Experiments
PPTX
Google, quality and you
PPTX
Machine learning session6(decision trees random forrest)
PPTX
Performance Issue? Machine Learning to the rescue!
PPTX
Using the Machine to predict Testability
PDF
IRJET- Factoid Question and Answering System
PDF
M018147883
PDF
Electi Deep Learning Optimization
ODP
Challenges in Large Scale Machine Learning
PPT
Testing
PPTX
Evaluation
PPTX
H evaluation
PPTX
team10.ppt.pptx
PDF
Issues in AI product development and practices in audio applications
PPT
Michigan Information Retrieval Enthusiasts Group Meetup - August 19, 2010
PPT
Semantics in Sensor Networks
PDF
Sentiment Analysis: A comparative study of Deep Learning and Machine Learning
PDF
Aco based solution for tsp model for evaluation of software test suite
PDF
Aco based solution for tsp model for evaluation of software test suite
Writting Better Software
MEME – An Integrated Tool For Advanced Computational Experiments
Google, quality and you
Machine learning session6(decision trees random forrest)
Performance Issue? Machine Learning to the rescue!
Using the Machine to predict Testability
IRJET- Factoid Question and Answering System
M018147883
Electi Deep Learning Optimization
Challenges in Large Scale Machine Learning
Testing
Evaluation
H evaluation
team10.ppt.pptx
Issues in AI product development and practices in audio applications
Michigan Information Retrieval Enthusiasts Group Meetup - August 19, 2010
Semantics in Sensor Networks
Sentiment Analysis: A comparative study of Deep Learning and Machine Learning
Aco based solution for tsp model for evaluation of software test suite
Aco based solution for tsp model for evaluation of software test suite
Ad

More from Mathieu d'Aquin (20)

PDF
A factorial study of neural network learning from differences for regression
PDF
Recentrer l'intelligence artificielle sur les connaissances
PDF
Data and Knowledge as Commodities
PDF
Unsupervised learning approach for identifying sub-genres in music scores
PDF
Is knowledge engineering still relevant?
PDF
A data view of the data science process
PDF
Dealing with Open Domain Data
PDF
Web Analytics for Everyday Learning
PDF
Presentation a in ovive montpellier - 26%2 f06%2f2018 (1)
PDF
Learning Analytics: understand learning and support the learner
PDF
The AFEL Project
PDF
Assessing the Readability of Policy Documents: The Case of Terms of Use of On...
PDF
Data ethics
PDF
Data for Learning and Learning with Data
PDF
Towards an “Ethics in Design” methodology for AI research projects
PDF
AFEL: Towards Measuring Online Activities Contributions to Self-Directed Lear...
PDF
Profiling information sources and services for discovery
PDF
Analyse de données et de réseaux sociaux pour l’aide à l’apprentissage infor...
PDF
From Knowledge Bases to Knowledge Infrastructures for Intelligent Systems
PDF
Data analytics beyond data processing and how it affects Industry 4.0
A factorial study of neural network learning from differences for regression
Recentrer l'intelligence artificielle sur les connaissances
Data and Knowledge as Commodities
Unsupervised learning approach for identifying sub-genres in music scores
Is knowledge engineering still relevant?
A data view of the data science process
Dealing with Open Domain Data
Web Analytics for Everyday Learning
Presentation a in ovive montpellier - 26%2 f06%2f2018 (1)
Learning Analytics: understand learning and support the learner
The AFEL Project
Assessing the Readability of Policy Documents: The Case of Terms of Use of On...
Data ethics
Data for Learning and Learning with Data
Towards an “Ethics in Design” methodology for AI research projects
AFEL: Towards Measuring Online Activities Contributions to Self-Directed Lear...
Profiling information sources and services for discovery
Analyse de données et de réseaux sociaux pour l’aide à l’apprentissage infor...
From Knowledge Bases to Knowledge Infrastructures for Intelligent Systems
Data analytics beyond data processing and how it affects Industry 4.0
Ad

Recently uploaded (20)

PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Modernizing your data center with Dell and AMD
PDF
KodekX | Application Modernization Development
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
NewMind AI Monthly Chronicles - July 2025
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
A Presentation on Artificial Intelligence
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Chapter 3 Spatial Domain Image Processing.pdf
Review of recent advances in non-invasive hemoglobin estimation
Dropbox Q2 2025 Financial Results & Investor Presentation
Unlocking AI with Model Context Protocol (MCP)
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Modernizing your data center with Dell and AMD
KodekX | Application Modernization Development
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Reach Out and Touch Someone: Haptics and Empathic Computing
Agricultural_Statistics_at_a_Glance_2022_0.pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Understanding_Digital_Forensics_Presentation.pptx
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
NewMind AI Monthly Chronicles - July 2025
The AUB Centre for AI in Media Proposal.docx
Network Security Unit 5.pdf for BCA BBA.
A Presentation on Artificial Intelligence
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf

How much Semantic Data on Small Devices?

  • 1. How much semantic data on small devices?Mathieu d’Aquin, AndriyNikolov and Enrico MottaKnowledge Media Institute, The Open Univeristy, UKm.daquin@open.ac.uk@mdaquin
  • 2. Semantic Data on Small Devices?
  • 3. Benchmarking Semantic Data ToolsLUBM(1,0)103,397 triplesLarge Scale Benchmarks
  • 4. Extracting sets of small-scale ontologiesClusters of ontologies having similar characteristics, except for size
  • 5. Extracting sets of small-scale OntologiesCharacteristics of ontologiesSize (tiples): varies from very small scale to medium scaleRatio class/prop: allowing 50% varianceRatio class/inst.: allowing 50% varianceDL expressivity: Complexity of the language99 automatically created clustersManual selection of 10
  • 7. QueriesUsing real life ontologies need domain independent QueriesA set of 8 generic queries of varying complexity, and which results might depend on inferenceSelect all instances of all classesSelect all comments Select all labels and commentsSelect all labelsSelect all classes (RDFS/OWL/DAML)Select all properties by their domainSelect all RDFS classesSelect all properties applied to instances of all classes
  • 8. Running the benchmarks – Triple StoresJena with TDB persistent storageRAs above + RDFS reasoningSesame with persistent storageRAs above + RDFS reasoningMulgara with default configuration
  • 9. Running the benchmarks – DeviceAsus EEE PC 700 (2G)
  • 10. Running the benchmarks - MeasuresLoading time: for each ontologies in an empty, re-initialized store.Disk Space: of the persistent store right after loading.Memory consumption: of the triple store process right after loading the ontology.Query time: for each ontology, averaged over the 8 queries.
  • 14. Results – Disk Space=<<RR
  • 15. Results – Memory consumption
  • 16. Results – Memory consumptionsRR=
  • 18. Result – Query time=<RR
  • 19. Conclusion – on testsSesame performs best in almost all aspects, even when including reasoningReasoning has big impact on Jena TDB at query timeMulgara is clearly not adequate in a small-scale scenario
  • 20. Conclusion – on small-scale benchmarkingValidates our assumption that small-scale benchmarks give different results than large-scale benchmarksPoints out the need for more work to tackle the small-scale scenariosResults are not always clear cut in every aspects: benchmarks as support to decide which tool to use, depending on the application constraints