How much Semantic Data on Small Devices?

How much semantic data on small devices?Mathieu d’Aquin, AndriyNikolov and Enrico MottaKnowledge Media Institute, The Open Univeristy, UKm.daquin@open.ac.uk@mdaquin

Semantic Data on Small Devices?

Benchmarking Semantic Data ToolsLUBM(1,0)103,397 triplesLarge Scale Benchmarks

Extracting sets of small-scale ontologiesClusters of ontologies having similar characteristics, except for size

Extracting sets of small-scale OntologiesCharacteristics of ontologiesSize (tiples): varies from very small scale to medium scaleRatio class/prop: allowing 50% varianceRatio class/inst.: allowing 50% varianceDL expressivity: Complexity of the language99 automatically created clustersManual selection of 10

QueriesUsing real life ontologies need domain independent QueriesA set of 8 generic queries of varying complexity, and which results might depend on inferenceSelect all instances of all classesSelect all comments Select all labels and commentsSelect all labelsSelect all classes (RDFS/OWL/DAML)Select all properties by their domainSelect all RDFS classesSelect all properties applied to instances of all classes

Running the benchmarks – Triple StoresJena with TDB persistent storageRAs above + RDFS reasoningSesame with persistent storageRAs above + RDFS reasoningMulgara with default configuration

Running the benchmarks – DeviceAsus EEE PC 700 (2G)

Running the benchmarks - MeasuresLoading time: for each ontologies in an empty, re-initialized store.Disk Space: of the persistent store right after loading.Memory consumption: of the triple store process right after loading the ontology.Query time: for each ontology, averaged over the 8 queries.

Results – Memory consumption

Results – Memory consumptionsRR=

Conclusion – on testsSesame performs best in almost all aspects, even when including reasoningReasoning has big impact on Jena TDB at query timeMulgara is clearly not adequate in a small-scale scenario

Conclusion – on small-scale benchmarkingValidates our assumption that small-scale benchmarks give different results than large-scale benchmarksPoints out the need for more work to tackle the small-scale scenariosResults are not always clear cut in every aspects: benchmarks as support to decide which tool to use, depending on the application constraints

How much Semantic Data on Small Devices?

More Related Content

Similar to How much Semantic Data on Small Devices? (20)

More from Mathieu d'Aquin (20)

Recently uploaded (20)

How much Semantic Data on Small Devices?