User-driven Quality
Evaluation of DBpedia
Amrapali Zaveri, Dimitris Kontokostas,
Mohamed A. Sherif, Lorenz Bühmann,
Mohamed Morsey, Sören Auer, Jens Lehmann
Outline
❏Data Quality
❏Data Quality Assessment Methodology
❏Evaluating Quality of Dbpedia
❏ Manual
❏ Semi-automatic
❏Results
❏Conclusion & Future Work
Data Quality
● Data Quality (DQ) is defined as:
○ fitness for a certain use case*
● On the Data Web - varying quality of information
covering various domains
● High quality datasets
○ curated over decades - e.g. life science domain
○ crowdsourcing process - extracted from unstructured
and semi-structured information, e.g. DBpedia
* J. Juran. The Quality Control Handbook. McGraw-Hill, New York, 1974.
Data Quality Assessment
Methodology
4 Step Methodology:
❏ Step 1: Resource selection
❏ Per Class
❏ Completely random
❏ Manual
❏ Step 2: Evaluation mode
selection
❏ Manual
❏ Semi-automatic
❏ Automatic
❏ Step 3: Resource evaluation
❏ Step 4: DQ improvement
❏ Direct
❏ Indirect
Evaluating Quality of Dbpedia
– Manual
❏Phase 1: Creation of quality problem
taxonomy
❏Phase 2: User-driven quality assessment
Evaluating Quality of Dbpedia
– Manual
❏Phase 1: Creation of quality problem
taxonomy
❏Phase 2: User-driven quality assessment
Quality Problem Taxonomy
Dimension Category Sub-category D F Dbpedia
Specific
Accuracy Triple
Incorrectly
extracted
Object value is incompletely extracted - E -
Object value in incorrectly extracted - E -
Special template not properly recognized √ E √
Datatype
problems
Datatype incorrectly extracted √ E -
Implicit
relation-
ships
between
attributes
One fact is encoded in several attributes - M √
Several facts are encoded in one attribute - E -
Attribute value computed from another
attribute value
- E
+
M
√
D = Detectable means problem detection can be automized.
F = Fixable means the issue is solvable by amending either the extraction framework (E), the mappings
wiki (M) or Wikipedia (W).
Quality Problem Taxonomy
Dimension Category Sub-category D F Dbpedia
Specific
Relevancy Irrelevant inform-
ation extracted
Extraction of attributes containing
layout information
√ E √
Redundant attribute values √ - -
Image related information √ E √
Other irrelevant information √ E -
Represen-
tational
Consistency
Representation of
number values
Inconsistency in representation of
number values
√ W -
Interlinking External links External websites √ W -
Interlinks with other
datasets
Links to Wikimedia √ E -
Links to Freebase √ E -
Links to Geospecies √ E -
Links generated via Flickr wrapper √ E -
Evaluating Quality of Dbpedia
– Manual
❏Phase 1: Creation of quality problem
taxonomy
❏Phase 2: User-driven quality assessment
User-driven quality assessment
Type Contest-based
Participants LD experts
Task Detect and classify LD quality issues
Time 1 month
Reward 300 EU prize
Tool TripleChekMate
Crowdsourcing
 HITs (Human Intelligent Tasks),
 Submit to a crowdsourcing platform (e.g. Amazon Mechanical Turk)
 Financial Reward for each HIT
DQ Assessment Tool -
TripleCheckMate
http://nl.dbpedia.org:8080/TripleCheckMate-Demo/
Evaluating Results -
Manual Methodology
Total no. of users 58
Total no. of distinct resources evaluated 521
Total no. of resources evaluated 792
Total no. of distinct resources without problems 86
Total no. of distinct resources with problems 435
Total no. of distinct incorrect triples 2928
Total no. of distinct incorrect triples in the dbprop namespace 1745
Total no. of inter-evaluations 268
No. of resources with evaluators having different opinions 89
Resource-based inter-rater agreement (Cohen’s kappa) 0.34
Triple-based inter-rater agreement (Cohen’s kappa) 0.38
Evaluating Results -
Manual Methodology
No. of triples evaluated for correctness 700
No. of triples evaluated to be correct 567
No. of triples evaluated incorrectly 133
% of triples correctly evaluated 81
Average no. of problems per resource 5.69
Average no. of problems per resource in the dbprop namespae 3.45
Average no. of triples per resource 47.19
% of triples affected 11.93
% of triples affected in the dbprop namespace 7.11
Evaluating Quality of Dbpedia
– Semi-automatic
❏ Step 1: Automatic creation of an extended
schema
❏ DL-Learner*
❏ for all properties in DBpedia, axioms expressing the (inverse)
functional, irreflexive and asymmetric characteristic were
generated
❏ minimum confidence value of 0.95
❏ Step 2: Manual evaluation of the generated
axioms
❏ 100 random axioms per type
❏ Restricted evaluation of those axioms where at least one
violation is found
❏ Taking target context into account
*J. Lehmann. DL-Learner: learning concepts in description logics. Journal of Machine Learning
Research (JMLR), 10:2639{2642, 2009.
Evaluation Results
- Semi-automatic
❏ Irreflexivity:
❏ dbpedia:2012_Coppa_Italia_Final dbpedia-owl:followingEvent
dbpedia:2012_Coppa_Italia_Final
❏ Asymmetry:
❏ dbpedia-owl:starring with domain Work and range Actor
❏ Functionality:
❏ 2 different values 2600.0 and 1630.0 for the density of the moon Himalia.
❏ Inverse Functionality:
❏ Domain: dbpedia-owl:FormulaOneRacer
Range:dbpedia-owl:GrandPrix
Violation:
dbpedia:Fernando_Alonso dbpedia-owl:firstWin
dbpedia:2003_Hungarian_Grand_Prix .
dbpedia:WikiProject_Formula_one dbpedia-owl:firstWin
dbpedia:2003_Hungarian_Grand_Prix .
Evaluation Results -
Semi-automatic methodology
Characteristic #Properties Correct #Violation
Total Violated Min Max Avg. Total
Irreflexivity 142 24 24 1 133 9.8 236
Asymmetry 500 144 81 1 628 16.7 1358
Functionality 739 671 76 1 91581 2624.7 199480
Inverse
Functionality
52 49 13 8 18236 1685.2 21908
Conclusion & Future Work
● Empirical quality analysis for more than 500
resources of a large linked dataset extracted
from crowdsourced content
● Future work:
○ Fix problems detected (Improvement step)
○ Assess other LOD sources
○ Adopt an agile methodology to improve quality of LOD
○ Revisit quality analysis (in regular intervals)
Thank You
Questions?
http://aksw.org/AmrapaliZaveri
zaveri@informatik.uni-leipzig.de
Twitter: @amrapaliz

More Related Content

PDF
[CS570] Machine Learning Team Project (I know what items really are)
PDF
Chapter3 bag2
PDF
Menu of services 2011
PDF
Converting GHO to RDF
PDF
TripleCheckMate
PPT
Making PowerPoint Accessible
PDF
PhD thesis defense: Large-scale multilingual knowledge extraction, publishin...
[CS570] Machine Learning Team Project (I know what items really are)
Chapter3 bag2
Menu of services 2011
Converting GHO to RDF
TripleCheckMate
Making PowerPoint Accessible
PhD thesis defense: Large-scale multilingual knowledge extraction, publishin...

Similar to User-driven Quality Evaluation of DBpedia (20)

PDF
Crowdsourcing Linked Data Quality Assessment
PPTX
An analysis of the quality issues of the properties available in the Spanish ...
ODP
Data-driven Joint Debugging of the DBpedia Mappings and Ontology
ODP
Type Inference on Noisy RDF Data
PPTX
Leveraging DBpedia for Adaptive Crowdsourcing in Linked Data Quality Assessment
PDF
RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)
PDF
DBpediaSameAs
PDF
DBpediaSameAs
PDF
KEDL DBpedia 2019
PPTX
Using Linked Data to Mine RDF from Wikipedia's Tables
PDF
20150209 improving the_d_bpedia_ontology_v2
PPT
Mining Historical Data for DBpedia via Temporal Tagging of Wikipedia Infoboxes
PDF
Sebastian Hellmann
PDF
Tackling Usability Challenges in Querying Massive, Ultra-heterogeneous Graphs
PDF
Scaling Quality on Quora Using Machine Learning
PPTX
Loupe API - A Linked Data Profiling Service for Quality Assessment
PDF
Mapping Keywords to
PPT
Query Dependent Pseudo-Relevance Feedback based on Wikipedia
PPTX
Crowdsourcing Linked Data Quality Assessment
PPTX
FEASIBLE-Benchmark-Framework-ISWC2015
Crowdsourcing Linked Data Quality Assessment
An analysis of the quality issues of the properties available in the Spanish ...
Data-driven Joint Debugging of the DBpedia Mappings and Ontology
Type Inference on Noisy RDF Data
Leveraging DBpedia for Adaptive Crowdsourcing in Linked Data Quality Assessment
RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)
DBpediaSameAs
DBpediaSameAs
KEDL DBpedia 2019
Using Linked Data to Mine RDF from Wikipedia's Tables
20150209 improving the_d_bpedia_ontology_v2
Mining Historical Data for DBpedia via Temporal Tagging of Wikipedia Infoboxes
Sebastian Hellmann
Tackling Usability Challenges in Querying Massive, Ultra-heterogeneous Graphs
Scaling Quality on Quora Using Machine Learning
Loupe API - A Linked Data Profiling Service for Quality Assessment
Mapping Keywords to
Query Dependent Pseudo-Relevance Feedback based on Wikipedia
Crowdsourcing Linked Data Quality Assessment
FEASIBLE-Benchmark-Framework-ISWC2015
Ad

More from Amrapali Zaveri, PhD (13)

PDF
Data Quality and the FAIR principles
PDF
Workshop on Data Quality Management in Wikidata
PDF
ESOF Panel 2018
PDF
CrowdED: Guideline for optimal Crowdsourcing Experimental Design
PDF
MetaCrowd: Crowdsourcing Gene Expression Metadata Quality Assessment
PDF
smartAPI: Towards a more intelligent network of Web APIs
PDF
Introduction to Bio SPARQL
PDF
Linked Data Quality Assessment: A Survey
PDF
Amrapali Zaveri Defense
PDF
LDQ 2014 DQ Methodology
PDF
Towards Biomedical Data Integration for Analyzing the Evolution of Cognition
KEY
ReDD-Observatory
Data Quality and the FAIR principles
Workshop on Data Quality Management in Wikidata
ESOF Panel 2018
CrowdED: Guideline for optimal Crowdsourcing Experimental Design
MetaCrowd: Crowdsourcing Gene Expression Metadata Quality Assessment
smartAPI: Towards a more intelligent network of Web APIs
Introduction to Bio SPARQL
Linked Data Quality Assessment: A Survey
Amrapali Zaveri Defense
LDQ 2014 DQ Methodology
Towards Biomedical Data Integration for Analyzing the Evolution of Cognition
ReDD-Observatory
Ad

Recently uploaded (20)

PDF
MBA _Common_ 2nd year Syllabus _2021-22_.pdf
PDF
Hazard Identification & Risk Assessment .pdf
PDF
Trump Administration's workforce development strategy
PDF
AI-driven educational solutions for real-life interventions in the Philippine...
PDF
HVAC Specification 2024 according to central public works department
DOC
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
PPTX
Computer Architecture Input Output Memory.pptx
PPTX
B.Sc. DS Unit 2 Software Engineering.pptx
PPTX
CHAPTER IV. MAN AND BIOSPHERE AND ITS TOTALITY.pptx
PDF
advance database management system book.pdf
PDF
احياء السادس العلمي - الفصل الثالث (التكاثر) منهج متميزين/كلية بغداد/موهوبين
PDF
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 2).pdf
PPTX
202450812 BayCHI UCSC-SV 20250812 v17.pptx
PDF
Practical Manual AGRO-233 Principles and Practices of Natural Farming
PDF
Complications of Minimal Access-Surgery.pdf
PDF
Paper A Mock Exam 9_ Attempt review.pdf.
PPTX
Introduction to pro and eukaryotes and differences.pptx
PDF
International_Financial_Reporting_Standa.pdf
PDF
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 1)
PDF
FORM 1 BIOLOGY MIND MAPS and their schemes
MBA _Common_ 2nd year Syllabus _2021-22_.pdf
Hazard Identification & Risk Assessment .pdf
Trump Administration's workforce development strategy
AI-driven educational solutions for real-life interventions in the Philippine...
HVAC Specification 2024 according to central public works department
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
Computer Architecture Input Output Memory.pptx
B.Sc. DS Unit 2 Software Engineering.pptx
CHAPTER IV. MAN AND BIOSPHERE AND ITS TOTALITY.pptx
advance database management system book.pdf
احياء السادس العلمي - الفصل الثالث (التكاثر) منهج متميزين/كلية بغداد/موهوبين
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 2).pdf
202450812 BayCHI UCSC-SV 20250812 v17.pptx
Practical Manual AGRO-233 Principles and Practices of Natural Farming
Complications of Minimal Access-Surgery.pdf
Paper A Mock Exam 9_ Attempt review.pdf.
Introduction to pro and eukaryotes and differences.pptx
International_Financial_Reporting_Standa.pdf
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 1)
FORM 1 BIOLOGY MIND MAPS and their schemes

User-driven Quality Evaluation of DBpedia

  • 1. User-driven Quality Evaluation of DBpedia Amrapali Zaveri, Dimitris Kontokostas, Mohamed A. Sherif, Lorenz Bühmann, Mohamed Morsey, Sören Auer, Jens Lehmann
  • 2. Outline ❏Data Quality ❏Data Quality Assessment Methodology ❏Evaluating Quality of Dbpedia ❏ Manual ❏ Semi-automatic ❏Results ❏Conclusion & Future Work
  • 3. Data Quality ● Data Quality (DQ) is defined as: ○ fitness for a certain use case* ● On the Data Web - varying quality of information covering various domains ● High quality datasets ○ curated over decades - e.g. life science domain ○ crowdsourcing process - extracted from unstructured and semi-structured information, e.g. DBpedia * J. Juran. The Quality Control Handbook. McGraw-Hill, New York, 1974.
  • 4. Data Quality Assessment Methodology 4 Step Methodology: ❏ Step 1: Resource selection ❏ Per Class ❏ Completely random ❏ Manual ❏ Step 2: Evaluation mode selection ❏ Manual ❏ Semi-automatic ❏ Automatic ❏ Step 3: Resource evaluation ❏ Step 4: DQ improvement ❏ Direct ❏ Indirect
  • 5. Evaluating Quality of Dbpedia – Manual ❏Phase 1: Creation of quality problem taxonomy ❏Phase 2: User-driven quality assessment
  • 6. Evaluating Quality of Dbpedia – Manual ❏Phase 1: Creation of quality problem taxonomy ❏Phase 2: User-driven quality assessment
  • 7. Quality Problem Taxonomy Dimension Category Sub-category D F Dbpedia Specific Accuracy Triple Incorrectly extracted Object value is incompletely extracted - E - Object value in incorrectly extracted - E - Special template not properly recognized √ E √ Datatype problems Datatype incorrectly extracted √ E - Implicit relation- ships between attributes One fact is encoded in several attributes - M √ Several facts are encoded in one attribute - E - Attribute value computed from another attribute value - E + M √ D = Detectable means problem detection can be automized. F = Fixable means the issue is solvable by amending either the extraction framework (E), the mappings wiki (M) or Wikipedia (W).
  • 8. Quality Problem Taxonomy Dimension Category Sub-category D F Dbpedia Specific Relevancy Irrelevant inform- ation extracted Extraction of attributes containing layout information √ E √ Redundant attribute values √ - - Image related information √ E √ Other irrelevant information √ E - Represen- tational Consistency Representation of number values Inconsistency in representation of number values √ W - Interlinking External links External websites √ W - Interlinks with other datasets Links to Wikimedia √ E - Links to Freebase √ E - Links to Geospecies √ E - Links generated via Flickr wrapper √ E -
  • 9. Evaluating Quality of Dbpedia – Manual ❏Phase 1: Creation of quality problem taxonomy ❏Phase 2: User-driven quality assessment
  • 10. User-driven quality assessment Type Contest-based Participants LD experts Task Detect and classify LD quality issues Time 1 month Reward 300 EU prize Tool TripleChekMate Crowdsourcing  HITs (Human Intelligent Tasks),  Submit to a crowdsourcing platform (e.g. Amazon Mechanical Turk)  Financial Reward for each HIT
  • 11. DQ Assessment Tool - TripleCheckMate http://nl.dbpedia.org:8080/TripleCheckMate-Demo/
  • 12. Evaluating Results - Manual Methodology Total no. of users 58 Total no. of distinct resources evaluated 521 Total no. of resources evaluated 792 Total no. of distinct resources without problems 86 Total no. of distinct resources with problems 435 Total no. of distinct incorrect triples 2928 Total no. of distinct incorrect triples in the dbprop namespace 1745 Total no. of inter-evaluations 268 No. of resources with evaluators having different opinions 89 Resource-based inter-rater agreement (Cohen’s kappa) 0.34 Triple-based inter-rater agreement (Cohen’s kappa) 0.38
  • 13. Evaluating Results - Manual Methodology No. of triples evaluated for correctness 700 No. of triples evaluated to be correct 567 No. of triples evaluated incorrectly 133 % of triples correctly evaluated 81 Average no. of problems per resource 5.69 Average no. of problems per resource in the dbprop namespae 3.45 Average no. of triples per resource 47.19 % of triples affected 11.93 % of triples affected in the dbprop namespace 7.11
  • 14. Evaluating Quality of Dbpedia – Semi-automatic ❏ Step 1: Automatic creation of an extended schema ❏ DL-Learner* ❏ for all properties in DBpedia, axioms expressing the (inverse) functional, irreflexive and asymmetric characteristic were generated ❏ minimum confidence value of 0.95 ❏ Step 2: Manual evaluation of the generated axioms ❏ 100 random axioms per type ❏ Restricted evaluation of those axioms where at least one violation is found ❏ Taking target context into account *J. Lehmann. DL-Learner: learning concepts in description logics. Journal of Machine Learning Research (JMLR), 10:2639{2642, 2009.
  • 15. Evaluation Results - Semi-automatic ❏ Irreflexivity: ❏ dbpedia:2012_Coppa_Italia_Final dbpedia-owl:followingEvent dbpedia:2012_Coppa_Italia_Final ❏ Asymmetry: ❏ dbpedia-owl:starring with domain Work and range Actor ❏ Functionality: ❏ 2 different values 2600.0 and 1630.0 for the density of the moon Himalia. ❏ Inverse Functionality: ❏ Domain: dbpedia-owl:FormulaOneRacer Range:dbpedia-owl:GrandPrix Violation: dbpedia:Fernando_Alonso dbpedia-owl:firstWin dbpedia:2003_Hungarian_Grand_Prix . dbpedia:WikiProject_Formula_one dbpedia-owl:firstWin dbpedia:2003_Hungarian_Grand_Prix .
  • 16. Evaluation Results - Semi-automatic methodology Characteristic #Properties Correct #Violation Total Violated Min Max Avg. Total Irreflexivity 142 24 24 1 133 9.8 236 Asymmetry 500 144 81 1 628 16.7 1358 Functionality 739 671 76 1 91581 2624.7 199480 Inverse Functionality 52 49 13 8 18236 1685.2 21908
  • 17. Conclusion & Future Work ● Empirical quality analysis for more than 500 resources of a large linked dataset extracted from crowdsourced content ● Future work: ○ Fix problems detected (Improvement step) ○ Assess other LOD sources ○ Adopt an agile methodology to improve quality of LOD ○ Revisit quality analysis (in regular intervals)