4th Natural Language Interface over the Web of Data
(NLIWoD) workshop and QALD-9
Question Answering over Linked Data Challenge
Presenter: Prof. Key-Sun Choi and Dr. Muhammad Saleem
NLIWoD 4 and QALD-9 @ ISWC 2018
Monterey, USA
Horizon 2020, GA No 688227
9th October 2018
Usbeck (DICE Data Science Group, University Paderborn) NLIWoD and QALD-9 9th October 2018 1 / 17
Organization committee
Key-Sun Choi
KAIST, Korea
Jin-Dong Kim
Database Center for Life Science, Japan
Axel-Cyrille Ngonga Ngomo
Paderborn University, Germany
Muhammad Saleem
Leipzig University, Germany
Ricardo Usbeck
Paderborn University, Germany
Program committee
Kody Moodley, Maastricht University
Grigorios Tzortzis, NCSR Demokritos
Vanessa Lopez, IBM
Dennis Diefenbach, Université Jean Monnet
Kuldeep Singh, Fraunhofer IAIS
Edgard Marx, Leipzig University of Applied Sciences (HTWK)
Raghava Mutharaju, IIIT-Delhi
Subhabrata Mukherjee, Max Planck Institute for Informatics
Varish Mulwad, GE Global Research
Roberto Garcia, Universitat de Lleida
Giorgos Giannopoulos, Imis Institute, "Athena" R.C.
Overview NLIWoD-4
Time | Author | Title
09.00-09.05 | Key-Sun Choi | Introduction
09.05-09.45 | Peter F. Patel-Schneider | Keynote: "Connecting Industrial NL Applications to Knowledge (in Nuance)"
09.45-10.15 | Richard Frost and Shane Peelar | An Extensible Natural-Language Query Interface to an Event-Based Semantic Web
10.15-10.30 | Younggyun Hahm, Jiho Kim, Sangmin An, Minho Lee and Key-Sun Choi | Chatbot Who Wants to Learn the Knowledge: KB-Agent
10.30-11.00 | Coffee Break
11.00-11.20 | Muhammad Saleem | QALD 9 Challenge Overview and Evaluation
11.20-11.50 | Jiho Kim, Sangha Nam and Key-Sun Choi | Open Relation Extraction by Matrix Factorization and Universal Schemas
11.50 - end | Kyriaki Zafeiroudi, Leah Eckman and Rebecca Passonneau | Best Paper: Testing a Knowledge Inquiry System on Question Answering
Keynote
Please welcome Peter F. Patel-Schneider for his keynote:
"Connecting Industrial NL Applications to Knowledge (in Nuance)"
Keynote
Coffee break!
Overview Question Answering
Question answering systems mediate between
A user expressing an information need in natural language
RDF-modelled data
Overview QALD
QALD is a series of evaluation campaigns that provide a benchmark for
comparing different approaches and systems
Get a picture of their strengths and shortcomings
Gain insight into how we can develop approaches that deal with Semantic
Web data as a knowledge source
QALD-1 @ ESWC 2011 (3)
QALD-2 @ ESWC 2012 (4)
QALD-3 @ CLEF 2013 (6)
QALD-4 @ CLEF 2014 QA track (9)
QALD-5 @ CLEF 2015 QA track (7)
QALD-6 @ ESWC 2016 (13)
QALD-7 @ ESWC 2017 (3)
QALD-8 @ ISWC 2017 (8/3)
QALD-9 @ ISWC 2018 (6/5)
Tasks
Overall task Given a natural language question, retrieve the correct
answer(s) from a given RDF repository.
Types of challenges (specific tasks):
1 Multilingual
2 Wikidata Canceled
Task 1 - Multilingual questions
Dataset: DBpedia 2016-10 (with multilingual labels)
Challenge: Lexical and structural gap between natural language expressions and
data, e.g.
high → elevation
have inhabitants → populationTotal
graduate from → almaMater
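The lexical gap above can be sketched in a few lines of Python. This is only an illustration, not how any participating system works: the `LEXICON` dictionary, the `build_query` helper, and the `dbo:`/`dbr:` prefixes are assumptions made for this example, and the three mappings are the ones listed on the slide.

```python
# Hypothetical phrase-to-property lexicon, filled with the three slide examples.
LEXICON = {
    "high": "dbo:elevation",
    "have inhabitants": "dbo:populationTotal",
    "graduate from": "dbo:almaMater",
}

# Assumed DBpedia namespace prefixes for the generated query.
PREFIXES = (
    "PREFIX dbo: <http://dbpedia.org/ontology/>\n"
    "PREFIX dbr: <http://dbpedia.org/resource/>\n"
)


def build_query(phrase: str, entity: str) -> str:
    """Build a SPARQL query for the property that `phrase` maps to.

    `entity` is the local name of a DBpedia resource, e.g. "Mount_Everest".
    Unknown phrases raise KeyError: this is the lexical gap in its rawest form.
    """
    prop = LEXICON[phrase]
    return f"{PREFIXES}SELECT ?value WHERE {{ dbr:{entity} {prop} ?value }}"


if __name__ == "__main__":
    # "How high is Mount Everest?" -> triple pattern over dbo:elevation
    print(build_query("high", "Mount_Everest"))
```

A fixed dictionary only covers phrases it has seen. Real questions vary the wording ("tall", "altitude") and the query structure, which is why the slide calls the gap both lexical and structural.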
Questions: 413 training, 150 test questions (out of which 50 are novel)
Provided in different languages
Can be answered with respect to the provided RDF data
Annotated with corresponding SPARQL queries and answers
The QALD-9 test set stems partly from chatbot logs
http://chat.dbpedia.org
The largest QALD ever!
Example
Which book has the most pages?
Welches Buch hat die meisten Seiten?
Quale libro ha il maggior numero di pagine?
Quel livre a le plus de pages?
¿Qué libro tiene el mayor número de páginas?
. . .
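A minimal sketch of how one such multilingual question might be stored alongside its SPARQL annotation. The field layout loosely follows the JSON style used for QALD datasets, but the `id`, the query text, and the `question_in` helper are assumptions for illustration. This is not the benchmark's actual entry, and no answers are included.

```python
# One question record: the same question in several languages, plus a query.
record = {
    "id": "example-1",  # hypothetical identifier
    "question": [
        {"language": "en", "string": "Which book has the most pages?"},
        {"language": "de", "string": "Welches Buch hat die meisten Seiten?"},
        {"language": "fr", "string": "Quel livre a le plus de pages?"},
    ],
    "query": {
        # Illustrative query only; assumes the DBpedia ontology prefix dbo:.
        "sparql": (
            "PREFIX dbo: <http://dbpedia.org/ontology/> "
            "SELECT ?uri WHERE { ?uri a dbo:Book ; dbo:numberOfPages ?n } "
            "ORDER BY DESC(?n) LIMIT 1"
        )
    },
}


def question_in(rec: dict, language: str):
    """Return the question string for a language code, or None if absent."""
    for variant in rec["question"]:
        if variant["language"] == language:
            return variant["string"]
    return None


if __name__ == "__main__":
    print(question_in(record, "de"))
    print(record["query"]["sparql"])
```

Because every language variant shares a single query, a system can be evaluated in any of the provided languages against the same gold answers.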
Data experts for creating QALD-9
Rricha Jalota, Paderborn University, Germany
Paramjot Kauer, Paderborn University, Germany
Abdullah Ahmed, Paderborn University, Germany
Danish Ahmed, Paderborn University, Germany
Nikit Srivasta, Paderborn University, Germany
Michael Röder, Paderborn University, Germany
Jan Reineke, Paderborn University, Germany
Alexander Bigerl, Paderborn University, Germany
Afshin Amini, Paderborn University, Germany
Geraldo De Souza, Paderborn University, Germany
Felix Conrads, Paderborn University, Germany and InfAI e.V. Leipzig
Participants in Task 1 - English
Dennis Diefenbach - Université Jean Monnet, Saint-Étienne
WDAqua-core1: DBpedia http://wdaqua.eu/qa
Task 1, English, French
Sen Hu - School of Electronics Engineering and Computer Science, Peking University
gAnswer http://ganswer.gstore-pku.com/api/qald.jsp?
Task 1, English
Peter Nancke et al. - Leipzig University, Germany
TeBaQA http://139.18.2.39:8187/
Task 1, English
Szabó Bence et al. - Paderborn University, Germany
Elon http://qald-beta.cs.upb.de:443/
Task 1, English
Lukas Blübaum and Nick Düsterhus - Paderborn University, Germany
QASystem http://qald-beta.cs.upb.de:80/
Task 1, English
And the winner...
Curve balls: real-world queries (dirty!), new query forms (nasty!), weakly defined answer types (not nice!)
All QA systems were run on the English QALD-9 training and test datasets with GERBIL QA version 0.2.3
More details at: www.semantic-web-journal.net/content/benchmarking-question-answering-systems
FAIR experiment data for the training and test datasets at
http://w3id.org/gerbil/qa/experiment?id=201810080002
http://w3id.org/gerbil/qa/experiment?id=201810060001
And the winner...
...is gAnswer!
Annotator | Macro Precision | Macro Recall | Macro F1 | Error Count | Average Time/Doc (ms) | Macro F1 QALD
Elon (WS) | 0.049 | 0.053 | 0.050 | 2 | 219 | 0.100
QASystem (WS) | 0.097 | 0.116 | 0.098 | 0 | 1014 | 0.200
TeBaQA (WS) | 0.129 | 0.134 | 0.130 | 0 | 2668 | 0.222
wdaqua-core1 (DBpedia) | 0.261 | 0.267 | 0.250 | 0 | 661 | 0.289
gAnswer (WS) | 0.293 | 0.327 | 0.298 | 1 | 3076 | 0.430
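For readers unfamiliar with the macro-averaged columns, here is a minimal sketch of how macro precision, recall and F1 are commonly computed: each question is scored on its own and the per-question scores are then averaged. The function names and toy answer sets are made up for illustration. The handling of empty answer sets is a simplifying assumption and may differ from what GERBIL QA does; the separate "Macro F1 QALD" column follows its own convention, which is not reproduced here.

```python
def prf(gold: set, system: set):
    """Precision, recall and F1 for a single question's answer sets."""
    tp = len(gold & system)  # answers that are both returned and correct
    precision = tp / len(system) if system else 0.0  # assumption: empty -> 0
    recall = tp / len(gold) if gold else 0.0         # assumption: empty -> 0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


def macro_scores(pairs):
    """Average per-question precision, recall and F1 over (gold, system) pairs."""
    scores = [prf(gold, system) for gold, system in pairs]
    n = len(scores)
    return tuple(sum(column) / n for column in zip(*scores))


if __name__ == "__main__":
    # Toy data: one perfect answer, one partial answer, one unanswered question.
    toy = [
        ({"a"}, {"a"}),
        ({"b", "c"}, {"b"}),
        ({"d"}, set()),
    ]
    p, r, f = macro_scores(toy)
    print(f"macro P={p:.3f} R={r:.3f} F1={f:.3f}")
```

Because macro F1 averages the per-question F1 values, it need not equal the harmonic mean of macro precision and macro recall. The same holds in the table: gAnswer's 0.293 and 0.327 would give roughly 0.309 as a harmonic mean, while the reported Macro F1 is 0.298.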
That’s all Folks!
Thank you!
Questions?
Data Science@UPB
Follow us on Twitter @DiceResearch