SlideShare a Scribd company logo
Multilingual Retrieval Interface for Structured Data on the Web 
Dana Dannells, Ramona Enache, Mariana Damova 
NLIWoD - ISWC’2014
Semantic Data Infrastructures 
Leonardo 
Mona Lisa 
RDF Repository 
SPARQL 
:Painter 
:Painting 
:painted 
Mona Lisa 
? 
rdf:type 
rdf:type 
• Semantic Web 
• Linked data 
• SPARQL query language 
2 
10/19/2014
Natural Interface through NL 
10/19/2014 
3 
EN 
FR 
DE 
Who painted Mona Lisa? 
Qui a paint Mona Lisa? 
Wer hat Mona Lisa gemahlt? 
Who is Mona Lisa’s painter? 
Qui est le paintre de Mona Lisa? 
Wer ist der Mahler von Mona Lisa ? 
Who created Mona Lisa? 
Qui a créé Mona Lisa? 
Wer hat Mona Lisa geschöpft? 
Leonardo da Vinci 
A: 
Q: 
NL to ontology interoperability
Multilingual Retrieval Interface 
10/19/2014 
4
GF – Grammar Framework 
•Type-theoretical grammar formalism supporting multilingual applications 
•Two-layered architecture 
–Abstract syntax - semantics 
–Concrete syntax – language dependent surface structure 
10/19/2014 
5 
Abstract syntax: Concrete English syntax: Abstract representation: 
cat NP, VP, S; lincat NP, VP, S = {s: Str}; lin Mary = mkNP (mkPN "Mary"); 
fun Mary, John: NP; lin Mary = {s = "Mary"}; lin John = mkNP (mkPN "John"); 
fun Love: NP -> VP; lin John = {s = "John"}; lin Love o = mkVP (mkV2 "love") o; 
fun Pred : NP -> VP -> S; lin Love o = {s = "loves" ++ o.s}; lin Pred sub v = mkS (mkCl sub v); lin Pred sub v = {s = sub.s ++ v.s}; 
Ex: John loves Mary
Multilingual Aspect of GF 
10/19/2014 
6
YAQL (Yet Another Query language) 
•A common architecture with one base module and domain knowledge representation 
•Straightforward abstract syntax generation from ontology with just the minimum lexical types 
–Common noun – Kind 
–Noun phrase – Entity 
–Verb phrase – Property 
–Verb phrase with higher arity – Relation 
•Reusable generic grammar structure 
10/19/2014 
7 
Abstract syntax: Concrete syntax: 
Move ; Move = Utt ; 
Query ; Query = QS ; 
MQuery : Query -> Move ; 
Query 
Command 
Answer
YAQL and the Semantic Web 
The category Kind gets coupled with OWL entities 
10/19/2014 
8 
•Yet another NL two layers so that a new domain model can be easily integrated into the query module 
•Bidirectional translation in 15 languages (Bulgarian, Catalan, Danish, Dutch, English, Finnish, French, Hebrew, Italian, German, Norwegian, Romanian, Russian, Spanish, Swedish) 
Text 
Query 
Answer 
Data 
Lexicon 
RGL 
YAQL
NL to SPARQL 
10/19/2014 
9
Who painted Mona Lisa? 
English: Who painted t ? 
QPainter t = mkQS pastTense (mkQCl who_IP paint_V2 t) 
Finnish: Whose painting is t ?} 
QPainter t = mkQS (mkQCl (mkIP (E.GenIP who_IP) (mkN "maalaama")) t) 
French: By who is t ? 
QPainter t = mkQS (mkQCl (mkIAdv by8agent_Prep who_IP) t) 
10/19/2014 
10 
$MQuery (QPainter (PTitle TMona_Lisa))$ 
Abstract syntax 
SPARQL 
MQuery q = "PREFIX painting:<http://spraakbanken.gu.se/rdf/owl/painting.owl#> PREFIX rdf:<http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX rdfs:<http://www.w3.org/2000/01/rdf-schema#>> SELECT distinct"++ q.wh1 ++ " WHERE { ?painting rdf:type painting:Painting; rdfs:label ?title; " ++ q.wh2 ++ q.prop++"}" ; 
Concrete syntax
Evaluation 
•User satisfaction 
•Efficiency in terms of time, effort and cost 
•Effectiveness, how the system scales up 
10/19/2014 
11 
Coverage: 
1159 query patterns in 15 languages 
10 characteristics of CH objects 
Extendibility 
New query grammar - 150 lines of code 
Evaluation 
Random queries in 7 languages with very few native informants’ corrections
Conclusion 
•NL to ontology interoperability approach 
•Multilingual interface for retrieval of structured data from the Web 
•Easily extendable initial base of YAQL transformations 
•Great coverage of paraphrases 
•Expert language/information engineers required 
10/19/2014 
12
Thank you for your attention 
10/19/2014 
13 
Contacts: 
dana.dannells@svenska.gu.se 
ra.monique@gmail.com 
mariana.damova@mozajka.co 
?

More Related Content

PDF
TRIPTICO GUADALINFO
PPT
Contextual ontology alignment may 2011
PDF
Rise 2014 st requier
PPT
Arenas En Hormigones Y Morteros
PDF
Tsh masterclass you've got seed... now what?
PPT
BBC JUICER API Presentation - for SeedHack 4.0 - BBC News Labs
PPT
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
PPTX
Presentation of the INVENiT Expert Meeting on Monday 16 February 2015
TRIPTICO GUADALINFO
Contextual ontology alignment may 2011
Rise 2014 st requier
Arenas En Hormigones Y Morteros
Tsh masterclass you've got seed... now what?
BBC JUICER API Presentation - for SeedHack 4.0 - BBC News Labs
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Presentation of the INVENiT Expert Meeting on Monday 16 February 2015

Similar to NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on the Web (12)

ZIP
XSLT+SPARQL: Scripting the Semantic Web with SPARQL embedded into XSLT styles...
PDF
Knowledge graph construction with a façade - The SPARQL Anything Project
PDF
The SPARQL Anything project
ODP
Ontologies and Semantic in OpenSource projects
PDF
Using linguistic analysis to translate
PPTX
Dependency Parsing-based QA System for RDF and SPARQL
PPT
Ontology Engineering: Ontology Use
PPT
Semantic web
PPT
Semantic Web: From Representations to Applications
PDF
MLGrafViz: multilingual ontology visualization plug-in for Protégé
PDF
VALA Tech Camp 2017: Intro to Wikidata & SPARQL
PDF
Culture Geeks Feb talk: Adventures in Linked Data Land
XSLT+SPARQL: Scripting the Semantic Web with SPARQL embedded into XSLT styles...
Knowledge graph construction with a façade - The SPARQL Anything Project
The SPARQL Anything project
Ontologies and Semantic in OpenSource projects
Using linguistic analysis to translate
Dependency Parsing-based QA System for RDF and SPARQL
Ontology Engineering: Ontology Use
Semantic web
Semantic Web: From Representations to Applications
MLGrafViz: multilingual ontology visualization plug-in for Protégé
VALA Tech Camp 2017: Intro to Wikidata & SPARQL
Culture Geeks Feb talk: Adventures in Linked Data Land
Ad

More from Mariana Damova, Ph.D (20)

PPT
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
PDF
Geography of Letters - The Spirituality of Sofia in the Historic Memory
PDF
Startup Europe Week Sofia 2017 - Introduction
PDF
IndustryInform Service of Mozaika
PDF
Семантични технологии основи
PDF
IndustryInform Demo March 2016
PDF
Startup Europe Week Sofia introduction
PDF
Mozaika-Jan2016a
PDF
Concordia july2015
PDF
Industry informofmozaikathehumanizingtechnologieslab june23
PDF
Industry informofmozaikathehumanizingtechnologieslab june23
PDF
Communication channels for the european single digital market
PDF
Bulgariana europeana27112013 ним
PDF
Mozaika june2014
PDF
Europeana in Bulgaria
PDF
Bulgariana europeana02112013
PDF
проектиране на онтологии и връзката им с езиковите технологии
PDF
семантични технологии основи
PDF
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
PPT
National aggregatorvarna032013 marianadamova
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
Geography of Letters - The Spirituality of Sofia in the Historic Memory
Startup Europe Week Sofia 2017 - Introduction
IndustryInform Service of Mozaika
Семантични технологии основи
IndustryInform Demo March 2016
Startup Europe Week Sofia introduction
Mozaika-Jan2016a
Concordia july2015
Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23
Communication channels for the european single digital market
Bulgariana europeana27112013 ним
Mozaika june2014
Europeana in Bulgaria
Bulgariana europeana02112013
проектиране на онтологии и връзката им с езиковите технологии
семантични технологии основи
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
National aggregatorvarna032013 marianadamova
Ad

Recently uploaded (20)

PPT
ISS -ESG Data flows What is ESG and HowHow
PDF
Introduction to the R Programming Language
PDF
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
PPTX
SAP 2 completion done . PRESENTATION.pptx
PPTX
retention in jsjsksksksnbsndjddjdnFPD.pptx
PPTX
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
PPTX
Pilar Kemerdekaan dan Identi Bangsa.pptx
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PPTX
CYBER SECURITY the Next Warefare Tactics
PPT
lectureusjsjdhdsjjshdshshddhdhddhhd1.ppt
PPTX
Introduction to Inferential Statistics.pptx
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
PPTX
Topic 5 Presentation 5 Lesson 5 Corporate Fin
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PDF
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PDF
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
ISS -ESG Data flows What is ESG and HowHow
Introduction to the R Programming Language
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
SAP 2 completion done . PRESENTATION.pptx
retention in jsjsksksksnbsndjddjdnFPD.pptx
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
Pilar Kemerdekaan dan Identi Bangsa.pptx
Optimise Shopper Experiences with a Strong Data Estate.pdf
CYBER SECURITY the Next Warefare Tactics
lectureusjsjdhdsjjshdshshddhdhddhhd1.ppt
Introduction to Inferential Statistics.pptx
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
Topic 5 Presentation 5 Lesson 5 Corporate Fin
Qualitative Qantitative and Mixed Methods.pptx
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
STERILIZATION AND DISINFECTION-1.ppthhhbx

NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on the Web

  • 1. Multilingual Retrieval Interface for Structured Data on the Web Dana Dannells, Ramona Enache, Mariana Damova NLIWoD - ISWC’2014
  • 2. Semantic Data Infrastructures Leonardo Mona Lisa RDF Repository SPARQL :Painter :Painting :painted Mona Lisa ? rdf:type rdf:type • Semantic Web • Linked data • SPARQL query language 2 10/19/2014
  • 3. Natural Interface through NL 10/19/2014 3 EN FR DE Who painted Mona Lisa? Qui a paint Mona Lisa? Wer hat Mona Lisa gemahlt? Who is Mona Lisa’s painter? Qui est le paintre de Mona Lisa? Wer ist der Mahler von Mona Lisa ? Who created Mona Lisa? Qui a créé Mona Lisa? Wer hat Mona Lisa geschöpft? Leonardo da Vinci A: Q: NL to ontology interoperability
  • 5. GF – Grammar Framework •Type-theoretical grammar formalism supporting multilingual applications •Two-layered architecture –Abstract syntax - semantics –Concrete syntax – language dependent surface structure 10/19/2014 5 Abstract syntax: Concrete English syntax: Abstract representation: cat NP, VP, S; lincat NP, VP, S = {s: Str}; lin Mary = mkNP (mkPN "Mary"); fun Mary, John: NP; lin Mary = {s = "Mary"}; lin John = mkNP (mkPN "John"); fun Love: NP -> VP; lin John = {s = "John"}; lin Love o = mkVP (mkV2 "love") o; fun Pred : NP -> VP -> S; lin Love o = {s = "loves" ++ o.s}; lin Pred sub v = mkS (mkCl sub v); lin Pred sub v = {s = sub.s ++ v.s}; Ex: John loves Mary
  • 6. Multilingual Aspect of GF 10/19/2014 6
  • 7. YAQL (Yet Another Query language) •A common architecture with one base module and domain knowledge representation •Straightforward abstract syntax generation from ontology with just the minimum lexical types –Common noun – Kind –Noun phrase – Entity –Verb phrase – Property –Verb phrase with higher arity – Relation •Reusable generic grammar structure 10/19/2014 7 Abstract syntax: Concrete syntax: Move ; Move = Utt ; Query ; Query = QS ; MQuery : Query -> Move ; Query Command Answer
  • 8. YAQL and the Semantic Web The category Kind gets coupled with OWL entities 10/19/2014 8 •Yet another NL two layers so that a new domain model can be easily integrated into the query module •Bidirectional translation in 15 languages (Bulgarian, Catalan, Danish, Dutch, English, Finnish, French, Hebrew, Italian, German, Norwegian, Romanian, Russian, Spanish, Swedish) Text Query Answer Data Lexicon RGL YAQL
  • 9. NL to SPARQL 10/19/2014 9
  • 10. Who painted Mona Lisa? English: Who painted t ? QPainter t = mkQS pastTense (mkQCl who_IP paint_V2 t) Finnish: Whose painting is t ?} QPainter t = mkQS (mkQCl (mkIP (E.GenIP who_IP) (mkN "maalaama")) t) French: By who is t ? QPainter t = mkQS (mkQCl (mkIAdv by8agent_Prep who_IP) t) 10/19/2014 10 $MQuery (QPainter (PTitle TMona_Lisa))$ Abstract syntax SPARQL MQuery q = "PREFIX painting:<http://spraakbanken.gu.se/rdf/owl/painting.owl#> PREFIX rdf:<http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX rdfs:<http://www.w3.org/2000/01/rdf-schema#>> SELECT distinct"++ q.wh1 ++ " WHERE { ?painting rdf:type painting:Painting; rdfs:label ?title; " ++ q.wh2 ++ q.prop++"}" ; Concrete syntax
  • 11. Evaluation •User satisfaction •Efficiency in terms of time, effort and cost •Effectiveness, how the system scales up 10/19/2014 11 Coverage: 1159 query patterns in 15 languages 10 characteristics of CH objects Extendibility New query grammar - 150 lines of code Evaluation Random queries in 7 languages with very few native informants’ corrections
  • 12. Conclusion •NL to ontology interoperability approach •Multilingual interface for retrieval of structured data from the Web •Easily extendable initial base of YAQL transformations •Great coverage of paraphrases •Expert language/information engineers required 10/19/2014 12
  • 13. Thank you for your attention 10/19/2014 13 Contacts: dana.dannells@svenska.gu.se ra.monique@gmail.com mariana.damova@mozajka.co ?