SlideShare a Scribd company logo
Self-Adaptive Based 
Natural Language Interface 
for Disambiguation of 
Semantic Search 
NURFADHL INA MOHD SHARE F NURFADHL INA@UPM.EDU.MY 
MOHAMMAD YAS S ER SHAFAZAND 79. ZAND@GMAI L .COM 
FACULT Y OF COMPUT ER SCI ENCE AND INFORMAT ION T ECHNOLOGY, 
UNIVERS I T I PUTRA MALAYS IA 
S ERDANG, S E LANGOR, MALAYS IA
"Big Data" refers to data sets whose size is beyond the ability of 
typical database software tools to capture, store manage and analyze 
(McKinsey). 
“Linked Data” stands for semantically well structured, 
interconnected, syntactically interoperable datasets that are 
distributed among several repositories either inside or outside 
organisations http://www.semantic-web.at/big-data-linked-data
Utilizing Linked Data and Big Data for organisational and 
enterprise purposes will be one of the next big challenges in 
the evolution of the web. 
Big Data takes account of the fact that new techniques and 
technologies are needed for the sustainable and socially 
balanced exploitation of huge data pools. The Linked Data 
paradigm is one approach to cope with Big Data, as it advances 
the hypertext principle from a web of documents to a web of 
rich data.
Self adaptive based natural language interface for disambiguation of
Semantic Web: a webby way to link data 
Open Data meets the Semantic Web: Linked Open Data 
http://www.semantic-web-journal.net/system/files/swj488.pdf
One of the key challenges in making use of Big Data lies in 
finding ways of dealing with heterogeneity, diversity, and 
complexity of the data, while its volume and velocity forbid 
solutions available for smaller datasets as based, e.g., on 
manual curation or manual integration of data. Semantic 
Web Technologies are meant to deal with these issues, 
and indeed since the advent of Linked Data a few years ago, 
they have become central to mainstream Semantic Web 
research and development. 
We can easily understand Linked Data as being a part of the 
greater Big Data landscape, as many of the challenges are 
the same. The linking component of Linked Data, however, 
puts an additional focus on the integration and conflation 
of data across multiple sources.
Volume Velocity Variety 
BIG DATA 
Value and 
Veracity 
Supercomputing 
Internet of Things 
Semantic Web 
Social Science
Smart Data 
Smart data makes sense out of Big data 
http://amitsheth.blogspot.com/2013/06/transforming-big-data-into-smart-data.html 
It provides value from harnessing the challenges posed by 
volume, velocity, variety and veracity of big data, in-turn providing actionable 
information and improve decision making. 
uses background knowledge, experiences, advanced and contextualized 
reasoning, and is often highly personalized 
focused on the actionable value in data creation, processing and consumption 
phases for improving the human experience
5 steps to Turn Big Data into Smart Data 
http://tdwi.org/Articles/2014/07/15/Turning-Big-Data-into-Smart-Data-2.aspx?Page=1 
1. Add meaning 
2. Add context 
3. Embrace Graphs 
4. Iterate 
5. Adopt standard
Natural Language Query Generated SPARQL 
What is the lowest point in kansas? SELECT ?c0 
WHERE { 
?c0 ?p0 ?i0 . ?c0 a geo:LoPoint . 
filter (?i0 = geo:kansas) . 
filter ( ?p0 = geo:isLowestPointOf ) . 
} 
What is the area of idaho? SELECT ?i0 
WHERE { 
?c0 ?p0 ?i0 . 
filter (?c0 = geo:idaho) . 
filter ( ?p0 = geo:stateArea ) . 
} 
what states border oklahoma? SELECT ?i0 
WHERE { 
?c0 ?p0 ?i0 . ?i0 a geo:State . 
filter (?c0 = geo:oklahoma) . 
filter ( ?p0 = geo:borders ) . 
} 
what is the population of oregon? SELECT ?i0 
WHERE { 
?c0 ?p0 ?i0 . 
filter (?c0 = geo:oregon) . 
filter ( ?p0 = geo:statePopulation ) . 
}
Ambiguities in Querying Big Data 
when there are more than one possible concept annotation for a word in the 
NL input 
when a word inside the NL input cannot be matched with any KB concept 
when constructing the SPARQL where there is more than one possibility of 
SPARQL pattern
Self Adaptive Model for Semantic Data Search in Big Data
Input: NL query 
Output: Answer 
Process: 
1. Load ontology and build a matrix of the object properties, classes and instances and its 
connections 
2. Let T as the tokenized and stemmed NL query 
3. For each tT, let A be the set of annotation based on relevant concepts 
4. For each aA 
a. Create and add possible triplets, filters and options statements using dictionary 
and reasoner (using bottom up reasoning rules) 
b. Create new SPARQL syntax using (4(a)) 
c. Run SPARQL and send statements and results to reasoner. 
5. Return last created SPARQL syntax which has results.
Results 
The SANLI is tested on two different datasets namely the Mooney’s Geography 
ontology and a Quran structure ontology. 
SANLI is able to correctly answer all questions in the geography ontology 
where the questions have <s, p, o>, <o, p, s>, <p, o>, <o, p> and <o > patterns 
identified. 
Rules for other patterns have not yet been implemented. For example <o, p, o> 
patterns mostly result in a true false result as in “Does Texas border Oklahoma?” 
which we have not implemented yet.
Conclusion 
The Semantic Web can leverage the sophisticated analytics with big 
data. 
Big Data and Linked Data will be an integral part of the future web 
infrastructure, where massive amounts of data are available, 
connected and identifiable via Uniform Resource Identifiers. 
More personalized-based applications to exploit smart data to its 
maximum potential

More Related Content

PDF
Knowledge Graph Maintenance
PDF
Knowledge Graph Maintenance
PPTX
Thinking About the Making of Data
PPTX
The Challenge of Deeper Knowledge Graphs for Science
PDF
Knowledge Representation on the Web
PPTX
From Text to Data to the World: The Future of Knowledge Graphs
PPTX
Content + Signals: The value of the entire data estate for machine learning
PPTX
Sources of Change in Modern Knowledge Organization Systems
Knowledge Graph Maintenance
Knowledge Graph Maintenance
Thinking About the Making of Data
The Challenge of Deeper Knowledge Graphs for Science
Knowledge Representation on the Web
From Text to Data to the World: The Future of Knowledge Graphs
Content + Signals: The value of the entire data estate for machine learning
Sources of Change in Modern Knowledge Organization Systems

What's hot (20)

PDF
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
PPTX
Thoughts on Knowledge Graphs & Deeper Provenance
PPTX
The Roots: Linked data and the foundations of successful Agriculture Data
PPTX
Minimal viable-datareuse-czi
PPTX
From Data Search to Data Showcasing
PPTX
Combining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
PDF
An Ecosystem for Linked Humanities Data
PPTX
Data Communities - reusable data in and outside your organization.
PDF
Prov-O-Viz: Interactive Provenance Visualization
PPTX
The need for a transparent data supply chain
PPTX
Large Graph Mining
PDF
Research Knowledge Graphs at GESIS & NFDI4DataScience
PDF
Human-in-the-loop: the Web as Foundation for interdisciplinary Data Science M...
PDF
Data Tactics Analytics Brown Bag (Aug 22, 2013)
PDF
What makes a linked data pattern interesting?
PDF
Data Tactics Data Science Brown Bag (April 2014)
PDF
The web of data: how are we doing so far?
PPTX
Identifying semantics characteristics of user’s interactions datasets through...
PPTX
Towards Knowledge Graph based Representation, Augmentation and Exploration of...
PPTX
Data Science, Data Curation, and Human-Data Interaction
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Thoughts on Knowledge Graphs & Deeper Provenance
The Roots: Linked data and the foundations of successful Agriculture Data
Minimal viable-datareuse-czi
From Data Search to Data Showcasing
Combining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
An Ecosystem for Linked Humanities Data
Data Communities - reusable data in and outside your organization.
Prov-O-Viz: Interactive Provenance Visualization
The need for a transparent data supply chain
Large Graph Mining
Research Knowledge Graphs at GESIS & NFDI4DataScience
Human-in-the-loop: the Web as Foundation for interdisciplinary Data Science M...
Data Tactics Analytics Brown Bag (Aug 22, 2013)
What makes a linked data pattern interesting?
Data Tactics Data Science Brown Bag (April 2014)
The web of data: how are we doing so far?
Identifying semantics characteristics of user’s interactions datasets through...
Towards Knowledge Graph based Representation, Augmentation and Exploration of...
Data Science, Data Curation, and Human-Data Interaction
Ad

Viewers also liked (20)

PPT
Incremental Evolving Grammar Fragments
PPT
Incremental Evolving Grammar Fragments
PPT
ORDER INDEPENDENT INCREMENTAL EVOLVING FUZZY GRAMMAR FRAGMENT LEARNER
PPTX
semantic web & natural language
PPT
Many Worlds, the Born Rule, and Self-Locating Uncertainty
PPT
Setting Time Aright
PPTX
7 bran Simonova Necronomiconu
PDF
Krystalická hrůza H. P. Lovecrafta ve filmu
PPTX
Magické myšlení aneb jak magie ovlivňuje náš život
PDF
Tajemná krajina severovýchodních Čech
PPTX
A review of sentiment analysis approaches in big
PPT
Purpose and the Universe
PDF
OSGi Blueprint Services
PDF
Proposed smart traffic signal control in brunei darussalam
PPT
Against Space
PDF
Proceeding of Brunei International Conference on Engineering and Technology 2012
PPT
What We (Don't) Know About the Beginning of the Universe
PPT
The Origin of the Universe and the Arrow of Time
PDF
Design Physics
PPTX
Physics and Marketing
Incremental Evolving Grammar Fragments
Incremental Evolving Grammar Fragments
ORDER INDEPENDENT INCREMENTAL EVOLVING FUZZY GRAMMAR FRAGMENT LEARNER
semantic web & natural language
Many Worlds, the Born Rule, and Self-Locating Uncertainty
Setting Time Aright
7 bran Simonova Necronomiconu
Krystalická hrůza H. P. Lovecrafta ve filmu
Magické myšlení aneb jak magie ovlivňuje náš život
Tajemná krajina severovýchodních Čech
A review of sentiment analysis approaches in big
Purpose and the Universe
OSGi Blueprint Services
Proposed smart traffic signal control in brunei darussalam
Against Space
Proceeding of Brunei International Conference on Engineering and Technology 2012
What We (Don't) Know About the Beginning of the Universe
The Origin of the Universe and the Arrow of Time
Design Physics
Physics and Marketing
Ad

Similar to Self adaptive based natural language interface for disambiguation of (20)

PPTX
ESWC 2015 Closing and "General Chair's minute of Madness"
PDF
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
PDF
Using the Semantic Web Stack to Make Big Data Smarter
PDF
From Linked Data to Semantic Applications
PPTX
Knowledge Graph Introduction
PPTX
BrightTALK - Semantic AI
PPTX
Semantics and Machine Learning
PDF
Web-scale semantic search
PPT
Where Does It Break?
PDF
Overview of-semantic-technologies-and-ontologies
PPTX
Spatial Semantics for Better Interoperability and Analysis: Challenges and Ex...
PDF
Big Data and the Semantic Web: Challenges and Opportunities
PPT
Introduction to Semantic Web for GIS Practitioners
PPTX
Using Knowledge Graph for Promoting Cognitive Computing
PDF
Semantic Interoperability - grafi della conoscenza
PDF
Ontology Engineering Synthesis Lectures on Data Semantics and Knowledge 1st ...
PPTX
Introduction to Big data
PPTX
The Web of Data: do we actually understand what we built?
PDF
The Future of Semantics on the Web
PPT
Apache Stanbol 
and the Web of Data - ApacheCon 2011
ESWC 2015 Closing and "General Chair's minute of Madness"
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Using the Semantic Web Stack to Make Big Data Smarter
From Linked Data to Semantic Applications
Knowledge Graph Introduction
BrightTALK - Semantic AI
Semantics and Machine Learning
Web-scale semantic search
Where Does It Break?
Overview of-semantic-technologies-and-ontologies
Spatial Semantics for Better Interoperability and Analysis: Challenges and Ex...
Big Data and the Semantic Web: Challenges and Opportunities
Introduction to Semantic Web for GIS Practitioners
Using Knowledge Graph for Promoting Cognitive Computing
Semantic Interoperability - grafi della conoscenza
Ontology Engineering Synthesis Lectures on Data Semantics and Knowledge 1st ...
Introduction to Big data
The Web of Data: do we actually understand what we built?
The Future of Semantics on the Web
Apache Stanbol 
and the Web of Data - ApacheCon 2011

More from Nurfadhlina Mohd Sharef (20)

PPTX
Regenerating learning experience with AI
PPTX
Enhancing academic productivity using Gen AI
PPTX
ChatGPT in Teaching and Learning
PPTX
Struggle to success: How generative ai can transform your university experience?
PPTX
Ada Apa Dengan ChatGPT
PPTX
Teaching with ChatGPT-Practical Tips and Strategies
PPTX
Artificial Intelligence in Education
PPTX
Usage of ChatGPT at Higher Learning Institutions.pptx
PDF
ChatGPT: Friend or Foe?
PDF
Meaningful Online Learning Experience
PDF
Online Instructional Design
PDF
Introduction to eLearning at Universiti Putra Malaysia
PPTX
Data raya dan kecerdasan buatan
PDF
CIKGUAIBOT: A CHATBOT TO TEACH ARTIFICIAL INTELLIGENCE IN MALAY
PDF
Learning analytics based intelligent simulator for personalised learning slide
PDF
ICADEIS 2020 keynote
PDF
Basketball players performance analytic as experiential learning approach
PPTX
Enhancing Multi-Aspect Collaborative Filtering for Personalized Recommendation
PPTX
Aspect Extraction Performance With Common Pattern of Dependency Relation in ...
PPTX
Temporal Relations Mining Approach to Improve Dengue Outbreak and Intrusion T...
Regenerating learning experience with AI
Enhancing academic productivity using Gen AI
ChatGPT in Teaching and Learning
Struggle to success: How generative ai can transform your university experience?
Ada Apa Dengan ChatGPT
Teaching with ChatGPT-Practical Tips and Strategies
Artificial Intelligence in Education
Usage of ChatGPT at Higher Learning Institutions.pptx
ChatGPT: Friend or Foe?
Meaningful Online Learning Experience
Online Instructional Design
Introduction to eLearning at Universiti Putra Malaysia
Data raya dan kecerdasan buatan
CIKGUAIBOT: A CHATBOT TO TEACH ARTIFICIAL INTELLIGENCE IN MALAY
Learning analytics based intelligent simulator for personalised learning slide
ICADEIS 2020 keynote
Basketball players performance analytic as experiential learning approach
Enhancing Multi-Aspect Collaborative Filtering for Personalized Recommendation
Aspect Extraction Performance With Common Pattern of Dependency Relation in ...
Temporal Relations Mining Approach to Improve Dengue Outbreak and Intrusion T...

Recently uploaded (20)

PPTX
Internet___Basics___Styled_ presentation
PPTX
Funds Management Learning Material for Beg
PPTX
PptxGenJS_Demo_Chart_20250317130215833.pptx
PPTX
June-4-Sermon-Powerpoint.pptx USE THIS FOR YOUR MOTIVATION
PDF
The Internet -By the Numbers, Sri Lanka Edition
PPTX
Job_Card_System_Styled_lorem_ipsum_.pptx
PPTX
Slides PPTX World Game (s) Eco Economic Epochs.pptx
DOCX
Unit-3 cyber security network security of internet system
PDF
Decoding a Decade: 10 Years of Applied CTI Discipline
PDF
Triggering QUIC, presented by Geoff Huston at IETF 123
PPT
tcp ip networks nd ip layering assotred slides
PPTX
Introuction about WHO-FIC in ICD-10.pptx
PDF
The New Creative Director: How AI Tools for Social Media Content Creation Are...
PPTX
introduction about ICD -10 & ICD-11 ppt.pptx
PDF
Cloud-Scale Log Monitoring _ Datadog.pdf
PPTX
Module 1 - Cyber Law and Ethics 101.pptx
PDF
Slides PDF The World Game (s) Eco Economic Epochs.pdf
PPTX
Introduction about ICD -10 and ICD11 on 5.8.25.pptx
PDF
Testing WebRTC applications at scale.pdf
PPTX
Introuction about ICD -10 and ICD-11 PPT.pptx
Internet___Basics___Styled_ presentation
Funds Management Learning Material for Beg
PptxGenJS_Demo_Chart_20250317130215833.pptx
June-4-Sermon-Powerpoint.pptx USE THIS FOR YOUR MOTIVATION
The Internet -By the Numbers, Sri Lanka Edition
Job_Card_System_Styled_lorem_ipsum_.pptx
Slides PPTX World Game (s) Eco Economic Epochs.pptx
Unit-3 cyber security network security of internet system
Decoding a Decade: 10 Years of Applied CTI Discipline
Triggering QUIC, presented by Geoff Huston at IETF 123
tcp ip networks nd ip layering assotred slides
Introuction about WHO-FIC in ICD-10.pptx
The New Creative Director: How AI Tools for Social Media Content Creation Are...
introduction about ICD -10 & ICD-11 ppt.pptx
Cloud-Scale Log Monitoring _ Datadog.pdf
Module 1 - Cyber Law and Ethics 101.pptx
Slides PDF The World Game (s) Eco Economic Epochs.pdf
Introduction about ICD -10 and ICD11 on 5.8.25.pptx
Testing WebRTC applications at scale.pdf
Introuction about ICD -10 and ICD-11 PPT.pptx

Self adaptive based natural language interface for disambiguation of

  • 1. Self-Adaptive Based Natural Language Interface for Disambiguation of Semantic Search NURFADHL INA MOHD SHARE F NURFADHL INA@UPM.EDU.MY MOHAMMAD YAS S ER SHAFAZAND 79. ZAND@GMAI L .COM FACULT Y OF COMPUT ER SCI ENCE AND INFORMAT ION T ECHNOLOGY, UNIVERS I T I PUTRA MALAYS IA S ERDANG, S E LANGOR, MALAYS IA
  • 2. "Big Data" refers to data sets whose size is beyond the ability of typical database software tools to capture, store manage and analyze (McKinsey). “Linked Data” stands for semantically well structured, interconnected, syntactically interoperable datasets that are distributed among several repositories either inside or outside organisations http://www.semantic-web.at/big-data-linked-data
  • 3. Utilizing Linked Data and Big Data for organisational and enterprise purposes will be one of the next big challenges in the evolution of the web. Big Data takes account of the fact that new techniques and technologies are needed for the sustainable and socially balanced exploitation of huge data pools. The Linked Data paradigm is one approach to cope with Big Data, as it advances the hypertext principle from a web of documents to a web of rich data.
  • 5. Semantic Web: a webby way to link data Open Data meets the Semantic Web: Linked Open Data http://www.semantic-web-journal.net/system/files/swj488.pdf
  • 6. One of the key challenges in making use of Big Data lies in finding ways of dealing with heterogeneity, diversity, and complexity of the data, while its volume and velocity forbid solutions available for smaller datasets as based, e.g., on manual curation or manual integration of data. Semantic Web Technologies are meant to deal with these issues, and indeed since the advent of Linked Data a few years ago, they have become central to mainstream Semantic Web research and development. We can easily understand Linked Data as being a part of the greater Big Data landscape, as many of the challenges are the same. The linking component of Linked Data, however, puts an additional focus on the integration and conflation of data across multiple sources.
  • 7. Volume Velocity Variety BIG DATA Value and Veracity Supercomputing Internet of Things Semantic Web Social Science
  • 8. Smart Data Smart data makes sense out of Big data http://amitsheth.blogspot.com/2013/06/transforming-big-data-into-smart-data.html It provides value from harnessing the challenges posed by volume, velocity, variety and veracity of big data, in-turn providing actionable information and improve decision making. uses background knowledge, experiences, advanced and contextualized reasoning, and is often highly personalized focused on the actionable value in data creation, processing and consumption phases for improving the human experience
  • 9. 5 steps to Turn Big Data into Smart Data http://tdwi.org/Articles/2014/07/15/Turning-Big-Data-into-Smart-Data-2.aspx?Page=1 1. Add meaning 2. Add context 3. Embrace Graphs 4. Iterate 5. Adopt standard
  • 10. Natural Language Query Generated SPARQL What is the lowest point in kansas? SELECT ?c0 WHERE { ?c0 ?p0 ?i0 . ?c0 a geo:LoPoint . filter (?i0 = geo:kansas) . filter ( ?p0 = geo:isLowestPointOf ) . } What is the area of idaho? SELECT ?i0 WHERE { ?c0 ?p0 ?i0 . filter (?c0 = geo:idaho) . filter ( ?p0 = geo:stateArea ) . } what states border oklahoma? SELECT ?i0 WHERE { ?c0 ?p0 ?i0 . ?i0 a geo:State . filter (?c0 = geo:oklahoma) . filter ( ?p0 = geo:borders ) . } what is the population of oregon? SELECT ?i0 WHERE { ?c0 ?p0 ?i0 . filter (?c0 = geo:oregon) . filter ( ?p0 = geo:statePopulation ) . }
  • 11. Ambiguities in Querying Big Data when there are more than one possible concept annotation for a word in the NL input when a word inside the NL input cannot be matched with any KB concept when constructing the SPARQL where there is more than one possibility of SPARQL pattern
  • 12. Self Adaptive Model for Semantic Data Search in Big Data
  • 13. Input: NL query Output: Answer Process: 1. Load ontology and build a matrix of the object properties, classes and instances and its connections 2. Let T as the tokenized and stemmed NL query 3. For each tT, let A be the set of annotation based on relevant concepts 4. For each aA a. Create and add possible triplets, filters and options statements using dictionary and reasoner (using bottom up reasoning rules) b. Create new SPARQL syntax using (4(a)) c. Run SPARQL and send statements and results to reasoner. 5. Return last created SPARQL syntax which has results.
  • 14. Results The SANLI is tested on two different datasets namely the Mooney’s Geography ontology and a Quran structure ontology. SANLI is able to correctly answer all questions in the geography ontology where the questions have <s, p, o>, <o, p, s>, <p, o>, <o, p> and <o > patterns identified. Rules for other patterns have not yet been implemented. For example <o, p, o> patterns mostly result in a true false result as in “Does Texas border Oklahoma?” which we have not implemented yet.
  • 15. Conclusion The Semantic Web can leverage the sophisticated analytics with big data. Big Data and Linked Data will be an integral part of the future web infrastructure, where massive amounts of data are available, connected and identifiable via Uniform Resource Identifiers. More personalized-based applications to exploit smart data to its maximum potential