SlideShare a Scribd company logo
Mail us : info@ocularsystems.in
Mobile No : 7385350430
Keyword Query Routing
Problem Defination :
Existing work on keyword search relies on an element-level model (data graphs) to
compute keyword query results.Elements mentioning keywords are retrieved from this model
and paths between them are explored to compute Steiner graphs. KRG (keyword Element
Relationship Graph) captures relationships at the keyword level.Relationships captured by a
KRG are not direct edges between tuples but stand for paths between keywords.
We propose to route keywords only to relevant sources to reduce the high cost of
processing keyword search queries over all sources. A multilevel scoring mechanism is proposed
for computing the relevance of routing plans based on scores at the level of keywords, data
elements,element sets, and subgraphs that connect these elements.
1. Introduction :
The web is no longer only a collection of textual documents but also a web of interlinked
data sources.Through this project, a large amount of legacy data have been transformed to RDF,
linked with other sources, and published as Linked Data. It is difficult for the typical web users
to exploit this web data by means of structured queries using languages like SQL or SPARQL.
To this end, keyword search has proven to be intuitive. As opposed to structured queries, no
knowledge of the query language, the schema or the underlying data are needed. In database
research, solutions have been proposed, which given a keyword query, retrieve the most relevant
structured results or simply, select the single most relevant databases. However, these
approaches are single-source solutions The goal is to produce routing plans, which can be used
to compute results from multiple sources.
2. Objectives and scope :
We propose to investigate the problem of keyword query routing for keyword search over
a large number of structured and Linked Data sources. Routing keywords only to relevant
sources can reduce the high cost of searching for structured results that span multiple sources.We
Mail us : info@ocularsystems.in
Mobile No : 7385350430
show routing greatly helps to improve the performance of keyword search, without
compromising its result quality.
3. Methodology :
We aim to identify data sources that contain results to a keyword query. In the Linked
Data scenario, results might combine data from several sources,
1. Keyword Routing Plan - The problem of keyword query routing is to find the top keyword
routing plans based on their relevance to a query. A relevant plan should correspond to the
information need as intended by the user.
2.Multilevel Inter-Relationship Graph - We illustrate the search space of keyword query
routing using a multilevel inter-relationship graph. At the lowest level individual data elements,
and a set-level data graph, which captures information about group of elements.
4. Algorithm :
5. Future scope and further enhancement :
Mail us : info@ocularsystems.in
Mobile No : 7385350430
In combination with the proposed ranking, valid plans (precision@1 = 0.92) that are highly
relevant (mean reciprocal rank = 0.86) could be computed in 1 s on average. Further, we show
that when routing is applied to an existing keyword search system to prune sources, substantial
performance gain can be achieved.
6. Conclusion :
Routing can be seen as a promising alternative paradigm especially for cases, where the
information need is well described and available as a large amount of texts.We have presented a
solution to the novel problem of keyword query routing. Based on modeling the search space as
a multilevel inter-relationship graph, we proposed a summary model that groups keyword and
element relationships at the level of sets, and developed a multilevel ranking scheme to
incorporate relevance atdifferent dimensions.The experiments showed that the summary model
compactly preserves relevant information.
7. Bibliography :
[1] V. Hristidis, L. Gravano, and Y. Papakonstantinou, “Efficient IR-Style Keyword Search over
Relational Databases,” Proc. 29th Int’l Conf. Very Large Data Bases (VLDB), pp. 850-861,2003.
[2]M. Sayyadian, H. LeKhac, A. Doan, and L. Gravano, “Efficient Keyword Search Across
Heterogeneous Relational Databases,” Proc. IEEE 23rd Int’l Conf. Data Eng. (ICDE), pp. 346-
355, 2007
[3] G. Ladwig and T. Tran, “Index Structures and Top-K Join Algorithms for Native Keyword
Search Databases,” Proc. 20th ACM Int’l Conf. Information and Knowledge Management
(CIKM),pp. 1505-1514, 2011.
[4] H. He, H. Wang, J. Yang, and P.S. Yu, “Blinks: Ranked Keyword Searches on Graphs,” Proc.
ACM SIGMOD Conf., pp. 305-316,2007.

More Related Content

DOCX
keyword query routing
DOCX
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
DOCX
JPJ1423 Keyword Query Routing
DOC
Keyword query routing
PDF
Approaches for Keyword Query Routing
PDF
At33264269
DOC
Keyword query routing
PDF
Volume 2-issue-6-2016-2020
keyword query routing
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
JPJ1423 Keyword Query Routing
Keyword query routing
Approaches for Keyword Query Routing
At33264269
Keyword query routing
Volume 2-issue-6-2016-2020

What's hot (16)

PDF
Using Page Size for Controlling Duplicate Query Results in Semantic Web
PDF
Computing semantic similarity measure between words using web search engine
PDF
A survey on Design and Implementation of Clever Crawler Based On DUST Removal
PDF
Nearest keyword set search in multi dimensional datasets
PDF
G5234552
PDF
WEB PAGE RANKING BASED ON TEXT SUBSTANCE OF LINKED PAGES
PDF
An Advanced IR System of Relational Keyword Search Technique
PDF
Approximating Source Accuracy Using Dublicate Records in Da-ta Integration
PDF
Pdd crawler a focused web
PPT
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
PDF
Az31349353
PDF
IRJET- Review on Information Retrieval for Desktop Search Engine
PDF
Efficiently searching nearest neighbor in documents using keywords
PDF
Efficiently searching nearest neighbor in documents
Using Page Size for Controlling Duplicate Query Results in Semantic Web
Computing semantic similarity measure between words using web search engine
A survey on Design and Implementation of Clever Crawler Based On DUST Removal
Nearest keyword set search in multi dimensional datasets
G5234552
WEB PAGE RANKING BASED ON TEXT SUBSTANCE OF LINKED PAGES
An Advanced IR System of Relational Keyword Search Technique
Approximating Source Accuracy Using Dublicate Records in Da-ta Integration
Pdd crawler a focused web
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
Az31349353
IRJET- Review on Information Retrieval for Desktop Search Engine
Efficiently searching nearest neighbor in documents using keywords
Efficiently searching nearest neighbor in documents
Ad

Similar to Keyword Query Routing (20)

DOCX
2014 IEEE JAVA DATA MINING PROJECT Keyword query routing
DOCX
Keyword query routing
PDF
Survey on scalable continual top k keyword search in
PDF
Survey on scalable continual top k keyword search in relational databases
PPTX
Presentation
PDF
Best Keyword Cover Search
PDF
An empirical performance evaluation of relational keyword search systems
PDF
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
PDF
Enhancing keyword search over relational databases using ontologies
PDF
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
PPTX
Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-S...
PDF
EFFICIENT SCHEMA BASED KEYWORD SEARCH IN RELATIONAL DATABASES
DOC
Efficient instant fuzzy search with proximity ranking
PDF
ROMAN URDU OPINION MINING SYSTEM (RUOMIS)
PDF
Hybrid geo textual index structure
DOC
IEEE 2014 JAVA DATA MINING PROJECTS Fast nearest neighbor search with keywords
PPT
Effective XML Keyword Search with Relevance Oriented Ranking
DOCX
JAVA 2013 IEEE DATAMINING PROJECT Fast nearest neighbor search with keywords
DOCX
Fast nearest neighbor search with keywords
2014 IEEE JAVA DATA MINING PROJECT Keyword query routing
Keyword query routing
Survey on scalable continual top k keyword search in
Survey on scalable continual top k keyword search in relational databases
Presentation
Best Keyword Cover Search
An empirical performance evaluation of relational keyword search systems
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
Enhancing keyword search over relational databases using ontologies
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-S...
EFFICIENT SCHEMA BASED KEYWORD SEARCH IN RELATIONAL DATABASES
Efficient instant fuzzy search with proximity ranking
ROMAN URDU OPINION MINING SYSTEM (RUOMIS)
Hybrid geo textual index structure
IEEE 2014 JAVA DATA MINING PROJECTS Fast nearest neighbor search with keywords
Effective XML Keyword Search with Relevance Oriented Ranking
JAVA 2013 IEEE DATAMINING PROJECT Fast nearest neighbor search with keywords
Fast nearest neighbor search with keywords
Ad

More from SWAMI06 (11)

DOCX
Secure Distibuted data discovery & dissemination IN WSN
PDF
ns2-project-list
DOCX
Heart disease prediction system
DOC
Detection of Spyware by Mining Executable Files
PPTX
Annotating Search Results from Web Databases
PPTX
Multimedia Answer Generation for Community Question Answering
DOCX
A Hybrid Cloud Approach for Secure Authorized Deduplication
PPTX
Efficient Instant-Fuzzy Search With Proximity Ranking
PDF
Opinion Mining & Sentiment Analysis Based on Natural Language Processing
PPTX
A Segmentation based Sequential Pattern Matching for Efficient Video Copy De...
PPTX
Frequent itemset mining_on_hadoop
Secure Distibuted data discovery & dissemination IN WSN
ns2-project-list
Heart disease prediction system
Detection of Spyware by Mining Executable Files
Annotating Search Results from Web Databases
Multimedia Answer Generation for Community Question Answering
A Hybrid Cloud Approach for Secure Authorized Deduplication
Efficient Instant-Fuzzy Search With Proximity Ranking
Opinion Mining & Sentiment Analysis Based on Natural Language Processing
A Segmentation based Sequential Pattern Matching for Efficient Video Copy De...
Frequent itemset mining_on_hadoop

Recently uploaded (20)

PPTX
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PPTX
CH1 Production IntroductoryConcepts.pptx
PPTX
web development for engineering and engineering
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
PPTX
Lecture Notes Electrical Wiring System Components
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PPT
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PPT
Mechanical Engineering MATERIALS Selection
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
DOCX
573137875-Attendance-Management-System-original
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PDF
Digital Logic Computer Design lecture notes
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
R24 SURVEYING LAB MANUAL for civil enggi
CH1 Production IntroductoryConcepts.pptx
web development for engineering and engineering
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
Automation-in-Manufacturing-Chapter-Introduction.pdf
Lecture Notes Electrical Wiring System Components
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
Mechanical Engineering MATERIALS Selection
Embodied AI: Ushering in the Next Era of Intelligent Systems
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
573137875-Attendance-Management-System-original
Model Code of Practice - Construction Work - 21102022 .pdf
Operating System & Kernel Study Guide-1 - converted.pdf
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
Digital Logic Computer Design lecture notes

Keyword Query Routing

  • 1. Mail us : info@ocularsystems.in Mobile No : 7385350430 Keyword Query Routing Problem Defination : Existing work on keyword search relies on an element-level model (data graphs) to compute keyword query results.Elements mentioning keywords are retrieved from this model and paths between them are explored to compute Steiner graphs. KRG (keyword Element Relationship Graph) captures relationships at the keyword level.Relationships captured by a KRG are not direct edges between tuples but stand for paths between keywords. We propose to route keywords only to relevant sources to reduce the high cost of processing keyword search queries over all sources. A multilevel scoring mechanism is proposed for computing the relevance of routing plans based on scores at the level of keywords, data elements,element sets, and subgraphs that connect these elements. 1. Introduction : The web is no longer only a collection of textual documents but also a web of interlinked data sources.Through this project, a large amount of legacy data have been transformed to RDF, linked with other sources, and published as Linked Data. It is difficult for the typical web users to exploit this web data by means of structured queries using languages like SQL or SPARQL. To this end, keyword search has proven to be intuitive. As opposed to structured queries, no knowledge of the query language, the schema or the underlying data are needed. In database research, solutions have been proposed, which given a keyword query, retrieve the most relevant structured results or simply, select the single most relevant databases. However, these approaches are single-source solutions The goal is to produce routing plans, which can be used to compute results from multiple sources. 2. Objectives and scope : We propose to investigate the problem of keyword query routing for keyword search over a large number of structured and Linked Data sources. Routing keywords only to relevant sources can reduce the high cost of searching for structured results that span multiple sources.We
  • 2. Mail us : info@ocularsystems.in Mobile No : 7385350430 show routing greatly helps to improve the performance of keyword search, without compromising its result quality. 3. Methodology : We aim to identify data sources that contain results to a keyword query. In the Linked Data scenario, results might combine data from several sources, 1. Keyword Routing Plan - The problem of keyword query routing is to find the top keyword routing plans based on their relevance to a query. A relevant plan should correspond to the information need as intended by the user. 2.Multilevel Inter-Relationship Graph - We illustrate the search space of keyword query routing using a multilevel inter-relationship graph. At the lowest level individual data elements, and a set-level data graph, which captures information about group of elements. 4. Algorithm : 5. Future scope and further enhancement :
  • 3. Mail us : info@ocularsystems.in Mobile No : 7385350430 In combination with the proposed ranking, valid plans (precision@1 = 0.92) that are highly relevant (mean reciprocal rank = 0.86) could be computed in 1 s on average. Further, we show that when routing is applied to an existing keyword search system to prune sources, substantial performance gain can be achieved. 6. Conclusion : Routing can be seen as a promising alternative paradigm especially for cases, where the information need is well described and available as a large amount of texts.We have presented a solution to the novel problem of keyword query routing. Based on modeling the search space as a multilevel inter-relationship graph, we proposed a summary model that groups keyword and element relationships at the level of sets, and developed a multilevel ranking scheme to incorporate relevance atdifferent dimensions.The experiments showed that the summary model compactly preserves relevant information. 7. Bibliography : [1] V. Hristidis, L. Gravano, and Y. Papakonstantinou, “Efficient IR-Style Keyword Search over Relational Databases,” Proc. 29th Int’l Conf. Very Large Data Bases (VLDB), pp. 850-861,2003. [2]M. Sayyadian, H. LeKhac, A. Doan, and L. Gravano, “Efficient Keyword Search Across Heterogeneous Relational Databases,” Proc. IEEE 23rd Int’l Conf. Data Eng. (ICDE), pp. 346- 355, 2007 [3] G. Ladwig and T. Tran, “Index Structures and Top-K Join Algorithms for Native Keyword Search Databases,” Proc. 20th ACM Int’l Conf. Information and Knowledge Management (CIKM),pp. 1505-1514, 2011. [4] H. He, H. Wang, J. Yang, and P.S. Yu, “Blinks: Ranked Keyword Searches on Graphs,” Proc. ACM SIGMOD Conf., pp. 305-316,2007.