GLOBALSOFT TECHNOLOGIES 
IEEE PROJECTS & SOFTWARE DEVELOPMENTS 
IEEE FINAL YEAR PROJECTS|IEEE ENGINEERING PROJECTS|IEEE STUDENTS PROJECTS|IEEE 
BULK PROJECTS|BE/BTECH/ME/MTECH/MS/MCA PROJECTS|CSE/IT/ECE/EEE PROJECTS 
CELL: +91 98495 39085, +91 99662 35788, +91 98495 57908, +91 97014 40401 
Visit: www.finalyearprojects.org Mail to:ieeefinalsemprojects@gmail.com 
Keyword Query Routing 
ABSTRACT: 
Keyword search is an intuitive paradigm for searching linked data sources on the 
web. We propose to route keywords only to relevant sources to reduce the high 
cost of processing keyword search queries over all sources. We propose a novel 
method for computing top-k routing plans based on their potentials to contain 
results for a given keyword query. We employ a keyword-element relationship 
summary that compactly represents relationships between keywords and the data 
elements mentioning them. A multilevel scoring mechanism is proposed for 
computing the relevance of routing plans based on scores at the level of keywords, 
data elements, element sets, and subgraphs that connect these elements. 
Experiments carried out using 150 publicly available sources on the web showed 
that valid plans (precision@1 of 0.92) that are highly relevant (mean reciprocal 
rank of 0.89) can be computed in 1 second on average on a single PC. Further, we 
show routing greatly helps to improve the performance of keyword search, without 
compromising its result quality.
AIM: 
Linked data describes a method of publishing structured data so that it can be 
interlinked and become more useful. Keyword search is an intuitive paradigm for 
searching linked data sources on the web. We propose to route keywords only to 
relevant sources to reduce the high cost of processing keyword search queries over 
all sources. In this we have implement TOP K-Routing plan based on their 
potentials to contain results for a given keyword query. 
SYNOPSIS: 
In recent years the Web has evolved from a global information space of linked 
documents to one where both documents and data are linked. Underpinning this 
evolution is a set of best practices for publishing and connecting structured data on 
the Web known as Linked Data. The adoption of the Linked Data best practices 
has lead to the extension of the Web with a global data space connecting data from 
diverse domains such as people, companies, books, scientific publications, films, 
music, television and radio programmes, genes, proteins, drugs and clinical trials, 
online communities, statistical and scientific data, and reviews. This Web of Data 
enables new types of applications. There are generic Linked Data browsers which 
allow users to start browsing in one data source and then navigate along links into 
related data sources. There are Linked Data search engines that crawl the Web of 
Data by following links between data sources and provide expressive query 
capabilities over aggregated data, similar to how a local database is queried today. 
The Web of Data also opens up new possibilities for domain-specific applications. 
Unlike Web 2.0 mashups which work against a fixed set of data sources, Linked
Data applications operate on top of an unbound, global data space. This enables 
them to deliver more complete answers as new data sources appear on the Web. 
We propose to investigate the problem of keyword query routing for 
keyword search over a large number of structured and Linked Data sources. 
Routing keywords only to relevant sources can reduce the high cost of searching 
for structured results that span multiple sources. To the best of our knowledge, the 
work presented in this paper represents the first attempt to address this problem. 
We use a graph-based data model to characterize individual data sources. In 
that model, we distinguish between an element-level data graph representing 
relationships between individual data elements, and a set-level data graph, which 
captures information about group of elements. This set-level graph essentially 
captures a part of the Linked Data schema on the web that is represented in RDFS, 
i.e., relations between classes. Often, a schema might be incomplete or simply does 
not exist for RDF data on the web. In such a case, a pseudoschema can be obtained 
by computing a structural summary such as a dataguide. 
EXISTING SYSTEM: 
Existing work can be categorized into two main categories: 
 schema-based approaches 
 Schema-agnostic approaches 
There are schema-based approaches implemented on top of off-the-shelf 
databases. A keyword query is processed by mapping keywords to elements of the 
database (called keyword elements). Then, using the schema, valid join sequences
are derived, which are then employed to join (“connect”) the computed keyword 
elements to form so called candidate networks representing possible results to the 
keyword query. 
Schema-agnostic approaches operate directly on the data. Structured results 
are computed by exploring the underlying data graph. The goal is to find structures 
in the data called Steiner trees (Steiner graphs in general), which connect keyword 
elements. Various kinds of algorithms have been proposed for the efficient 
exploration of keyword search results over data graphs, which might be very large. 
Examples are bidirectional search and dynamic programming 
Existing work on keyword search relies on an element-level model (i.e., data 
graphs) to compute keyword query results. 
DISADVANTAGES OF EXISTING SYSTEM: 
 The number of potential results may increase exponentially with the 
number of sources and links between them. Yet, most of the results 
may be not necessary especially when they are not relevant to the 
user. 
 The routing problem, we need to compute results capturing specific 
elements at the data level. 
 Routing keywords return all the source which may or may not be the 
relevant sources 
PROPOSED SYSTEM:
We propose to route keywords only to relevant sources to reduce the high cost of 
processing keyword search queries over all sources. We propose a novel method 
for computing top-k routing plans based on their potentials to contain results for a 
given keyword query. We employ a keyword-element relationship summary that 
compactly represents relationships between keywords and the data elements 
mentioning them. A multilevel scoring mechanism is proposed for computing the 
relevance of routing plans based on scores at the level of keywords, data elements, 
element sets, and subgraphs that connect these elements. We propose to investigate 
the problem of keyword query routing for keyword search over a large number of 
structured and Linked Data sources. 
ADVANTAGES OF PROPOSED SYSTEM: 
 Routing keywords only to relevant sources can reduce the high cost of 
searching for structured results that span multiple sources. 
 The routing plans, produced can be used to compute results from multiple 
sources.
SYSTEM ARCHITECTURE: 
MODULES: 
 Linked Data Generation 
 Key level Mapping 
 Multilevel Inter relationship
 Routing Plan 
MODULES DESCRIPTION: 
Linked Data Generation 
The GeoNames Services makes it possible to add geospatial semantic information 
to the Word Wide Web. All over 6.2 million geonames toponyms now have a 
unique URL with a corresponding XML web service. In this we have used Country 
Info , Time zone and Finance Info services. This model resembles RDF data where 
entities stand for some RDF resources, data values stand for RDF literals, and 
relations and attributes correspond to RDF triples. While it is primarily used to 
model RDF Linked Data on the web, such a graph model is sufficiently general to 
capture XML and relational data. 
Key level Mapping 
The set-level graph essentially captures a part of the Linked Data schema on the 
web that is represented in RDFS, i.e., relations between classes. Often, a schema 
might be incomplete or simply does not exist for RDF data on the web. In such a 
case, a pseudoschema can be obtained by computing a structural summary such as 
a data guide. A set-level data graph can be derived from a given schema or a 
generated pseudoschema. The web of data is modeled as a web graph where GA is 
the set of all data graphs, N is the set of all nodes, E is the set of all “internal” 
edges that connect elements within a particular source. 
Multilevel Inter relationship
The search space of keyword query routing using a multilevel inter-relationship 
graph. The inter-relationships between elements at different levels keyword is 
mentioned in some entity descriptions at the element level. Entities at the element 
level are associated with a set-level element via type. A set-level element is 
contained in a source. There is an edge between two keywords if two elements at 
the element level mentioning these keywords are connected via a path. We propose 
a ranking scheme that deals with relevance at many levels. 
Routing Plan: 
Given the web graph W =(G,N,E) and a keyword query K, the mapping: K-2G that 
associates a query with a set of data graphs is called a keyword routing plan RP. A 
plan RP is considered valid w.r.t. K when the union set of its data graphs contains a 
result for K. The problem of keyword query routing is to find the top-k keyword 
routing plans based on their relevance to a query. A relevant plan should 
correspond to the information need as intended by the user. 
SYSTEM REQUIREMENTS: 
HARDWARE REQUIREMENTS: 
 System : Pentium IV 2.4 GHz. 
 Hard Disk : 40 GB. 
 Floppy Drive : 1.44 Mb. 
 Monitor : 15 VGA Colour. 
 Mouse : Logitech.
 Ram : 512 Mb. 
SOFTWARE REQUIREMENTS: 
 Operating system : Windows XP/7. 
 Coding Language : JAVA/J2EE 
 IDE : Netbeans 7.4 
 Database : MYSQL 
REFERENCE: 
Thanh Tran and Lei Zhang, “Keyword Query Routing”. IEEE TRANSACTIONS 
ON KNOWLEDGE AND DATA ENGINEERING, VOL. 26, NO. 2, FEBRUARY 
2014

More Related Content

DOCX
JPJ1423 Keyword Query Routing
DOCX
keyword query routing
DOCX
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
DOCX
Keyword Query Routing
DOCX
Keyword query routing
DOC
Keyword query routing
PDF
Approaches for Keyword Query Routing
DOC
Keyword query routing
JPJ1423 Keyword Query Routing
keyword query routing
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
Keyword Query Routing
Keyword query routing
Keyword query routing
Approaches for Keyword Query Routing
Keyword query routing

What's hot (16)

PDF
At33264269
PDF
WEB PAGE RANKING BASED ON TEXT SUBSTANCE OF LINKED PAGES
PDF
G5234552
PDF
Volume 2-issue-6-2016-2020
PDF
Computing semantic similarity measure between words using web search engine
PDF
PERFORMANCE EVALUATION OF SOCIAL NETWORK ANALYSIS ALGORITHMS USING DISTRIBUTE...
PDF
A survey on Design and Implementation of Clever Crawler Based On DUST Removal
PDF
Smart Crawler for Efficient Deep-Web Harvesting
PDF
ANALYSIS OF RESEARCH ISSUES IN WEB DATA MINING
PDF
Semantic Search Over The Web
PDF
An adaptive clustering and classification algorithm for Twitter data streamin...
PDF
Pagerank and hits
PPTX
Social Network Analysis with Spark
PDF
Implemenation of Enhancing Information Retrieval Using Integration of Invisib...
PDF
Sree saranya
PDF
Ak4301197200
At33264269
WEB PAGE RANKING BASED ON TEXT SUBSTANCE OF LINKED PAGES
G5234552
Volume 2-issue-6-2016-2020
Computing semantic similarity measure between words using web search engine
PERFORMANCE EVALUATION OF SOCIAL NETWORK ANALYSIS ALGORITHMS USING DISTRIBUTE...
A survey on Design and Implementation of Clever Crawler Based On DUST Removal
Smart Crawler for Efficient Deep-Web Harvesting
ANALYSIS OF RESEARCH ISSUES IN WEB DATA MINING
Semantic Search Over The Web
An adaptive clustering and classification algorithm for Twitter data streamin...
Pagerank and hits
Social Network Analysis with Spark
Implemenation of Enhancing Information Retrieval Using Integration of Invisib...
Sree saranya
Ak4301197200
Ad

Similar to 2014 IEEE JAVA DATA MINING PROJECT Keyword query routing (20)

PDF
Az31349353
PDF
Survey on scalable continual top k keyword search in
PDF
Survey on scalable continual top k keyword search in relational databases
PDF
An Advanced IR System of Relational Keyword Search Technique
PDF
Enhancing keyword search over relational databases using ontologies
PDF
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
PDF
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
PDF
Efficiently searching nearest neighbor in documents
PPTX
Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-S...
PDF
An empirical performance evaluation of relational keyword search systems
PDF
Efficiently searching nearest neighbor in documents using keywords
PDF
20110728 datalift-rpi-troy
PPT
Facets and Pivoting for Flexible and Usable Linked Data Exploration
PDF
Ngdm09 han gao
PPTX
Semantic Web and Related Work at W3C
PDF
EFFICIENT SCHEMA BASED KEYWORD SEARCH IN RELATIONAL DATABASES
PDF
EDF2012: The Web of Data and its Five Stars
PDF
Integrating content search with structure analysis for hypermedia retrieval a...
PDF
On demand access to Big Data through Semantic Technologies
Az31349353
Survey on scalable continual top k keyword search in
Survey on scalable continual top k keyword search in relational databases
An Advanced IR System of Relational Keyword Search Technique
Enhancing keyword search over relational databases using ontologies
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
Efficiently searching nearest neighbor in documents
Top-k Exploration of Query Candidates for Efficient Keyword Search on Graph-S...
An empirical performance evaluation of relational keyword search systems
Efficiently searching nearest neighbor in documents using keywords
20110728 datalift-rpi-troy
Facets and Pivoting for Flexible and Usable Linked Data Exploration
Ngdm09 han gao
Semantic Web and Related Work at W3C
EFFICIENT SCHEMA BASED KEYWORD SEARCH IN RELATIONAL DATABASES
EDF2012: The Web of Data and its Five Stars
Integrating content search with structure analysis for hypermedia retrieval a...
On demand access to Big Data through Semantic Technologies
Ad

More from IEEEMEMTECHSTUDENTSPROJECTS (20)

DOCX
2014 IEEE DOTNET DATA MINING PROJECT Web image re ranking using query-specifi...
DOCX
2014 IEEE DOTNET DATA MINING PROJECT Trusteddb a-trusted-hardware-based-datab...
DOCX
2014 IEEE DOTNET DATA MINING PROJECT Supporting privacy-protection-in-persona...
DOCX
2014 IEEE DOTNET DATA MINING PROJECT Similarity preserving snippet based visu...
DOCX
2014 IEEE DOTNET DATA MINING PROJECT Product aspect-ranking-and--its-applicat...
DOCX
2014 IEEE DOTNET DATA MINING PROJECT Mining statistically significant co loca...
DOCX
2014 IEEE DOTNET DATA MINING PROJECT Lars an-efficient-and-scalable-location-...
DOCX
2014 IEEE DOTNET DATA MINING PROJECT Data mining with big data
DOCX
2014 IEEE DOTNET DATA MINING PROJECT Converged architecture for broadcast and...
DOCX
2014 IEEE DOTNET DATA MINING PROJECT Anonymous query processing in road networks
DOCX
2014 IEEE DOTNET DATA MINING PROJECT Ai and opinion mining
DOCX
2014 IEEE DOTNET DATA MINING PROJECT A probabilistic approach to string trans...
DOCX
2014 IEEE DOTNET DATA MINING PROJECT A novel model for mining association rul...
DOC
2014 IEEE JAVA DATA MINING PROJECT Xs path navigation on xml schemas made easy
DOCX
2014 IEEE JAVA DATA MINING PROJECT Web image re ranking using query-specific ...
DOCX
2014 IEEE JAVA DATA MINING PROJECT Shortest path computing in relational dbms
DOCX
2014 IEEE JAVA DATA MINING PROJECT Security evaluation of pattern classifiers...
DOCX
2014 IEEE JAVA DATA MINING PROJECT Secure outsourced attribute based signatures
DOCX
2014 IEEE JAVA DATA MINING PROJECT Secure mining of association rules in hori...
DOCX
2014 IEEE JAVA DATA MINING PROJECT Searching dimension incomplete databases
2014 IEEE DOTNET DATA MINING PROJECT Web image re ranking using query-specifi...
2014 IEEE DOTNET DATA MINING PROJECT Trusteddb a-trusted-hardware-based-datab...
2014 IEEE DOTNET DATA MINING PROJECT Supporting privacy-protection-in-persona...
2014 IEEE DOTNET DATA MINING PROJECT Similarity preserving snippet based visu...
2014 IEEE DOTNET DATA MINING PROJECT Product aspect-ranking-and--its-applicat...
2014 IEEE DOTNET DATA MINING PROJECT Mining statistically significant co loca...
2014 IEEE DOTNET DATA MINING PROJECT Lars an-efficient-and-scalable-location-...
2014 IEEE DOTNET DATA MINING PROJECT Data mining with big data
2014 IEEE DOTNET DATA MINING PROJECT Converged architecture for broadcast and...
2014 IEEE DOTNET DATA MINING PROJECT Anonymous query processing in road networks
2014 IEEE DOTNET DATA MINING PROJECT Ai and opinion mining
2014 IEEE DOTNET DATA MINING PROJECT A probabilistic approach to string trans...
2014 IEEE DOTNET DATA MINING PROJECT A novel model for mining association rul...
2014 IEEE JAVA DATA MINING PROJECT Xs path navigation on xml schemas made easy
2014 IEEE JAVA DATA MINING PROJECT Web image re ranking using query-specific ...
2014 IEEE JAVA DATA MINING PROJECT Shortest path computing in relational dbms
2014 IEEE JAVA DATA MINING PROJECT Security evaluation of pattern classifiers...
2014 IEEE JAVA DATA MINING PROJECT Secure outsourced attribute based signatures
2014 IEEE JAVA DATA MINING PROJECT Secure mining of association rules in hori...
2014 IEEE JAVA DATA MINING PROJECT Searching dimension incomplete databases

Recently uploaded (20)

PPTX
Sorting and Hashing in Data Structures with Algorithms, Techniques, Implement...
PDF
MLpara ingenieira CIVIL, meca Y AMBIENTAL
PPTX
A Brief Introduction to IoT- Smart Objects: The "Things" in IoT
PPTX
Feature types and data preprocessing steps
PPTX
Module 8- Technological and Communication Skills.pptx
PDF
Java Basics-Introduction and program control
PPTX
CN_Unite_1 AI&DS ENGGERING SPPU PUNE UNIVERSITY
PPTX
mechattonicsand iotwith sensor and actuator
PPTX
Principal presentation for NAAC (1).pptx
PPTX
Graph Data Structures with Types, Traversals, Connectivity, and Real-Life App...
PPTX
CyberSecurity Mobile and Wireless Devices
PPT
Chapter 1 - Introduction to Manufacturing Technology_2.ppt
PDF
Cryptography and Network Security-Module-I.pdf
PDF
Unit I -OPERATING SYSTEMS_SRM_KATTANKULATHUR.pptx.pdf
PDF
UEFA_Embodied_Carbon_Emissions_Football_Infrastructure.pdf
PDF
distributed database system" (DDBS) is often used to refer to both the distri...
PPTX
AUTOMOTIVE ENGINE MANAGEMENT (MECHATRONICS).pptx
PPTX
ai_satellite_crop_management_20250815030350.pptx
PDF
Applications of Equal_Area_Criterion.pdf
PDF
August -2025_Top10 Read_Articles_ijait.pdf
Sorting and Hashing in Data Structures with Algorithms, Techniques, Implement...
MLpara ingenieira CIVIL, meca Y AMBIENTAL
A Brief Introduction to IoT- Smart Objects: The "Things" in IoT
Feature types and data preprocessing steps
Module 8- Technological and Communication Skills.pptx
Java Basics-Introduction and program control
CN_Unite_1 AI&DS ENGGERING SPPU PUNE UNIVERSITY
mechattonicsand iotwith sensor and actuator
Principal presentation for NAAC (1).pptx
Graph Data Structures with Types, Traversals, Connectivity, and Real-Life App...
CyberSecurity Mobile and Wireless Devices
Chapter 1 - Introduction to Manufacturing Technology_2.ppt
Cryptography and Network Security-Module-I.pdf
Unit I -OPERATING SYSTEMS_SRM_KATTANKULATHUR.pptx.pdf
UEFA_Embodied_Carbon_Emissions_Football_Infrastructure.pdf
distributed database system" (DDBS) is often used to refer to both the distri...
AUTOMOTIVE ENGINE MANAGEMENT (MECHATRONICS).pptx
ai_satellite_crop_management_20250815030350.pptx
Applications of Equal_Area_Criterion.pdf
August -2025_Top10 Read_Articles_ijait.pdf

2014 IEEE JAVA DATA MINING PROJECT Keyword query routing

  • 1. GLOBALSOFT TECHNOLOGIES IEEE PROJECTS & SOFTWARE DEVELOPMENTS IEEE FINAL YEAR PROJECTS|IEEE ENGINEERING PROJECTS|IEEE STUDENTS PROJECTS|IEEE BULK PROJECTS|BE/BTECH/ME/MTECH/MS/MCA PROJECTS|CSE/IT/ECE/EEE PROJECTS CELL: +91 98495 39085, +91 99662 35788, +91 98495 57908, +91 97014 40401 Visit: www.finalyearprojects.org Mail to:ieeefinalsemprojects@gmail.com Keyword Query Routing ABSTRACT: Keyword search is an intuitive paradigm for searching linked data sources on the web. We propose to route keywords only to relevant sources to reduce the high cost of processing keyword search queries over all sources. We propose a novel method for computing top-k routing plans based on their potentials to contain results for a given keyword query. We employ a keyword-element relationship summary that compactly represents relationships between keywords and the data elements mentioning them. A multilevel scoring mechanism is proposed for computing the relevance of routing plans based on scores at the level of keywords, data elements, element sets, and subgraphs that connect these elements. Experiments carried out using 150 publicly available sources on the web showed that valid plans (precision@1 of 0.92) that are highly relevant (mean reciprocal rank of 0.89) can be computed in 1 second on average on a single PC. Further, we show routing greatly helps to improve the performance of keyword search, without compromising its result quality.
  • 2. AIM: Linked data describes a method of publishing structured data so that it can be interlinked and become more useful. Keyword search is an intuitive paradigm for searching linked data sources on the web. We propose to route keywords only to relevant sources to reduce the high cost of processing keyword search queries over all sources. In this we have implement TOP K-Routing plan based on their potentials to contain results for a given keyword query. SYNOPSIS: In recent years the Web has evolved from a global information space of linked documents to one where both documents and data are linked. Underpinning this evolution is a set of best practices for publishing and connecting structured data on the Web known as Linked Data. The adoption of the Linked Data best practices has lead to the extension of the Web with a global data space connecting data from diverse domains such as people, companies, books, scientific publications, films, music, television and radio programmes, genes, proteins, drugs and clinical trials, online communities, statistical and scientific data, and reviews. This Web of Data enables new types of applications. There are generic Linked Data browsers which allow users to start browsing in one data source and then navigate along links into related data sources. There are Linked Data search engines that crawl the Web of Data by following links between data sources and provide expressive query capabilities over aggregated data, similar to how a local database is queried today. The Web of Data also opens up new possibilities for domain-specific applications. Unlike Web 2.0 mashups which work against a fixed set of data sources, Linked
  • 3. Data applications operate on top of an unbound, global data space. This enables them to deliver more complete answers as new data sources appear on the Web. We propose to investigate the problem of keyword query routing for keyword search over a large number of structured and Linked Data sources. Routing keywords only to relevant sources can reduce the high cost of searching for structured results that span multiple sources. To the best of our knowledge, the work presented in this paper represents the first attempt to address this problem. We use a graph-based data model to characterize individual data sources. In that model, we distinguish between an element-level data graph representing relationships between individual data elements, and a set-level data graph, which captures information about group of elements. This set-level graph essentially captures a part of the Linked Data schema on the web that is represented in RDFS, i.e., relations between classes. Often, a schema might be incomplete or simply does not exist for RDF data on the web. In such a case, a pseudoschema can be obtained by computing a structural summary such as a dataguide. EXISTING SYSTEM: Existing work can be categorized into two main categories:  schema-based approaches  Schema-agnostic approaches There are schema-based approaches implemented on top of off-the-shelf databases. A keyword query is processed by mapping keywords to elements of the database (called keyword elements). Then, using the schema, valid join sequences
  • 4. are derived, which are then employed to join (“connect”) the computed keyword elements to form so called candidate networks representing possible results to the keyword query. Schema-agnostic approaches operate directly on the data. Structured results are computed by exploring the underlying data graph. The goal is to find structures in the data called Steiner trees (Steiner graphs in general), which connect keyword elements. Various kinds of algorithms have been proposed for the efficient exploration of keyword search results over data graphs, which might be very large. Examples are bidirectional search and dynamic programming Existing work on keyword search relies on an element-level model (i.e., data graphs) to compute keyword query results. DISADVANTAGES OF EXISTING SYSTEM:  The number of potential results may increase exponentially with the number of sources and links between them. Yet, most of the results may be not necessary especially when they are not relevant to the user.  The routing problem, we need to compute results capturing specific elements at the data level.  Routing keywords return all the source which may or may not be the relevant sources PROPOSED SYSTEM:
  • 5. We propose to route keywords only to relevant sources to reduce the high cost of processing keyword search queries over all sources. We propose a novel method for computing top-k routing plans based on their potentials to contain results for a given keyword query. We employ a keyword-element relationship summary that compactly represents relationships between keywords and the data elements mentioning them. A multilevel scoring mechanism is proposed for computing the relevance of routing plans based on scores at the level of keywords, data elements, element sets, and subgraphs that connect these elements. We propose to investigate the problem of keyword query routing for keyword search over a large number of structured and Linked Data sources. ADVANTAGES OF PROPOSED SYSTEM:  Routing keywords only to relevant sources can reduce the high cost of searching for structured results that span multiple sources.  The routing plans, produced can be used to compute results from multiple sources.
  • 6. SYSTEM ARCHITECTURE: MODULES:  Linked Data Generation  Key level Mapping  Multilevel Inter relationship
  • 7.  Routing Plan MODULES DESCRIPTION: Linked Data Generation The GeoNames Services makes it possible to add geospatial semantic information to the Word Wide Web. All over 6.2 million geonames toponyms now have a unique URL with a corresponding XML web service. In this we have used Country Info , Time zone and Finance Info services. This model resembles RDF data where entities stand for some RDF resources, data values stand for RDF literals, and relations and attributes correspond to RDF triples. While it is primarily used to model RDF Linked Data on the web, such a graph model is sufficiently general to capture XML and relational data. Key level Mapping The set-level graph essentially captures a part of the Linked Data schema on the web that is represented in RDFS, i.e., relations between classes. Often, a schema might be incomplete or simply does not exist for RDF data on the web. In such a case, a pseudoschema can be obtained by computing a structural summary such as a data guide. A set-level data graph can be derived from a given schema or a generated pseudoschema. The web of data is modeled as a web graph where GA is the set of all data graphs, N is the set of all nodes, E is the set of all “internal” edges that connect elements within a particular source. Multilevel Inter relationship
  • 8. The search space of keyword query routing using a multilevel inter-relationship graph. The inter-relationships between elements at different levels keyword is mentioned in some entity descriptions at the element level. Entities at the element level are associated with a set-level element via type. A set-level element is contained in a source. There is an edge between two keywords if two elements at the element level mentioning these keywords are connected via a path. We propose a ranking scheme that deals with relevance at many levels. Routing Plan: Given the web graph W =(G,N,E) and a keyword query K, the mapping: K-2G that associates a query with a set of data graphs is called a keyword routing plan RP. A plan RP is considered valid w.r.t. K when the union set of its data graphs contains a result for K. The problem of keyword query routing is to find the top-k keyword routing plans based on their relevance to a query. A relevant plan should correspond to the information need as intended by the user. SYSTEM REQUIREMENTS: HARDWARE REQUIREMENTS:  System : Pentium IV 2.4 GHz.  Hard Disk : 40 GB.  Floppy Drive : 1.44 Mb.  Monitor : 15 VGA Colour.  Mouse : Logitech.
  • 9.  Ram : 512 Mb. SOFTWARE REQUIREMENTS:  Operating system : Windows XP/7.  Coding Language : JAVA/J2EE  IDE : Netbeans 7.4  Database : MYSQL REFERENCE: Thanh Tran and Lei Zhang, “Keyword Query Routing”. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 26, NO. 2, FEBRUARY 2014