SlideShare a Scribd company logo
GLOBALSOFT TECHNOLOGIES 
IEEE PROJECTS & SOFTWARE DEVELOPMENTS 
IEEE FINAL YEAR PROJECTS|IEEE ENGINEERING PROJECTS|IEEE STUDENTS PROJECTS|IEEE 
BULK PROJECTS|BE/BTECH/ME/MTECH/MS/MCA PROJECTS|CSE/IT/ECE/EEE PROJECTS 
CELL: +91 98495 39085, +91 99662 35788, +91 98495 57908, +91 97014 40401 
Visit: www.finalyearprojects.org Mail to:ieeefinalsemprojects@gmail.com 
Keyword Query Routing 
ABSTRACT: 
Keyword search is an intuitive paradigm for searching linked data sources on the 
web. We propose to route keywords only to relevant sources to reduce the high 
cost of processing keyword search queries over all sources. We propose a novel 
method for computing top-k routing plans based on their potentials to contain 
results for a given keyword query. We employ a keyword-element relationship 
summary that compactly represents relationships between keywords and the data 
elements mentioning them. A multilevel scoring mechanism is proposed for 
computing the relevance of routing plans based on scores at the level of keywords, 
data elements, element sets, and subgraphs that connect these elements. 
Experiments carried out using 150 publicly available sources on the web showed 
that valid plans (precision@1 of 0.92) that are highly relevant (mean reciprocal 
rank of 0.89) can be computed in 1 second on average on a single PC. Further, we 
show routing greatly helps to improve the performance of keyword search, without 
compromising its result quality.
AIM: 
Linked data describes a method of publishing structured data so that it can be 
interlinked and become more useful. Keyword search is an intuitive paradigm for 
searching linked data sources on the web. We propose to route keywords only to 
relevant sources to reduce the high cost of processing keyword search queries over 
all sources. In this we have implement TOP K-Routing plan based on their 
potentials to contain results for a given keyword query. 
SYNOPSIS: 
In recent years the Web has evolved from a global information space of linked 
documents to one where both documents and data are linked. Underpinning this 
evolution is a set of best practices for publishing and connecting structured data on 
the Web known as Linked Data. The adoption of the Linked Data best practices 
has lead to the extension of the Web with a global data space connecting data from 
diverse domains such as people, companies, books, scientific publications, films, 
music, television and radio programmes, genes, proteins, drugs and clinical trials, 
online communities, statistical and scientific data, and reviews. This Web of Data 
enables new types of applications. There are generic Linked Data browsers which 
allow users to start browsing in one data source and then navigate along links into 
related data sources. There are Linked Data search engines that crawl the Web of 
Data by following links between data sources and provide expressive query 
capabilities over aggregated data, similar to how a local database is queried today. 
The Web of Data also opens up new possibilities for domain-specific applications. 
Unlike Web 2.0 mashups which work against a fixed set of data sources, Linked
Data applications operate on top of an unbound, global data space. This enables 
them to deliver more complete answers as new data sources appear on the Web. 
We propose to investigate the problem of keyword query routing for 
keyword search over a large number of structured and Linked Data sources. 
Routing keywords only to relevant sources can reduce the high cost of searching 
for structured results that span multiple sources. To the best of our knowledge, the 
work presented in this paper represents the first attempt to address this problem. 
We use a graph-based data model to characterize individual data sources. In 
that model, we distinguish between an element-level data graph representing 
relationships between individual data elements, and a set-level data graph, which 
captures information about group of elements. This set-level graph essentially 
captures a part of the Linked Data schema on the web that is represented in RDFS, 
i.e., relations between classes. Often, a schema might be incomplete or simply does 
not exist for RDF data on the web. In such a case, a pseudoschema can be obtained 
by computing a structural summary such as a dataguide. 
EXISTING SYSTEM: 
Existing work can be categorized into two main categories: 
 schema-based approaches 
 Schema-agnostic approaches 
There are schema-based approaches implemented on top of off-the-shelf 
databases. A keyword query is processed by mapping keywords to elements of the 
database (called keyword elements). Then, using the schema, valid join sequences
are derived, which are then employed to join (“connect”) the computed keyword 
elements to form so called candidate networks representing possible results to the 
keyword query. 
Schema-agnostic approaches operate directly on the data. Structured results 
are computed by exploring the underlying data graph. The goal is to find structures 
in the data called Steiner trees (Steiner graphs in general), which connect keyword 
elements. Various kinds of algorithms have been proposed for the efficient 
exploration of keyword search results over data graphs, which might be very large. 
Examples are bidirectional search and dynamic programming 
Existing work on keyword search relies on an element-level model (i.e., data 
graphs) to compute keyword query results. 
DISADVANTAGES OF EXISTING SYSTEM: 
 The number of potential results may increase exponentially with the 
number of sources and links between them. Yet, most of the results 
may be not necessary especially when they are not relevant to the 
user. 
 The routing problem, we need to compute results capturing specific 
elements at the data level. 
 Routing keywords return all the source which may or may not be the 
relevant sources 
PROPOSED SYSTEM:
We propose to route keywords only to relevant sources to reduce the high cost of 
processing keyword search queries over all sources. We propose a novel method 
for computing top-k routing plans based on their potentials to contain results for a 
given keyword query. We employ a keyword-element relationship summary that 
compactly represents relationships between keywords and the data elements 
mentioning them. A multilevel scoring mechanism is proposed for computing the 
relevance of routing plans based on scores at the level of keywords, data elements, 
element sets, and subgraphs that connect these elements. We propose to investigate 
the problem of keyword query routing for keyword search over a large number of 
structured and Linked Data sources. 
ADVANTAGES OF PROPOSED SYSTEM: 
 Routing keywords only to relevant sources can reduce the high cost of 
searching for structured results that span multiple sources. 
 The routing plans, produced can be used to compute results from multiple 
sources.
SYSTEM ARCHITECTURE: 
MODULES: 
 Linked Data Generation 
 Key level Mapping 
 Multilevel Inter relationship
 Routing Plan 
MODULES DESCRIPTION: 
Linked Data Generation 
The GeoNames Services makes it possible to add geospatial semantic information 
to the Word Wide Web. All over 6.2 million geonames toponyms now have a 
unique URL with a corresponding XML web service. In this we have used Country 
Info , Time zone and Finance Info services. This model resembles RDF data where 
entities stand for some RDF resources, data values stand for RDF literals, and 
relations and attributes correspond to RDF triples. While it is primarily used to 
model RDF Linked Data on the web, such a graph model is sufficiently general to 
capture XML and relational data. 
Key level Mapping 
The set-level graph essentially captures a part of the Linked Data schema on the 
web that is represented in RDFS, i.e., relations between classes. Often, a schema 
might be incomplete or simply does not exist for RDF data on the web. In such a 
case, a pseudoschema can be obtained by computing a structural summary such as 
a data guide. A set-level data graph can be derived from a given schema or a 
generated pseudoschema. The web of data is modeled as a web graph where GA is 
the set of all data graphs, N is the set of all nodes, E is the set of all “internal” 
edges that connect elements within a particular source. 
Multilevel Inter relationship
The search space of keyword query routing using a multilevel inter-relationship 
graph. The inter-relationships between elements at different levels keyword is 
mentioned in some entity descriptions at the element level. Entities at the element 
level are associated with a set-level element via type. A set-level element is 
contained in a source. There is an edge between two keywords if two elements at 
the element level mentioning these keywords are connected via a path. We propose 
a ranking scheme that deals with relevance at many levels. 
Routing Plan: 
Given the web graph W =(G,N,E) and a keyword query K, the mapping: K-2G that 
associates a query with a set of data graphs is called a keyword routing plan RP. A 
plan RP is considered valid w.r.t. K when the union set of its data graphs contains a 
result for K. The problem of keyword query routing is to find the top-k keyword 
routing plans based on their relevance to a query. A relevant plan should 
correspond to the information need as intended by the user. 
SYSTEM REQUIREMENTS: 
HARDWARE REQUIREMENTS: 
 System : Pentium IV 2.4 GHz. 
 Hard Disk : 40 GB. 
 Floppy Drive : 1.44 Mb. 
 Monitor : 15 VGA Colour. 
 Mouse : Logitech.
 Ram : 512 Mb. 
SOFTWARE REQUIREMENTS: 
 Operating system : Windows XP/7. 
 Coding Language : JAVA/J2EE 
 IDE : Netbeans 7.4 
 Database : MYSQL 
REFERENCE: 
Thanh Tran and Lei Zhang, “Keyword Query Routing”. IEEE TRANSACTIONS 
ON KNOWLEDGE AND DATA ENGINEERING, VOL. 26, NO. 2, FEBRUARY 
2014

More Related Content

DOCX
keyword query routing
DOCX
JPJ1423 Keyword Query Routing
DOC
Keyword query routing
PDF
Approaches for Keyword Query Routing
PDF
At33264269
DOC
Keyword query routing
PDF
Volume 2-issue-6-2016-2020
PDF
Using Page Size for Controlling Duplicate Query Results in Semantic Web
keyword query routing
JPJ1423 Keyword Query Routing
Keyword query routing
Approaches for Keyword Query Routing
At33264269
Keyword query routing
Volume 2-issue-6-2016-2020
Using Page Size for Controlling Duplicate Query Results in Semantic Web

What's hot (15)

PDF
Computing semantic similarity measure between words using web search engine
PDF
A survey on Design and Implementation of Clever Crawler Based On DUST Removal
PDF
G5234552
PDF
Nearest keyword set search in multi dimensional datasets
PDF
WEB PAGE RANKING BASED ON TEXT SUBSTANCE OF LINKED PAGES
PDF
An Advanced IR System of Relational Keyword Search Technique
PDF
Approximating Source Accuracy Using Dublicate Records in Da-ta Integration
PDF
Pdd crawler a focused web
PDF
Az31349353
PDF
IRJET- Review on Information Retrieval for Desktop Search Engine
PPT
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
PDF
Efficiently searching nearest neighbor in documents using keywords
PDF
Efficiently searching nearest neighbor in documents
Computing semantic similarity measure between words using web search engine
A survey on Design and Implementation of Clever Crawler Based On DUST Removal
G5234552
Nearest keyword set search in multi dimensional datasets
WEB PAGE RANKING BASED ON TEXT SUBSTANCE OF LINKED PAGES
An Advanced IR System of Relational Keyword Search Technique
Approximating Source Accuracy Using Dublicate Records in Da-ta Integration
Pdd crawler a focused web
Az31349353
IRJET- Review on Information Retrieval for Desktop Search Engine
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
Efficiently searching nearest neighbor in documents using keywords
Efficiently searching nearest neighbor in documents
Ad

Similar to IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing (20)

DOCX
Keyword Query Routing
PDF
At33264269
DOCX
Keyword query routing
PDF
Volume 2-issue-6-2016-2020
PDF
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
PDF
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
PDF
Enhancing keyword search over relational databases using ontologies
PDF
Object surface segmentation, Image segmentation, Region growing, X-Y-Z image,...
PDF
Paper id 25201463
PDF
WEB SEARCH ENGINE BASED SEMANTIC SIMILARITY MEASURE BETWEEN WORDS USING PATTE...
PDF
EFFICIENT SCHEMA BASED KEYWORD SEARCH IN RELATIONAL DATABASES
PDF
G1803054653
PDF
A Novel Data Extraction and Alignment Method for Web Databases
DOC
Introduction abstract
PPSX
Annotating search results from web databases-IEEE Transaction Paper 2013
PDF
Ijarcet vol-2-issue-7-2252-2257
PDF
Ijarcet vol-2-issue-7-2252-2257
PDF
Ak4301197200
PDF
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
PDF
B131626
Keyword Query Routing
At33264269
Keyword query routing
Volume 2-issue-6-2016-2020
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
Enhancing keyword search over relational databases using ontologies
Object surface segmentation, Image segmentation, Region growing, X-Y-Z image,...
Paper id 25201463
WEB SEARCH ENGINE BASED SEMANTIC SIMILARITY MEASURE BETWEEN WORDS USING PATTE...
EFFICIENT SCHEMA BASED KEYWORD SEARCH IN RELATIONAL DATABASES
G1803054653
A Novel Data Extraction and Alignment Method for Web Databases
Introduction abstract
Annotating search results from web databases-IEEE Transaction Paper 2013
Ijarcet vol-2-issue-7-2252-2257
Ijarcet vol-2-issue-7-2252-2257
Ak4301197200
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
B131626
Ad

More from IEEEFINALYEARSTUDENTPROJECTS (20)

DOCX
IEEE 2014 JAVA NETWORK SECURITY PROJECTS Efficient and privacy aware data agg...
DOCX
IEEE 2014 JAVA NETWORK SECURITY PROJECTS Building a scalable system for steal...
DOCX
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Token mac a fair mac protocol for pa...
DOCX
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Tag sense leveraging smartphones for...
DOC
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Privacy preserving optimal meeting l...
DOCX
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Preserving location privacy in geo s...
DOCX
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Friendbook a semantic based friend r...
DOCX
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Efficient and privacy aware data agg...
DOCX
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Cloud assisted mobile-access of heal...
DOCX
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS A low complexity algorithm for neigh...
DOCX
IEEE 2014 JAVA IMAGE PROCESSING PROJECTS Hierarchical prediction and context ...
DOCX
IEEE 2014 JAVA IMAGE PROCESSING PROJECTS Designing an-efficient-image encrypt...
DOCX
IEEE 2014 JAVA IMAGE PROCESSING PROJECTS Click prediction-for-web-image-reran...
DOCX
IEEE 2014 JAVA SERVICE COMPUTING PROJECTS Web service recommendation via expl...
DOCX
IEEE 2014 JAVA SERVICE COMPUTING PROJECTS Scalable and accurate prediction of...
DOCX
IEEE 2014 JAVA SERVICE COMPUTING PROJECTS Privacy enhanced web service compos...
DOCX
IEEE 2014 JAVA SERVICE COMPUTING PROJECTS Decentralized enactment of bpel pro...
DOCX
IEEE 2014 JAVA SERVICE COMPUTING PROJECTS A novel time obfuscated algorithm ...
DOCX
IEEE 2014 JAVA SOFTWARE ENGINEER PROJECTS Conservation of information softwar...
DOC
IEEE 2014 JAVA DATA MINING PROJECTS Xs path navigation on xml schemas made easy
IEEE 2014 JAVA NETWORK SECURITY PROJECTS Efficient and privacy aware data agg...
IEEE 2014 JAVA NETWORK SECURITY PROJECTS Building a scalable system for steal...
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Token mac a fair mac protocol for pa...
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Tag sense leveraging smartphones for...
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Privacy preserving optimal meeting l...
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Preserving location privacy in geo s...
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Friendbook a semantic based friend r...
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Efficient and privacy aware data agg...
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Cloud assisted mobile-access of heal...
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS A low complexity algorithm for neigh...
IEEE 2014 JAVA IMAGE PROCESSING PROJECTS Hierarchical prediction and context ...
IEEE 2014 JAVA IMAGE PROCESSING PROJECTS Designing an-efficient-image encrypt...
IEEE 2014 JAVA IMAGE PROCESSING PROJECTS Click prediction-for-web-image-reran...
IEEE 2014 JAVA SERVICE COMPUTING PROJECTS Web service recommendation via expl...
IEEE 2014 JAVA SERVICE COMPUTING PROJECTS Scalable and accurate prediction of...
IEEE 2014 JAVA SERVICE COMPUTING PROJECTS Privacy enhanced web service compos...
IEEE 2014 JAVA SERVICE COMPUTING PROJECTS Decentralized enactment of bpel pro...
IEEE 2014 JAVA SERVICE COMPUTING PROJECTS A novel time obfuscated algorithm ...
IEEE 2014 JAVA SOFTWARE ENGINEER PROJECTS Conservation of information softwar...
IEEE 2014 JAVA DATA MINING PROJECTS Xs path navigation on xml schemas made easy

Recently uploaded (20)

PDF
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
PPTX
Lecture Notes Electrical Wiring System Components
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PPTX
bas. eng. economics group 4 presentation 1.pptx
PPTX
OOP with Java - Java Introduction (Basics)
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PPTX
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PPTX
web development for engineering and engineering
PPTX
Geodesy 1.pptx...............................................
PPTX
Welding lecture in detail for understanding
PPT
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PPT
Project quality management in manufacturing
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
Lecture Notes Electrical Wiring System Components
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
bas. eng. economics group 4 presentation 1.pptx
OOP with Java - Java Introduction (Basics)
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
Foundation to blockchain - A guide to Blockchain Tech
web development for engineering and engineering
Geodesy 1.pptx...............................................
Welding lecture in detail for understanding
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
Project quality management in manufacturing
CYBER-CRIMES AND SECURITY A guide to understanding
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...

IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing

  • 1. GLOBALSOFT TECHNOLOGIES IEEE PROJECTS & SOFTWARE DEVELOPMENTS IEEE FINAL YEAR PROJECTS|IEEE ENGINEERING PROJECTS|IEEE STUDENTS PROJECTS|IEEE BULK PROJECTS|BE/BTECH/ME/MTECH/MS/MCA PROJECTS|CSE/IT/ECE/EEE PROJECTS CELL: +91 98495 39085, +91 99662 35788, +91 98495 57908, +91 97014 40401 Visit: www.finalyearprojects.org Mail to:ieeefinalsemprojects@gmail.com Keyword Query Routing ABSTRACT: Keyword search is an intuitive paradigm for searching linked data sources on the web. We propose to route keywords only to relevant sources to reduce the high cost of processing keyword search queries over all sources. We propose a novel method for computing top-k routing plans based on their potentials to contain results for a given keyword query. We employ a keyword-element relationship summary that compactly represents relationships between keywords and the data elements mentioning them. A multilevel scoring mechanism is proposed for computing the relevance of routing plans based on scores at the level of keywords, data elements, element sets, and subgraphs that connect these elements. Experiments carried out using 150 publicly available sources on the web showed that valid plans (precision@1 of 0.92) that are highly relevant (mean reciprocal rank of 0.89) can be computed in 1 second on average on a single PC. Further, we show routing greatly helps to improve the performance of keyword search, without compromising its result quality.
  • 2. AIM: Linked data describes a method of publishing structured data so that it can be interlinked and become more useful. Keyword search is an intuitive paradigm for searching linked data sources on the web. We propose to route keywords only to relevant sources to reduce the high cost of processing keyword search queries over all sources. In this we have implement TOP K-Routing plan based on their potentials to contain results for a given keyword query. SYNOPSIS: In recent years the Web has evolved from a global information space of linked documents to one where both documents and data are linked. Underpinning this evolution is a set of best practices for publishing and connecting structured data on the Web known as Linked Data. The adoption of the Linked Data best practices has lead to the extension of the Web with a global data space connecting data from diverse domains such as people, companies, books, scientific publications, films, music, television and radio programmes, genes, proteins, drugs and clinical trials, online communities, statistical and scientific data, and reviews. This Web of Data enables new types of applications. There are generic Linked Data browsers which allow users to start browsing in one data source and then navigate along links into related data sources. There are Linked Data search engines that crawl the Web of Data by following links between data sources and provide expressive query capabilities over aggregated data, similar to how a local database is queried today. The Web of Data also opens up new possibilities for domain-specific applications. Unlike Web 2.0 mashups which work against a fixed set of data sources, Linked
  • 3. Data applications operate on top of an unbound, global data space. This enables them to deliver more complete answers as new data sources appear on the Web. We propose to investigate the problem of keyword query routing for keyword search over a large number of structured and Linked Data sources. Routing keywords only to relevant sources can reduce the high cost of searching for structured results that span multiple sources. To the best of our knowledge, the work presented in this paper represents the first attempt to address this problem. We use a graph-based data model to characterize individual data sources. In that model, we distinguish between an element-level data graph representing relationships between individual data elements, and a set-level data graph, which captures information about group of elements. This set-level graph essentially captures a part of the Linked Data schema on the web that is represented in RDFS, i.e., relations between classes. Often, a schema might be incomplete or simply does not exist for RDF data on the web. In such a case, a pseudoschema can be obtained by computing a structural summary such as a dataguide. EXISTING SYSTEM: Existing work can be categorized into two main categories:  schema-based approaches  Schema-agnostic approaches There are schema-based approaches implemented on top of off-the-shelf databases. A keyword query is processed by mapping keywords to elements of the database (called keyword elements). Then, using the schema, valid join sequences
  • 4. are derived, which are then employed to join (“connect”) the computed keyword elements to form so called candidate networks representing possible results to the keyword query. Schema-agnostic approaches operate directly on the data. Structured results are computed by exploring the underlying data graph. The goal is to find structures in the data called Steiner trees (Steiner graphs in general), which connect keyword elements. Various kinds of algorithms have been proposed for the efficient exploration of keyword search results over data graphs, which might be very large. Examples are bidirectional search and dynamic programming Existing work on keyword search relies on an element-level model (i.e., data graphs) to compute keyword query results. DISADVANTAGES OF EXISTING SYSTEM:  The number of potential results may increase exponentially with the number of sources and links between them. Yet, most of the results may be not necessary especially when they are not relevant to the user.  The routing problem, we need to compute results capturing specific elements at the data level.  Routing keywords return all the source which may or may not be the relevant sources PROPOSED SYSTEM:
  • 5. We propose to route keywords only to relevant sources to reduce the high cost of processing keyword search queries over all sources. We propose a novel method for computing top-k routing plans based on their potentials to contain results for a given keyword query. We employ a keyword-element relationship summary that compactly represents relationships between keywords and the data elements mentioning them. A multilevel scoring mechanism is proposed for computing the relevance of routing plans based on scores at the level of keywords, data elements, element sets, and subgraphs that connect these elements. We propose to investigate the problem of keyword query routing for keyword search over a large number of structured and Linked Data sources. ADVANTAGES OF PROPOSED SYSTEM:  Routing keywords only to relevant sources can reduce the high cost of searching for structured results that span multiple sources.  The routing plans, produced can be used to compute results from multiple sources.
  • 6. SYSTEM ARCHITECTURE: MODULES:  Linked Data Generation  Key level Mapping  Multilevel Inter relationship
  • 7.  Routing Plan MODULES DESCRIPTION: Linked Data Generation The GeoNames Services makes it possible to add geospatial semantic information to the Word Wide Web. All over 6.2 million geonames toponyms now have a unique URL with a corresponding XML web service. In this we have used Country Info , Time zone and Finance Info services. This model resembles RDF data where entities stand for some RDF resources, data values stand for RDF literals, and relations and attributes correspond to RDF triples. While it is primarily used to model RDF Linked Data on the web, such a graph model is sufficiently general to capture XML and relational data. Key level Mapping The set-level graph essentially captures a part of the Linked Data schema on the web that is represented in RDFS, i.e., relations between classes. Often, a schema might be incomplete or simply does not exist for RDF data on the web. In such a case, a pseudoschema can be obtained by computing a structural summary such as a data guide. A set-level data graph can be derived from a given schema or a generated pseudoschema. The web of data is modeled as a web graph where GA is the set of all data graphs, N is the set of all nodes, E is the set of all “internal” edges that connect elements within a particular source. Multilevel Inter relationship
  • 8. The search space of keyword query routing using a multilevel inter-relationship graph. The inter-relationships between elements at different levels keyword is mentioned in some entity descriptions at the element level. Entities at the element level are associated with a set-level element via type. A set-level element is contained in a source. There is an edge between two keywords if two elements at the element level mentioning these keywords are connected via a path. We propose a ranking scheme that deals with relevance at many levels. Routing Plan: Given the web graph W =(G,N,E) and a keyword query K, the mapping: K-2G that associates a query with a set of data graphs is called a keyword routing plan RP. A plan RP is considered valid w.r.t. K when the union set of its data graphs contains a result for K. The problem of keyword query routing is to find the top-k keyword routing plans based on their relevance to a query. A relevant plan should correspond to the information need as intended by the user. SYSTEM REQUIREMENTS: HARDWARE REQUIREMENTS:  System : Pentium IV 2.4 GHz.  Hard Disk : 40 GB.  Floppy Drive : 1.44 Mb.  Monitor : 15 VGA Colour.  Mouse : Logitech.
  • 9.  Ram : 512 Mb. SOFTWARE REQUIREMENTS:  Operating system : Windows XP/7.  Coding Language : JAVA/J2EE  IDE : Netbeans 7.4  Database : MYSQL REFERENCE: Thanh Tran and Lei Zhang, “Keyword Query Routing”. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 26, NO. 2, FEBRUARY 2014