SlideShare a Scribd company logo
GRID INFORMATION
        RETRIVAL SYSTEM
                          USING JAVA
INTRODUCTION
GRID IR is Information Retrieval on the grid! It is a new initiative to
bring together information retrieval techniques with grid computing.
IR or information retrieval is a field of research            concerned with
searching unstructured (or quasi-structured) data              such as text
documents and the retrieval of results pertinent to           a user’s query.
Modern web search engines are the most                         widely known
implementations of IR system.
Grid computing is the accomplishment of computational tasks on a set
of computers connected by a network. This is similar to distributed
computing, except with a more finely grained implementation for task
assignment and coordination among the grid elements.
Grid computing provides clustering of remotely distributed computing.
The principal focus of grid computing to date has been on maximizing
the use of available processor resources for compute-intensive
applications. Grid computing along with storage virtualization and
server virtualization enables a Utility Computing.
Applying the resources of many computers in a network to a single
problem at the same time – usually a scientific or technical problem that
requires a great number of computer processing cycles or access to
large amounts of data. Grid computing uses software to divide and farm
out pieces of a program to as many as several thousand computers.
A number of corporations, professional groups and university consortia
have developed frameworks and software for managing grid computing
projects. Grid computing is a model for allowing companies to use a
large number of computing resources on demand, no matter where they
are located.
Grid IR applies the tools of grid computing to IR to provide a common
infrastructure for distributed IR. It also brings the capabilities of IR to grid
computing. GRID IR is a newly proposed initiative to implement a
specific architecture for realizing IR on the open grid service architecture
(OGSA) grid-computing platform. Traditional IR models are broken into
constituent pieces and described as OGSA grid services. A model for
interaction among these services describes the GRID IR system.
AIM/OBJECTIVE OF THE SYSTEM
The main aim of grid IR is to allow users information needs to be
matched to documents by document collections, indexes and query
engines which all exist as grid services.
The project is implemented using JAVA. MS-ACCESS database is used
for indexing the keywords of the document.


PROPOSED SYSTEM SOFTWARE REQUIREMENTS
Operating system    :   Windows XP/2000
Software            :   JDK 1.3 or higher
Database            :   MS-ACCESS


PROPOSED SYSTEM HARDWARE REQUIREMENTS
Processor    : Intel Pentium PIII or higher
RAM          : 128 MB or higher
HDD          : 80 GB HDD
FDD          : 1.44 MB or higher
Monitor / Keyboard / CD drive

PROPOSED SYSTEM DESCRIPTION
Grid Computing is an advanced technology of distributed computing. A
Grid is a collection of computers, storage and other devices, which are
joined together by any means of communication like Internet and which
can be used to manage information and solve their problems among
themselves.
Grid Computing allows usage of the unutilized resources of other
systems. This is achieved by distributing the workload of the system to
the other systems in order to use their unused resources such as their
memory, processor, etc which results in balancing the workload,
decreasing the network traffic, bandwidth, etc.
This concept is used in our project to render a large image in a very
short time by distributing the image to many systems for using their
resources. As the workload is evenly distributed among the grid
network, even the large work can be done in a short time itself.
The main scope is that using the unused resources to complete the
work efficiently. This project helps to use the resources efficiently and
cost effective.
Grid Computing is about making large amounts of computing power
available for applications and users. Collaborative development of Java
Grid Engine technology provides the proper development framework to
ensure that Grid Engine technology meets the requirements of the
largest number of users.
Grid computing is a form of networking. Unlike conventional networks
that focus on communication among devices, grid computing harnesses
unused processing cycles of all computers in a network for solving
problems too intensive for any stand-alone machine.
A common example of a well-known grid computing project is the SETI
(Search for Extraterrestrial Intelligence) @Home project, in which PC
users worldwide donate unused processor cycles to help the search for
signs of extraterrestrial life by analyzing signals coming from outer
space.
The proposed project relies on individual users to volunteer to allow the
project to harness the unused processing power of the user's computer.
This method saves the project both money and resources.
This project in Java based Grid computing does require special imaging
software that is unique to the computing project for which the grid is
being used.
The basic idea of grid IR is to define an IR system in terms of three
functional components, implemented as grid services: the collection
manager service (CM), the indexing/searching service (IS), and the
Query processing service (QP).
These services are autonomous, and being grid services, they are
distributed. Since they can be used to create new IR systems or link
existing ones together in an interoperable network of IR services.
Information retrieval(IR) is the science and practice of identifying
documents or sub-documents that needs information needs. Usually, IR
deals with textual documents in semi-structured (e.g., HTML, XML) or
unstructured (plain text) format. In order to boost processing power,
institutions aggregated computing resources across the entire
institution.
The same idea of sharing resources has paved the way for grid
computing but with a far wider scale and scope. Grid computing, in
effect, provides a global reach to distributed computing.
It promises lower total computing costs along with on-demand, reliable,
and inexpensive access to the vast, available computing resources that
would other wise go unused.
GRID COMPUTING FEATURES
The requirements for grid-computing infrastructure can be described by
the following attributed:
•   Pooling of resources to increase utilization
•   Provisioning of work based on policies and dynamic requirements
•   Virtualization at every layer of the computing stack
•   Self-adaptive software that largely tunes and fixes itself
•   Unified management and provisioning.


                 PROPOSED SYSTEM MODULES
Java Grid project is divided into three modules server, client and
worker

1. SERVER MODULE
       User interface Job Scheduler
       Workload Management
       Resource Management
       Data Management


2. WORKER MODULE
       Job Requests Receiver
       Job Processing Manager
       Job Requests Sender


3. CLIENT MODULE
       Job Fragmenter
       Job Requests Sender
       Job Results Receiver
       Job Results Aggregator
GRID - MODULE DESCRIPTION
1. SERVER MODULE
   Server module, which maintains the number of clients and worker
   connected to the grid engine, amount of work load given to the
   worker, add grid node, remove grid node, data available in the
   clients.

2. CLIENT MODULE
   The given job is divided into job fragments and given to the grid
   server to process, client aggregate the resultant job fragments form
   the grid server. The purpose of the client is to divide and aggregate
   the job.

3. WORKER MODULE
   Worker process the job given by the grid server and then result is
   send to the grid server. Worker module runs server automatically
   identifies and connect the worker to the grid engine.
   In this project worker process the job such as rendering of images
   using pov ray software.

          GRID – CLIENT                   GRID - WORKER


                               G
         Job Fragmenter
                               R
                               I
                Job Requests
                               D
                                              Job Requests Receiver
       Job Requests Sender
                               S
                               E
                               R                  Job Processing
                                                     Manager
                               V
                               E
      Job Results Receiver     R
                                                Job Results Sender



    Job Results Aggregator

More Related Content

PDF
An Energy Efficient Data Transmission and Aggregation of WSN using Data Proce...
PDF
RESEARCH ON DISTRIBUTED SOFTWARE TESTING PLATFORM BASED ON CLOUD RESOURCE
PDF
Quality of Service based Task Scheduling Algorithms in Cloud Computing
PPTX
An optimized scientific workflow scheduling in cloud computing
PDF
Improve the Offloading Decision by Adaptive Partitioning of Task for Mobile C...
PDF
Intelligent Placement of Datacenters for Internet Services
PDF
IRJET- A Review on K-Means++ Clustering Algorithm and Cloud Computing wit...
PPT
A Survey on Resource Allocation & Monitoring in Cloud Computing
An Energy Efficient Data Transmission and Aggregation of WSN using Data Proce...
RESEARCH ON DISTRIBUTED SOFTWARE TESTING PLATFORM BASED ON CLOUD RESOURCE
Quality of Service based Task Scheduling Algorithms in Cloud Computing
An optimized scientific workflow scheduling in cloud computing
Improve the Offloading Decision by Adaptive Partitioning of Task for Mobile C...
Intelligent Placement of Datacenters for Internet Services
IRJET- A Review on K-Means++ Clustering Algorithm and Cloud Computing wit...
A Survey on Resource Allocation & Monitoring in Cloud Computing

What's hot (19)

PPTX
Task Scheduling methodology in cloud computing
PDF
Qo s aware scientific application scheduling algorithm in cloud environment
PDF
D04573033
PDF
Performance evaluation and estimation model using regression method for hadoo...
PDF
Allocation Strategies of Virtual Resources in Cloud-Computing Networks
PDF
Cloud computing Review over various scheduling algorithms
PDF
NGN Hybrid Cloud Matrix
PDF
Lm2519942003
PDF
On Traffic-Aware Partition and Aggregation in Map Reduce for Big Data Applica...
PDF
IRJET- Optimization of Completion Time through Efficient Resource Allocation ...
PDF
Bhadale group of companies - engineering innovations programs catalogue
PDF
A survey of various scheduling algorithm in cloud computing environment
PDF
International Journal of Engineering Inventions (IJEI)
PDF
Peer-to-Peer Data Sharing and Deduplication using Genetic Algorithm
PPTX
Cs6703 grid and cloud computing unit 4
PDF
Scheduling in cloud computing
PDF
A Novel Approach in Scheduling Of the Real- Time Tasks In Heterogeneous Multi...
PDF
A Prolific Scheme for Load Balancing Relying on Task Completion Time
PDF
Mapreduce - Simplified Data Processing on Large Clusters
Task Scheduling methodology in cloud computing
Qo s aware scientific application scheduling algorithm in cloud environment
D04573033
Performance evaluation and estimation model using regression method for hadoo...
Allocation Strategies of Virtual Resources in Cloud-Computing Networks
Cloud computing Review over various scheduling algorithms
NGN Hybrid Cloud Matrix
Lm2519942003
On Traffic-Aware Partition and Aggregation in Map Reduce for Big Data Applica...
IRJET- Optimization of Completion Time through Efficient Resource Allocation ...
Bhadale group of companies - engineering innovations programs catalogue
A survey of various scheduling algorithm in cloud computing environment
International Journal of Engineering Inventions (IJEI)
Peer-to-Peer Data Sharing and Deduplication using Genetic Algorithm
Cs6703 grid and cloud computing unit 4
Scheduling in cloud computing
A Novel Approach in Scheduling Of the Real- Time Tasks In Heterogeneous Multi...
A Prolific Scheme for Load Balancing Relying on Task Completion Time
Mapreduce - Simplified Data Processing on Large Clusters
Ad

Viewers also liked (7)

PDF
A S P
PDF
perf paper 9 tips for sales people
PDF
Final Year Projects In Chennai, Final Year Projects At Chennai, Final Year Pr...
PDF
TMT Industry Research Round-up H1 2014
PPT
Johnson Presentation
PDF
Interview Handling Tips
PPTX
Personal interview
A S P
perf paper 9 tips for sales people
Final Year Projects In Chennai, Final Year Projects At Chennai, Final Year Pr...
TMT Industry Research Round-up H1 2014
Johnson Presentation
Interview Handling Tips
Personal interview
Ad

Similar to Java Abs Grid Information Retrival System (20)

PPTX
Grid computing
PPT
Grid Computing
PPT
Gridcomputingppt
PPT
Grid Computing
PPT
All about GridComputing-an introduction (2).ppt
DOCX
Grid computing assiment
PPTX
Unit i introduction to grid computing
DOC
Grid computing 12
PPT
GridComputing-an introduction.ppt
PPT
Jug gridgain java_grid_computing_made_simple
PPT
grid mining
PPTX
Grid computing the grid
PDF
A Review Paper On Grid Computing
PPT
Inroduction to grid computing by gargi shankar verma
PPT
Grid computing by vaishali sahare [katkar]
PPT
Grid computing
PDF
The grid aprimer
DOCX
Grid computing dis
PPT
grid computing
PPT
Grid computing
Grid computing
Grid Computing
Gridcomputingppt
Grid Computing
All about GridComputing-an introduction (2).ppt
Grid computing assiment
Unit i introduction to grid computing
Grid computing 12
GridComputing-an introduction.ppt
Jug gridgain java_grid_computing_made_simple
grid mining
Grid computing the grid
A Review Paper On Grid Computing
Inroduction to grid computing by gargi shankar verma
Grid computing by vaishali sahare [katkar]
Grid computing
The grid aprimer
Grid computing dis
grid computing
Grid computing

More from ncct (20)

PDF
Biomedical Wearable Device For Remote Monitoring Ofphysiological Signals
PDF
Digital Water Marking For Video Piracy Detection
PDF
Self Repairing Tree Topology Enabling Content Based Routing In Local Area Ne...
PDF
Cockpit White Box
PDF
Rail Track Inspector
PDF
Botminer Clustering Analysis Of Network Traffic For Protocol And Structure...
PDF
Bot Robo Tanker Sound Detector
PDF
Distance Protection
PDF
Bluetooth Jammer
PDF
Crypkit 1
PDF
I E E E 2009 Java Projects
PDF
B E Projects M C A Projects B
PDF
J2 E E Projects, I E E E Projects 2009
PDF
J2 M E Projects, I E E E Projects 2009
PDF
Engineering College Projects, M C A Projects, B E Projects, B Tech Pr...
PDF
B E M E Projects M C A Projects B
PDF
I E E E 2009 Java Projects, I E E E 2009 A S P
PDF
Advantages Of Software Projects N C C T
PDF
Engineering Projects
PDF
Software Projects Java Projects Mobile Computing
Biomedical Wearable Device For Remote Monitoring Ofphysiological Signals
Digital Water Marking For Video Piracy Detection
Self Repairing Tree Topology Enabling Content Based Routing In Local Area Ne...
Cockpit White Box
Rail Track Inspector
Botminer Clustering Analysis Of Network Traffic For Protocol And Structure...
Bot Robo Tanker Sound Detector
Distance Protection
Bluetooth Jammer
Crypkit 1
I E E E 2009 Java Projects
B E Projects M C A Projects B
J2 E E Projects, I E E E Projects 2009
J2 M E Projects, I E E E Projects 2009
Engineering College Projects, M C A Projects, B E Projects, B Tech Pr...
B E M E Projects M C A Projects B
I E E E 2009 Java Projects, I E E E 2009 A S P
Advantages Of Software Projects N C C T
Engineering Projects
Software Projects Java Projects Mobile Computing

Recently uploaded (20)

PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
cuic standard and advanced reporting.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
KodekX | Application Modernization Development
Chapter 3 Spatial Domain Image Processing.pdf
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Programs and apps: productivity, graphics, security and other tools
Dropbox Q2 2025 Financial Results & Investor Presentation
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
cuic standard and advanced reporting.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Per capita expenditure prediction using model stacking based on satellite ima...
NewMind AI Weekly Chronicles - August'25 Week I
Unlocking AI with Model Context Protocol (MCP)
Diabetes mellitus diagnosis method based random forest with bat algorithm
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
20250228 LYD VKU AI Blended-Learning.pptx
Network Security Unit 5.pdf for BCA BBA.
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
KodekX | Application Modernization Development

Java Abs Grid Information Retrival System

  • 1. GRID INFORMATION RETRIVAL SYSTEM USING JAVA INTRODUCTION GRID IR is Information Retrieval on the grid! It is a new initiative to bring together information retrieval techniques with grid computing. IR or information retrieval is a field of research concerned with searching unstructured (or quasi-structured) data such as text documents and the retrieval of results pertinent to a user’s query. Modern web search engines are the most widely known implementations of IR system. Grid computing is the accomplishment of computational tasks on a set of computers connected by a network. This is similar to distributed computing, except with a more finely grained implementation for task assignment and coordination among the grid elements. Grid computing provides clustering of remotely distributed computing. The principal focus of grid computing to date has been on maximizing the use of available processor resources for compute-intensive applications. Grid computing along with storage virtualization and server virtualization enables a Utility Computing. Applying the resources of many computers in a network to a single problem at the same time – usually a scientific or technical problem that requires a great number of computer processing cycles or access to large amounts of data. Grid computing uses software to divide and farm out pieces of a program to as many as several thousand computers. A number of corporations, professional groups and university consortia have developed frameworks and software for managing grid computing projects. Grid computing is a model for allowing companies to use a large number of computing resources on demand, no matter where they are located. Grid IR applies the tools of grid computing to IR to provide a common infrastructure for distributed IR. It also brings the capabilities of IR to grid computing. GRID IR is a newly proposed initiative to implement a specific architecture for realizing IR on the open grid service architecture (OGSA) grid-computing platform. Traditional IR models are broken into constituent pieces and described as OGSA grid services. A model for interaction among these services describes the GRID IR system.
  • 2. AIM/OBJECTIVE OF THE SYSTEM The main aim of grid IR is to allow users information needs to be matched to documents by document collections, indexes and query engines which all exist as grid services. The project is implemented using JAVA. MS-ACCESS database is used for indexing the keywords of the document. PROPOSED SYSTEM SOFTWARE REQUIREMENTS Operating system : Windows XP/2000 Software : JDK 1.3 or higher Database : MS-ACCESS PROPOSED SYSTEM HARDWARE REQUIREMENTS Processor : Intel Pentium PIII or higher RAM : 128 MB or higher HDD : 80 GB HDD FDD : 1.44 MB or higher Monitor / Keyboard / CD drive PROPOSED SYSTEM DESCRIPTION Grid Computing is an advanced technology of distributed computing. A Grid is a collection of computers, storage and other devices, which are joined together by any means of communication like Internet and which can be used to manage information and solve their problems among themselves. Grid Computing allows usage of the unutilized resources of other systems. This is achieved by distributing the workload of the system to the other systems in order to use their unused resources such as their memory, processor, etc which results in balancing the workload, decreasing the network traffic, bandwidth, etc. This concept is used in our project to render a large image in a very short time by distributing the image to many systems for using their resources. As the workload is evenly distributed among the grid network, even the large work can be done in a short time itself. The main scope is that using the unused resources to complete the work efficiently. This project helps to use the resources efficiently and cost effective.
  • 3. Grid Computing is about making large amounts of computing power available for applications and users. Collaborative development of Java Grid Engine technology provides the proper development framework to ensure that Grid Engine technology meets the requirements of the largest number of users. Grid computing is a form of networking. Unlike conventional networks that focus on communication among devices, grid computing harnesses unused processing cycles of all computers in a network for solving problems too intensive for any stand-alone machine. A common example of a well-known grid computing project is the SETI (Search for Extraterrestrial Intelligence) @Home project, in which PC users worldwide donate unused processor cycles to help the search for signs of extraterrestrial life by analyzing signals coming from outer space. The proposed project relies on individual users to volunteer to allow the project to harness the unused processing power of the user's computer. This method saves the project both money and resources. This project in Java based Grid computing does require special imaging software that is unique to the computing project for which the grid is being used. The basic idea of grid IR is to define an IR system in terms of three functional components, implemented as grid services: the collection manager service (CM), the indexing/searching service (IS), and the Query processing service (QP). These services are autonomous, and being grid services, they are distributed. Since they can be used to create new IR systems or link existing ones together in an interoperable network of IR services. Information retrieval(IR) is the science and practice of identifying documents or sub-documents that needs information needs. Usually, IR deals with textual documents in semi-structured (e.g., HTML, XML) or unstructured (plain text) format. In order to boost processing power, institutions aggregated computing resources across the entire institution. The same idea of sharing resources has paved the way for grid computing but with a far wider scale and scope. Grid computing, in effect, provides a global reach to distributed computing. It promises lower total computing costs along with on-demand, reliable, and inexpensive access to the vast, available computing resources that would other wise go unused.
  • 4. GRID COMPUTING FEATURES The requirements for grid-computing infrastructure can be described by the following attributed: • Pooling of resources to increase utilization • Provisioning of work based on policies and dynamic requirements • Virtualization at every layer of the computing stack • Self-adaptive software that largely tunes and fixes itself • Unified management and provisioning. PROPOSED SYSTEM MODULES Java Grid project is divided into three modules server, client and worker 1. SERVER MODULE User interface Job Scheduler Workload Management Resource Management Data Management 2. WORKER MODULE Job Requests Receiver Job Processing Manager Job Requests Sender 3. CLIENT MODULE Job Fragmenter Job Requests Sender Job Results Receiver Job Results Aggregator
  • 5. GRID - MODULE DESCRIPTION 1. SERVER MODULE Server module, which maintains the number of clients and worker connected to the grid engine, amount of work load given to the worker, add grid node, remove grid node, data available in the clients. 2. CLIENT MODULE The given job is divided into job fragments and given to the grid server to process, client aggregate the resultant job fragments form the grid server. The purpose of the client is to divide and aggregate the job. 3. WORKER MODULE Worker process the job given by the grid server and then result is send to the grid server. Worker module runs server automatically identifies and connect the worker to the grid engine. In this project worker process the job such as rendering of images using pov ray software. GRID – CLIENT GRID - WORKER G Job Fragmenter R I Job Requests D Job Requests Receiver Job Requests Sender S E R Job Processing Manager V E Job Results Receiver R Job Results Sender Job Results Aggregator