SlideShare a Scribd company logo
GLOBALSOFT TECHNOLOGIES 
IEEE PROJECTS & SOFTWARE DEVELOPMENTS 
IEEE FINAL YEAR PROJECTS|IEEE ENGINEERING PROJECTS|IEEE STUDENTS PROJECTS|IEEE 
BULK PROJECTS|BE/BTECH/ME/MTECH/MS/MCA PROJECTS|CSE/IT/ECE/EEE PROJECTS 
CELL: +91 98495 39085, +91 99662 35788, +91 98495 57908, +91 97014 40401 
Visit: www.finalyearprojects.org Mail to:ieeefinalsemprojects@gmail.com 
Searching Dimension Incomplete Databases 
Abstract 
Similarity query is a fundamental problem in database, data mining and information retrieval research. 
Recently, querying incomplete data has attracted extensive attention as it poses new challenges to 
traditional querying techniques. The existing work on querying incomplete data addresses the problem 
where the data values on certain dimensions are unknown. However, in many real -life applications, such 
as data collected by a sensor network in a noisy environment, not only the data values but also the 
dimension information may be missing. In this work, we propose to investigate the problem of similarity 
search on dimension incomplete data. A probabilistic framework is developed to model this problem so 
that the users can find objects in the database that are similar to the query with probability guarantee. 
Missing dimension information poses great computational challenge, since all possible combinations of 
missing dimensions need to be examined when evaluating the similarity between the query and the 
data objects. We develop the lower and upper bounds of the probability that a data object is similar to 
the query. These bounds enable efficient filtering of irrelevant data objects without explicitly examining 
all missing dimension combinations. A probability triangle inequality is also employed to furthe r prune 
the search space and speed up the query process. The proposed probabilistic framework and techniques 
can be applied to both whole and subsequence queries. Extensive experimental results on real -life data 
sets demonstrate the effectiveness and efficiency of our approach. 
Existing system 
Similarity query is a fundamental problem in database, data mining and information retrieval research. 
Recently, querying incomplete data has attracted extensive attention as it poses new challenges to 
traditional querying techniques. The existing work on querying incomplete data addresses the problem 
where the data values on certain dimensions are unknown. However, in many real -life applications, such 
as data collected by a sensor network in a noisy environment, not only the data values but also the 
dimension information may be missing.
Proposed system 
we propose to investigate the problem of similarity search on dimension incomplete data. A 
probabilistic framework is developed to model this problem so that the users can find objects in the 
database that are similar to the query with probability guarantee. Missing dimension information poses 
great computational challenge, since all possible combinations of missing dimensions need to be 
examined when evaluating the similarity between the query and the data objects. We develop the lower 
and upper bounds of the probability that a data object is similar to the query. These bounds enable 
efficient filtering of irrelevant data objects without explicitly examining all missing dimension 
combinations. A probability triangle inequality is also employed to further prune the search space and 
speed up the query process. The proposed probabilistic framework and techniques can be applied to 
both whole and subsequence queries. Extensive experimental results on real -life data sets demonstrate 
the effectiveness and efficiency of our approach. 
SYSTEM CONFIGURATION:- 
HARDWARE CONFIGURATION:- 
 Processor - Pentium –IV 
 Speed - 1.1 Ghz 
 RAM - 256 MB(min) 
 Hard Disk - 20 GB 
 Key Board - Standard Windows Keyboard 
 Mouse - Two or Three Button Mouse 
 Monitor - SVGA 
SOFTWARE CONFIGURATION:- 
 Operating System : Windows XP 
 Programming Language : JAVA 
 Java Version : JDK 1.6 & above.
IEEE 2014 JAVA DATA MINING PROJECTS Searching dimension incomplete databases

More Related Content

PPTX
Data analytics in computer networking
PDF
SDN Dependability: Assessment, Techniques, and Tools - SDN Research Group - I...
PDF
Query aware determinization of uncertain objects
PPTX
Big Data Analytics and Advanced Computer Networking Scenarios
PPTX
Search, Discovery and Analysis of Sensory Data Streams
PDF
YDC Resume
PPTX
Internet Search: the past, present and the future
Data analytics in computer networking
SDN Dependability: Assessment, Techniques, and Tools - SDN Research Group - I...
Query aware determinization of uncertain objects
Big Data Analytics and Advanced Computer Networking Scenarios
Search, Discovery and Analysis of Sensory Data Streams
YDC Resume
Internet Search: the past, present and the future

What's hot (19)

PDF
Towards Automatic Composition of Multicomponent Predictive Systems
DOCX
Himansu sahoo resume-ds
PDF
Towards reproducibility and maximally-open data
PDF
Quick presentation for the OpenML workshop in Eindhoven 2014
PDF
IEEE Datamining 2016 Title and Abstract
PDF
MINING USER-AWARE RARE SEQUENTIAL TOPIC PATTERNS IN DOCUMENT STREAMS
PDF
Query aware determinization of uncertain objects
PDF
IEEE Big data 2016 Title and Abstract
PPTX
Visualization and Analysis of Dynamic Networks
PPTX
Charleston Conference 2016
PDF
Top data science projects
PDF
Study on Cyber Security:Establishing a Sustainable Cyber Security Framework f...
PDF
MAPREDUCE IMPLEMENTATION FOR MALICIOUS WEBSITES CLASSIFICATION
PDF
Slides - Summary of: "Automating Data Preparation: Can We? Should We? Must We?"
PDF
Evaluation of a Multiple Regression Model for Noisy and Missing Data
PDF
accessible-streaming-algorithms
PPT
Pedro-Combining rapid data modelling and ontology services
PDF
Talk Big Data Conference Munich - Data Science needs real Data Scientists.
PPTX
Combining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Towards Automatic Composition of Multicomponent Predictive Systems
Himansu sahoo resume-ds
Towards reproducibility and maximally-open data
Quick presentation for the OpenML workshop in Eindhoven 2014
IEEE Datamining 2016 Title and Abstract
MINING USER-AWARE RARE SEQUENTIAL TOPIC PATTERNS IN DOCUMENT STREAMS
Query aware determinization of uncertain objects
IEEE Big data 2016 Title and Abstract
Visualization and Analysis of Dynamic Networks
Charleston Conference 2016
Top data science projects
Study on Cyber Security:Establishing a Sustainable Cyber Security Framework f...
MAPREDUCE IMPLEMENTATION FOR MALICIOUS WEBSITES CLASSIFICATION
Slides - Summary of: "Automating Data Preparation: Can We? Should We? Must We?"
Evaluation of a Multiple Regression Model for Noisy and Missing Data
accessible-streaming-algorithms
Pedro-Combining rapid data modelling and ontology services
Talk Big Data Conference Munich - Data Science needs real Data Scientists.
Combining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Ad

Similar to IEEE 2014 JAVA DATA MINING PROJECTS Searching dimension incomplete databases (20)

PDF
Enhance The Technique For Searching Dimension Incomplete Databases
PDF
Searching in metric spaces
PDF
A STUDY ON SIMILARITY MEASURE FUNCTIONS ON ENGINEERING MATERIALS SELECTION
PDF
4.on demand quality of web services using ranking by multi criteria 31-35
PDF
11.0004www.iiste.org call for paper.on demand quality of web services using r...
PDF
An Robust Outsourcing of Multi Party Dataset by Utilizing Super-Modularity an...
PDF
Nonmetric similarity search
PDF
On nonmetric similarity search problems in complex domains
PPTX
Once upon a time in Datatown ...
PDF
Topics In Rough Set Theory Current Applications To Granular Computing Seiki A...
PDF
An Advanced IR System of Relational Keyword Search Technique
PDF
B0930610
DOCX
DOTNET 2013 IEEE CLOUDCOMPUTING PROJECT Facilitating document annotation usin...
DOCX
JAVA 2013 IEEE CLOUDCOMPUTING PROJECT Facilitating document annotation using ...
DOCX
Facilitating document annotation using content and querying value
PPTX
Search as-you-type (Exact search)
DOCX
Facilitating document annotation using content and querying value
DOCX
JAVA 2013 IEEE DATAMINING PROJECT Facilitating document annotation using cont...
PDF
High Dimensional Indexing Transformational Approaches to High-Dimensional Ran...
PPTX
Missing Data imputation
Enhance The Technique For Searching Dimension Incomplete Databases
Searching in metric spaces
A STUDY ON SIMILARITY MEASURE FUNCTIONS ON ENGINEERING MATERIALS SELECTION
4.on demand quality of web services using ranking by multi criteria 31-35
11.0004www.iiste.org call for paper.on demand quality of web services using r...
An Robust Outsourcing of Multi Party Dataset by Utilizing Super-Modularity an...
Nonmetric similarity search
On nonmetric similarity search problems in complex domains
Once upon a time in Datatown ...
Topics In Rough Set Theory Current Applications To Granular Computing Seiki A...
An Advanced IR System of Relational Keyword Search Technique
B0930610
DOTNET 2013 IEEE CLOUDCOMPUTING PROJECT Facilitating document annotation usin...
JAVA 2013 IEEE CLOUDCOMPUTING PROJECT Facilitating document annotation using ...
Facilitating document annotation using content and querying value
Search as-you-type (Exact search)
Facilitating document annotation using content and querying value
JAVA 2013 IEEE DATAMINING PROJECT Facilitating document annotation using cont...
High Dimensional Indexing Transformational Approaches to High-Dimensional Ran...
Missing Data imputation
Ad

More from IEEEFINALYEARSTUDENTPROJECTS (20)

DOCX
IEEE 2014 JAVA NETWORK SECURITY PROJECTS Efficient and privacy aware data agg...
DOCX
IEEE 2014 JAVA NETWORK SECURITY PROJECTS Building a scalable system for steal...
DOCX
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Token mac a fair mac protocol for pa...
DOCX
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Tag sense leveraging smartphones for...
DOC
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Privacy preserving optimal meeting l...
DOCX
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Preserving location privacy in geo s...
DOCX
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Friendbook a semantic based friend r...
DOCX
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Efficient and privacy aware data agg...
DOCX
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Cloud assisted mobile-access of heal...
DOCX
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS A low complexity algorithm for neigh...
DOCX
IEEE 2014 JAVA IMAGE PROCESSING PROJECTS Hierarchical prediction and context ...
DOCX
IEEE 2014 JAVA IMAGE PROCESSING PROJECTS Designing an-efficient-image encrypt...
DOCX
IEEE 2014 JAVA IMAGE PROCESSING PROJECTS Click prediction-for-web-image-reran...
DOCX
IEEE 2014 JAVA SERVICE COMPUTING PROJECTS Web service recommendation via expl...
DOCX
IEEE 2014 JAVA SERVICE COMPUTING PROJECTS Scalable and accurate prediction of...
DOCX
IEEE 2014 JAVA SERVICE COMPUTING PROJECTS Privacy enhanced web service compos...
DOCX
IEEE 2014 JAVA SERVICE COMPUTING PROJECTS Decentralized enactment of bpel pro...
DOCX
IEEE 2014 JAVA SERVICE COMPUTING PROJECTS A novel time obfuscated algorithm ...
DOCX
IEEE 2014 JAVA SOFTWARE ENGINEER PROJECTS Conservation of information softwar...
DOC
IEEE 2014 JAVA DATA MINING PROJECTS Xs path navigation on xml schemas made easy
IEEE 2014 JAVA NETWORK SECURITY PROJECTS Efficient and privacy aware data agg...
IEEE 2014 JAVA NETWORK SECURITY PROJECTS Building a scalable system for steal...
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Token mac a fair mac protocol for pa...
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Tag sense leveraging smartphones for...
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Privacy preserving optimal meeting l...
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Preserving location privacy in geo s...
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Friendbook a semantic based friend r...
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Efficient and privacy aware data agg...
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS Cloud assisted mobile-access of heal...
IEEE 2014 JAVA MOBILE COMPUTING PROJECTS A low complexity algorithm for neigh...
IEEE 2014 JAVA IMAGE PROCESSING PROJECTS Hierarchical prediction and context ...
IEEE 2014 JAVA IMAGE PROCESSING PROJECTS Designing an-efficient-image encrypt...
IEEE 2014 JAVA IMAGE PROCESSING PROJECTS Click prediction-for-web-image-reran...
IEEE 2014 JAVA SERVICE COMPUTING PROJECTS Web service recommendation via expl...
IEEE 2014 JAVA SERVICE COMPUTING PROJECTS Scalable and accurate prediction of...
IEEE 2014 JAVA SERVICE COMPUTING PROJECTS Privacy enhanced web service compos...
IEEE 2014 JAVA SERVICE COMPUTING PROJECTS Decentralized enactment of bpel pro...
IEEE 2014 JAVA SERVICE COMPUTING PROJECTS A novel time obfuscated algorithm ...
IEEE 2014 JAVA SOFTWARE ENGINEER PROJECTS Conservation of information softwar...
IEEE 2014 JAVA DATA MINING PROJECTS Xs path navigation on xml schemas made easy

Recently uploaded (20)

PPTX
additive manufacturing of ss316l using mig welding
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PPTX
Safety Seminar civil to be ensured for safe working.
PPT
Mechanical Engineering MATERIALS Selection
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PDF
composite construction of structures.pdf
PPTX
Artificial Intelligence
PPTX
Lecture Notes Electrical Wiring System Components
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PPTX
CH1 Production IntroductoryConcepts.pptx
PPT
Introduction, IoT Design Methodology, Case Study on IoT System for Weather Mo...
PPT
Project quality management in manufacturing
PPTX
Sustainable Sites - Green Building Construction
PPTX
Internet of Things (IOT) - A guide to understanding
additive manufacturing of ss316l using mig welding
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
Model Code of Practice - Construction Work - 21102022 .pdf
Foundation to blockchain - A guide to Blockchain Tech
Safety Seminar civil to be ensured for safe working.
Mechanical Engineering MATERIALS Selection
Embodied AI: Ushering in the Next Era of Intelligent Systems
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
composite construction of structures.pdf
Artificial Intelligence
Lecture Notes Electrical Wiring System Components
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
CH1 Production IntroductoryConcepts.pptx
Introduction, IoT Design Methodology, Case Study on IoT System for Weather Mo...
Project quality management in manufacturing
Sustainable Sites - Green Building Construction
Internet of Things (IOT) - A guide to understanding

IEEE 2014 JAVA DATA MINING PROJECTS Searching dimension incomplete databases

  • 1. GLOBALSOFT TECHNOLOGIES IEEE PROJECTS & SOFTWARE DEVELOPMENTS IEEE FINAL YEAR PROJECTS|IEEE ENGINEERING PROJECTS|IEEE STUDENTS PROJECTS|IEEE BULK PROJECTS|BE/BTECH/ME/MTECH/MS/MCA PROJECTS|CSE/IT/ECE/EEE PROJECTS CELL: +91 98495 39085, +91 99662 35788, +91 98495 57908, +91 97014 40401 Visit: www.finalyearprojects.org Mail to:ieeefinalsemprojects@gmail.com Searching Dimension Incomplete Databases Abstract Similarity query is a fundamental problem in database, data mining and information retrieval research. Recently, querying incomplete data has attracted extensive attention as it poses new challenges to traditional querying techniques. The existing work on querying incomplete data addresses the problem where the data values on certain dimensions are unknown. However, in many real -life applications, such as data collected by a sensor network in a noisy environment, not only the data values but also the dimension information may be missing. In this work, we propose to investigate the problem of similarity search on dimension incomplete data. A probabilistic framework is developed to model this problem so that the users can find objects in the database that are similar to the query with probability guarantee. Missing dimension information poses great computational challenge, since all possible combinations of missing dimensions need to be examined when evaluating the similarity between the query and the data objects. We develop the lower and upper bounds of the probability that a data object is similar to the query. These bounds enable efficient filtering of irrelevant data objects without explicitly examining all missing dimension combinations. A probability triangle inequality is also employed to furthe r prune the search space and speed up the query process. The proposed probabilistic framework and techniques can be applied to both whole and subsequence queries. Extensive experimental results on real -life data sets demonstrate the effectiveness and efficiency of our approach. Existing system Similarity query is a fundamental problem in database, data mining and information retrieval research. Recently, querying incomplete data has attracted extensive attention as it poses new challenges to traditional querying techniques. The existing work on querying incomplete data addresses the problem where the data values on certain dimensions are unknown. However, in many real -life applications, such as data collected by a sensor network in a noisy environment, not only the data values but also the dimension information may be missing.
  • 2. Proposed system we propose to investigate the problem of similarity search on dimension incomplete data. A probabilistic framework is developed to model this problem so that the users can find objects in the database that are similar to the query with probability guarantee. Missing dimension information poses great computational challenge, since all possible combinations of missing dimensions need to be examined when evaluating the similarity between the query and the data objects. We develop the lower and upper bounds of the probability that a data object is similar to the query. These bounds enable efficient filtering of irrelevant data objects without explicitly examining all missing dimension combinations. A probability triangle inequality is also employed to further prune the search space and speed up the query process. The proposed probabilistic framework and techniques can be applied to both whole and subsequence queries. Extensive experimental results on real -life data sets demonstrate the effectiveness and efficiency of our approach. SYSTEM CONFIGURATION:- HARDWARE CONFIGURATION:-  Processor - Pentium –IV  Speed - 1.1 Ghz  RAM - 256 MB(min)  Hard Disk - 20 GB  Key Board - Standard Windows Keyboard  Mouse - Two or Three Button Mouse  Monitor - SVGA SOFTWARE CONFIGURATION:-  Operating System : Windows XP  Programming Language : JAVA  Java Version : JDK 1.6 & above.