SlideShare a Scribd company logo
CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249)
MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com
Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com
K-SUBSPACES QUANTIZATION FOR APPROXIMATE NEAREST
NEIGHBOR SEARCH.
Abstract
Approximate Nearest Neighbor (ANN) search has become a popular approach
for performing fast and efficient retrieval on very large-scale datasets in recent
years, as the size and dimension of data grow continuously. In this paper, we
propose a novel vector quantization method for ANN search which enables
faster and more accurate retrieval on publicly available datasets. We define
vector quantization as a multiple affine subspace learning problem and explore
the quantization centroids on multiple affine subspaces. We propose an
iterative approach to minimize the quantization error in order to create a novel
quantization scheme, which outperforms the state-of-the-art algorithms. The
computational cost of our method is also comparable to that of the competing
methods.
CONCLUSION
In this study a novel vector quantization algorithm is proposed for the
approximate nearest neighbor search problem. The proposed method explores
the quantization centers in affine subspaces through an iterative technique,
which jointly attempts to minimize the quantization error of the training
samples in the learnt subspaces, while minimizing the projection error of the
samples to the corresponding subspaces. The proposed method has proven to
CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249)
MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com
Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com
outperform the state-of-the-art-methods, with comparable computational cost
and additional storage. In this paper it is also shown that, dimension reduction
is an important source of quantization error, and by exploiting subspace
clustering techniques the quantization error can be reduced, leading to a
better quantization performance. So far we have focused mainly on exhaustive
search but an index-based non-exhaustive extension for the propose method
can be further investigated. Our approach can also be extended to labeled
datasets in order to test k-nearest neighbor classification performance. These
will be the topics of our future work.
REFERENCES
[1] P. Indyk and R. Motwani, “Approximate nearest neighbors: Towards
removing the curse of dimensionality,” in Proc. 30th Annu. ACM Symp. Theory
Comput., 1998, pp. 604–613.
[2] J. Wang, H. T. Shen, J. Song, and J. Ji, “Hashing for similarity search: A
survey,” arXiv preprint, 2014, p. 1408.2927.
[3] M. Datar, N. Immorlica, P. Indyk, and V. S. Mirrokni, “Localitysensitive
hashing scheme based on P-stable distributions,” in Proc. 20th Annu. Symp.
Comput. Geom., 2004, pp. 253–262.
[4] K. Terasawa and Y. Tanaka, “Spherical LSH for approximate nearest
neighbor search on unit hypersphere,” in Proc. 10th Int. Conf. Algorithms Data
Struct., 2007, pp. 27–38.
CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249)
MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com
Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com
[5] X. He, D. Cai, S. Yan, and H. Zhang, “Neighborhood preserving embedding,”
in Proc. 10th IEEE Int. Conf. Comput. Vis., 2005, pp. 1208–1213.
[6] H. Jegou, M. Douze, C. Schmid, and P. Perez, “Aggregating local descriptors
into a compact image representation,” in Proc. IEEE Conf. Comput. Vis. Pattern
Recog., 2010, pp. 3304–3311.
[7] J. Heo, Y. Lee, and J. He, “Spherical hashing,” in Proc. IEEE Conf. Comput.
Vis. Pattern Recog., 2012, pp. 2957–2964.
[8] A. Gordo, F. Perronnin, Y. Gong, and S. Lazebnik, “Asymmetric distances for
binary embeddings,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 36, no. 1, pp.
33–47, Jan. 2014.
[9] W. Dong, M. Charikar, and K. Li, “Asymmetric distance estimation with
sketches for similarity search in high-dimensional spaces,” in Proc. 31st Annu.
Int. ACM SIGIR Conf. Res. Develop. Inf. Retrieval, 2008, pp. 123–130.
[10] S. Lloyd, “Least squares quantization in PCM,” IEEE Trans. Inf. Theory, vol.
28, no. 2, pp. 129–137, Mar. 1982.

More Related Content

PDF
K NEAREST NEIGHBOUR JOINS FOR BIG DATA ON MAPREDUCE: A THEORETICAL AND EXPER...
DOCX
Clustering big spatiotemporal interval data
PDF
SCALABLE SEMI-SUPERVISED LEARNING BY EFFICIENT ANCHOR GRAPH REGULARIZATION
PDF
EPAS: A SAMPLING BASED SIMILARITY IDENTIFICATION ALGORITHM FOR THE CLOUD
PDF
Poster Final
PPTX
Climate Extremes Workshop - Networks and Extremes: Review and Further Studies...
PDF
INVERTED LINEAR QUADTREE: EFFICIENT TOP K SPATIAL KEYWORD SEARCH
DOCX
Density maximization for improving graph matching with its applications
K NEAREST NEIGHBOUR JOINS FOR BIG DATA ON MAPREDUCE: A THEORETICAL AND EXPER...
Clustering big spatiotemporal interval data
SCALABLE SEMI-SUPERVISED LEARNING BY EFFICIENT ANCHOR GRAPH REGULARIZATION
EPAS: A SAMPLING BASED SIMILARITY IDENTIFICATION ALGORITHM FOR THE CLOUD
Poster Final
Climate Extremes Workshop - Networks and Extremes: Review and Further Studies...
INVERTED LINEAR QUADTREE: EFFICIENT TOP K SPATIAL KEYWORD SEARCH
Density maximization for improving graph matching with its applications

What's hot (19)

PDF
Using parallel hierarchical clustering to
PDF
A LOCALITY SENSITIVE LOW-RANK MODEL FOR IMAGE TAG COMPLETION
DOCX
Matching of large images through coupled decomposition
PPTX
Undergraduate Modeling Workshop - Forest Cover Working Group Final Presentati...
PDF
ONLINE SUBGRAPH SKYLINE ANALYSIS OVER KNOWLEDGE GRAPHS
PDF
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
PDF
Towards Glyph-based Visualizations for Big Data Clustering
DOCX
IEEE 2014 MATLAB IMAGE PROCESSING PROJECTS Multi illuminant estimation with c...
DOCX
ACE–AN EFFECTIVE ANTI-FORENSIC CONTRAST ENHANCEMENT TECHNIQUE
PDF
PROXIES FOR SHORTEST PATH AND DISTANCE QUERIES
PDF
Graph based Clustering
PDF
Matlab adaptive image search with hash codes
PPTX
Fuzzy grained analysis
PPT
environmental scivis via dynamic and thematc mapping
PPTX
EfficientNet
PDF
Unit Commitment with Lagrange Relaxation Using Particle Swarm Optimization
DOCX
AN INFORMATION THEORY-BASED FEATURE SELECTIONFRAMEWORK FOR BIG DATA UNDER APA...
PPTX
Causal impact study
PDF
Fast and scalable range query processing with strong privacy protection for c...
Using parallel hierarchical clustering to
A LOCALITY SENSITIVE LOW-RANK MODEL FOR IMAGE TAG COMPLETION
Matching of large images through coupled decomposition
Undergraduate Modeling Workshop - Forest Cover Working Group Final Presentati...
ONLINE SUBGRAPH SKYLINE ANALYSIS OVER KNOWLEDGE GRAPHS
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Towards Glyph-based Visualizations for Big Data Clustering
IEEE 2014 MATLAB IMAGE PROCESSING PROJECTS Multi illuminant estimation with c...
ACE–AN EFFECTIVE ANTI-FORENSIC CONTRAST ENHANCEMENT TECHNIQUE
PROXIES FOR SHORTEST PATH AND DISTANCE QUERIES
Graph based Clustering
Matlab adaptive image search with hash codes
Fuzzy grained analysis
environmental scivis via dynamic and thematc mapping
EfficientNet
Unit Commitment with Lagrange Relaxation Using Particle Swarm Optimization
AN INFORMATION THEORY-BASED FEATURE SELECTIONFRAMEWORK FOR BIG DATA UNDER APA...
Causal impact study
Fast and scalable range query processing with strong privacy protection for c...
Ad

Similar to K-SUBSPACES QUANTIZATION FOR APPROXIMATE NEAREST NEIGHBOR SEARCH (20)

DOCX
A hybrid approach to clustering in big data
PDF
IEEE Datamining 2016 Title and Abstract
PDF
IEEE Big data 2016 Title and Abstract
PDF
ME Synopsis
PDF
Cost Estimation Predictive Modeling: Regression versus Neural Network
PDF
CLUSTERING DATA STREAMS BASED ON SHARED DENSITY BETWEEN MICRO-CLUSTERS
PDF
Estimating project development effort using clustered regression approach
PDF
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACH
PDF
RESOLVING MULTI-PARTY PRIVACY CONFLICTS IN SOCIAL MEDIA
PDF
Drsp dimension reduction for similarity matching and pruning of time series ...
DOCX
CAN SENSORS COLLECT BIG DATA? AN ENERGY EFFICIENT BIG DATA GATHERING ALGORIT...
DOCX
Collaboration and fairness-aware big data management in distributed clouds
PDF
Web image annotation by diffusion maps manifold learning algorithm
DOCX
Hashedcubes simple, low memory, real time visual
DOCX
Hashedcubes simple, low memory, real time visual
PDF
Research Proposal
PDF
Performance Analysis and Parallelization of CosineSimilarity of Documents
PDF
Critical Paths Identification on Fuzzy Network Project
DOCX
Improving viability of electric taxis by taxi service strategy optimization a...
PDF
Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...
A hybrid approach to clustering in big data
IEEE Datamining 2016 Title and Abstract
IEEE Big data 2016 Title and Abstract
ME Synopsis
Cost Estimation Predictive Modeling: Regression versus Neural Network
CLUSTERING DATA STREAMS BASED ON SHARED DENSITY BETWEEN MICRO-CLUSTERS
Estimating project development effort using clustered regression approach
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACH
RESOLVING MULTI-PARTY PRIVACY CONFLICTS IN SOCIAL MEDIA
Drsp dimension reduction for similarity matching and pruning of time series ...
CAN SENSORS COLLECT BIG DATA? AN ENERGY EFFICIENT BIG DATA GATHERING ALGORIT...
Collaboration and fairness-aware big data management in distributed clouds
Web image annotation by diffusion maps manifold learning algorithm
Hashedcubes simple, low memory, real time visual
Hashedcubes simple, low memory, real time visual
Research Proposal
Performance Analysis and Parallelization of CosineSimilarity of Documents
Critical Paths Identification on Fuzzy Network Project
Improving viability of electric taxis by taxi service strategy optimization a...
Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...
Ad

More from Nexgen Technology (20)

DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CH...
DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHENN...
DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
DOCX
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHENNA...
DOCX
Ieee 2020 21 vlsi projects in pondicherry,ieee vlsi projects in chennai
DOCX
Ieee 2020 21 power electronics in pondicherry,Ieee 2020 21 power electronics
DOCX
Ieee 2020 -21 ns2 in pondicherry, Ieee 2020 -21 ns2 projects,best project cen...
DOCX
Ieee 2020 21 ns2 in pondicherry,best project center in pondicherry,final year...
DOCX
Ieee 2020 21 java dotnet in pondicherry,final year projects in pondicherry,pr...
DOCX
Ieee 2020 21 iot in pondicherry,final year projects in pondicherry,project ce...
DOCX
Ieee 2020 21 blockchain in pondicherry,final year projects in pondicherry,bes...
DOCX
Ieee 2020 -21 bigdata in pondicherry,project center in pondicherry,best proje...
DOCX
Ieee 2020 21 embedded in pondicherry,final year projects in pondicherry,best...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CH...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHENN...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHE...
MECHANICAL PROJECTS IN PONDICHERRY, 2020-21 MECHANICAL PROJECTS IN CHENNA...
Ieee 2020 21 vlsi projects in pondicherry,ieee vlsi projects in chennai
Ieee 2020 21 power electronics in pondicherry,Ieee 2020 21 power electronics
Ieee 2020 -21 ns2 in pondicherry, Ieee 2020 -21 ns2 projects,best project cen...
Ieee 2020 21 ns2 in pondicherry,best project center in pondicherry,final year...
Ieee 2020 21 java dotnet in pondicherry,final year projects in pondicherry,pr...
Ieee 2020 21 iot in pondicherry,final year projects in pondicherry,project ce...
Ieee 2020 21 blockchain in pondicherry,final year projects in pondicherry,bes...
Ieee 2020 -21 bigdata in pondicherry,project center in pondicherry,best proje...
Ieee 2020 21 embedded in pondicherry,final year projects in pondicherry,best...

Recently uploaded (20)

PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PPTX
Cell Structure & Organelles in detailed.
PDF
Microbial disease of the cardiovascular and lymphatic systems
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
master seminar digital applications in india
PDF
Complications of Minimal Access Surgery at WLH
PPTX
Cell Types and Its function , kingdom of life
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
O7-L3 Supply Chain Operations - ICLT Program
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
PDF
Classroom Observation Tools for Teachers
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
STATICS OF THE RIGID BODIES Hibbelers.pdf
Cell Structure & Organelles in detailed.
Microbial disease of the cardiovascular and lymphatic systems
Final Presentation General Medicine 03-08-2024.pptx
master seminar digital applications in india
Complications of Minimal Access Surgery at WLH
Cell Types and Its function , kingdom of life
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Pharmacology of Heart Failure /Pharmacotherapy of CHF
Anesthesia in Laparoscopic Surgery in India
O7-L3 Supply Chain Operations - ICLT Program
human mycosis Human fungal infections are called human mycosis..pptx
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
Classroom Observation Tools for Teachers
O5-L3 Freight Transport Ops (International) V1.pdf
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...

K-SUBSPACES QUANTIZATION FOR APPROXIMATE NEAREST NEIGHBOR SEARCH

  • 1. CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249) MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com K-SUBSPACES QUANTIZATION FOR APPROXIMATE NEAREST NEIGHBOR SEARCH. Abstract Approximate Nearest Neighbor (ANN) search has become a popular approach for performing fast and efficient retrieval on very large-scale datasets in recent years, as the size and dimension of data grow continuously. In this paper, we propose a novel vector quantization method for ANN search which enables faster and more accurate retrieval on publicly available datasets. We define vector quantization as a multiple affine subspace learning problem and explore the quantization centroids on multiple affine subspaces. We propose an iterative approach to minimize the quantization error in order to create a novel quantization scheme, which outperforms the state-of-the-art algorithms. The computational cost of our method is also comparable to that of the competing methods. CONCLUSION In this study a novel vector quantization algorithm is proposed for the approximate nearest neighbor search problem. The proposed method explores the quantization centers in affine subspaces through an iterative technique, which jointly attempts to minimize the quantization error of the training samples in the learnt subspaces, while minimizing the projection error of the samples to the corresponding subspaces. The proposed method has proven to
  • 2. CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249) MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com outperform the state-of-the-art-methods, with comparable computational cost and additional storage. In this paper it is also shown that, dimension reduction is an important source of quantization error, and by exploiting subspace clustering techniques the quantization error can be reduced, leading to a better quantization performance. So far we have focused mainly on exhaustive search but an index-based non-exhaustive extension for the propose method can be further investigated. Our approach can also be extended to labeled datasets in order to test k-nearest neighbor classification performance. These will be the topics of our future work. REFERENCES [1] P. Indyk and R. Motwani, “Approximate nearest neighbors: Towards removing the curse of dimensionality,” in Proc. 30th Annu. ACM Symp. Theory Comput., 1998, pp. 604–613. [2] J. Wang, H. T. Shen, J. Song, and J. Ji, “Hashing for similarity search: A survey,” arXiv preprint, 2014, p. 1408.2927. [3] M. Datar, N. Immorlica, P. Indyk, and V. S. Mirrokni, “Localitysensitive hashing scheme based on P-stable distributions,” in Proc. 20th Annu. Symp. Comput. Geom., 2004, pp. 253–262. [4] K. Terasawa and Y. Tanaka, “Spherical LSH for approximate nearest neighbor search on unit hypersphere,” in Proc. 10th Int. Conf. Algorithms Data Struct., 2007, pp. 27–38.
  • 3. CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249) MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com [5] X. He, D. Cai, S. Yan, and H. Zhang, “Neighborhood preserving embedding,” in Proc. 10th IEEE Int. Conf. Comput. Vis., 2005, pp. 1208–1213. [6] H. Jegou, M. Douze, C. Schmid, and P. Perez, “Aggregating local descriptors into a compact image representation,” in Proc. IEEE Conf. Comput. Vis. Pattern Recog., 2010, pp. 3304–3311. [7] J. Heo, Y. Lee, and J. He, “Spherical hashing,” in Proc. IEEE Conf. Comput. Vis. Pattern Recog., 2012, pp. 2957–2964. [8] A. Gordo, F. Perronnin, Y. Gong, and S. Lazebnik, “Asymmetric distances for binary embeddings,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 36, no. 1, pp. 33–47, Jan. 2014. [9] W. Dong, M. Charikar, and K. Li, “Asymmetric distance estimation with sketches for similarity search in high-dimensional spaces,” in Proc. 31st Annu. Int. ACM SIGIR Conf. Res. Develop. Inf. Retrieval, 2008, pp. 123–130. [10] S. Lloyd, “Least squares quantization in PCM,” IEEE Trans. Inf. Theory, vol. 28, no. 2, pp. 129–137, Mar. 1982.