SlideShare a Scribd company logo
Distributed In-Memory Processing of All k Nearest Neighbor Queries
Abstract:
A wide spectrum of Internet-scale mobile applications, ranging from social
networking, gaming and entertainment to emergency response and crisis
management, all require efficient and scalable All k Nearest Neighbor (AkNN)
computations over millions of moving objects every few seconds to be
operational. Most traditional techniques for computing AkNN queries are
centralized, lacking both scalability and efficiency. Only recently, distributed
techniques for shared-nothing cloud infrastructures have been proposed to
achieve scalability for large datasets. These batch-oriented algorithms are sub-
optimal due to inefficient data space partitioning and data replication among
processing units. In this paper, we present Spitfire, a distributed algorithm that
provides a scalable and high-performance AkNN processing framework. Our
proposed algorithm deploys a fast load-balanced partitioning scheme along with
an efficient replication-set selection algorithm, to provide fast main-memory
computations of the exact AkNN results in a batch-oriented manner. We evaluate,
both analytically and experimentally, how the pruning efficiency of the Spitfire
algorithm plays a pivotal role in reducing communication and response time up to
an order of magnitude, compared to three other state-of-the-art distributed
AkNN algorithms executed in distributed main-memory.

More Related Content

PDF
Secure power grid simulation on cloud
DOCX
Cross cloud map reduce for big data
DOCX
Fast Communication-efficient Spectral Clustering Over Distributed Data
PDF
Ahmed Absi slides bigbwa
PPTX
Presented by Ahmed Abdulhakim Al-Absi - Scaling map reduce applications acro...
PDF
A modeling approach for cloud infrastructure planning considering dependabili...
PDF
Dotnet modeling and optimizing the performance- security tradeoff on d-ncs u...
PPTX
The Impact of Cloud Computing on Predictive Analytics 7-29-09 v5
Secure power grid simulation on cloud
Cross cloud map reduce for big data
Fast Communication-efficient Spectral Clustering Over Distributed Data
Ahmed Absi slides bigbwa
Presented by Ahmed Abdulhakim Al-Absi - Scaling map reduce applications acro...
A modeling approach for cloud infrastructure planning considering dependabili...
Dotnet modeling and optimizing the performance- security tradeoff on d-ncs u...
The Impact of Cloud Computing on Predictive Analytics 7-29-09 v5

What's hot (14)

DOCX
A time efficient approach for detecting errors in big sensor data on cloud
DOCX
PDF
MACHINE LEARNING ON MAPREDUCE FRAMEWORK
PPTX
Data Protection & Wireless Security Advisor
PPTX
Probabilistic Programming for Dynamic Data Assimilation on an Agent-Based Model
PPTX
EnviroInfo 2013: Energy Efficiency in Cloud Software Architectures
PDF
Secure optimization computation outsourcing in cloud computing a case study o...
PDF
Practical conflict graphs in the wild
PPTX
Spectral Clustering
DOCX
A time efficient approach for detecting errors in big sensor data on cloud
PDF
Resume_Dec_16
DOCX
SECURE OPTIMIZATION COMPUTATION OUTSOURCING IN CLOUD COMPUTING: A CASE STUDY ...
PDF
EnBIS 2016 opening
PDF
A rough set-based incremental approach for updating approximations under dyna...
A time efficient approach for detecting errors in big sensor data on cloud
MACHINE LEARNING ON MAPREDUCE FRAMEWORK
Data Protection & Wireless Security Advisor
Probabilistic Programming for Dynamic Data Assimilation on an Agent-Based Model
EnviroInfo 2013: Energy Efficiency in Cloud Software Architectures
Secure optimization computation outsourcing in cloud computing a case study o...
Practical conflict graphs in the wild
Spectral Clustering
A time efficient approach for detecting errors in big sensor data on cloud
Resume_Dec_16
SECURE OPTIMIZATION COMPUTATION OUTSOURCING IN CLOUD COMPUTING: A CASE STUDY ...
EnBIS 2016 opening
A rough set-based incremental approach for updating approximations under dyna...
Ad

Viewers also liked (15)

PDF
Reading Group Presentation: Why Eve and Mallory Love Android
PPT
Bag charm presentation linked in
PPTX
Geek Sync I SQL Server 2016 Performance Tricks You Need to Know
PPTX
Ghid de optimizare site web On page
PPT
Exclusive Marketing insights from the 2008 Barack Obama Presidential Campaign
PPTX
Freelance - The Flourishing New Gig Economy
PPTX
Measuring Front-End Performance - What, When and How?
PPT
Managing a Website Redesign
PDF
"Отказоустойчивый standby PostgreSQL (HAProxy + PgBouncer)" Виктор Ягофаров (...
PPTX
"GESTION DOCUMENTAL"
PPTX
security problems in the tcp/ip protocol suite
PPT
Η ηγεμονία της σπάρτης :Μια κυριαρχία σε αμφισβήτηση
PPT
Seerat e nabi (s.a.w.w ) in makkah
PPTX
Replacement Problem
PPTX
Apriori algorithm
Reading Group Presentation: Why Eve and Mallory Love Android
Bag charm presentation linked in
Geek Sync I SQL Server 2016 Performance Tricks You Need to Know
Ghid de optimizare site web On page
Exclusive Marketing insights from the 2008 Barack Obama Presidential Campaign
Freelance - The Flourishing New Gig Economy
Measuring Front-End Performance - What, When and How?
Managing a Website Redesign
"Отказоустойчивый standby PostgreSQL (HAProxy + PgBouncer)" Виктор Ягофаров (...
"GESTION DOCUMENTAL"
security problems in the tcp/ip protocol suite
Η ηγεμονία της σπάρτης :Μια κυριαρχία σε αμφισβήτηση
Seerat e nabi (s.a.w.w ) in makkah
Replacement Problem
Apriori algorithm
Ad

Similar to Distributed in memory processing of all k nearest neighbor queries (20)

PDF
IEEE Networking 2016 Title and Abstract
DOCX
Ieee transactions on 2018 network and service management
DOCX
An optimization framework for mobile data collection in energy harvesting wir...
PDF
IEEE Emerging topic in computing Title and Abstract 2016
PDF
Automated LiveMigration of VMs
DOCX
JAVA 2013 IEEE NETWORKING PROJECT Harvesting aware energy management for time...
DOCX
Harvesting aware energy management for time-critical wireless sensor networks
DOCX
Ns2 2015 2016 ieee project list-(v)_with abstract(S3 Infotech:9884848198)
PDF
Data Partitioning in Mongo DB with Cloud
PPTX
Anchor : A Versatile and Efficient Framework for Resource Management in the C...
PDF
Mca & diplamo java titles
PDF
Mca & diplamo java titles
PDF
Mca & diplamo java titles
PDF
Mca & diplamo java titles
PDF
Mca & diplamo java titles
PDF
Mca & diplamo java titles
PDF
Mca & diplamo java titles
PDF
Mca & diplamo java titles
PDF
Mca & diplamo java titles
PDF
Mca & diplamo java titles
IEEE Networking 2016 Title and Abstract
Ieee transactions on 2018 network and service management
An optimization framework for mobile data collection in energy harvesting wir...
IEEE Emerging topic in computing Title and Abstract 2016
Automated LiveMigration of VMs
JAVA 2013 IEEE NETWORKING PROJECT Harvesting aware energy management for time...
Harvesting aware energy management for time-critical wireless sensor networks
Ns2 2015 2016 ieee project list-(v)_with abstract(S3 Infotech:9884848198)
Data Partitioning in Mongo DB with Cloud
Anchor : A Versatile and Efficient Framework for Resource Management in the C...
Mca & diplamo java titles
Mca & diplamo java titles
Mca & diplamo java titles
Mca & diplamo java titles
Mca & diplamo java titles
Mca & diplamo java titles
Mca & diplamo java titles
Mca & diplamo java titles
Mca & diplamo java titles
Mca & diplamo java titles

More from ieeepondy (20)

PDF
Demand aware network function placement
PDF
Service description in the nfv revolution trends, challenges and a way forward
PDF
Spatial related traffic sign inspection for inventory purposes using mobile l...
PDF
Standards for hybrid clouds
PDF
Rfhoc a random forest approach to auto-tuning hadoop's configuration
PDF
Resource and instance hour minimization for deadline constrained dag applicat...
PDF
Reliable and confidential cloud storage with efficient data forwarding functi...
PDF
Rebuttal to “comments on ‘control cloud data access privilege and anonymity w...
PDF
Scalable cloud–sensor architecture for the internet of things
PDF
Scalable algorithms for nearest neighbor joins on big trajectory data
PDF
Robust workload and energy management for sustainable data centers
PDF
Privacy preserving deep computation model on cloud for big data feature learning
PDF
Pricing the cloud ieee projects, ieee projects chennai, ieee projects 2016,ie...
PDF
Protection of big data privacy
PDF
Power optimization with bler constraint for wireless fronthauls in c ran
PDF
Performance aware cloud resource allocation via fitness-enabled auction
PDF
Performance limitations of a text search application running in cloud instances
PDF
Performance analysis and optimal cooperative cluster size for randomly distri...
PDF
Predictive control for energy aware consolidation in cloud datacenters
PDF
Over flow multi site aware big data management for scientific workflows on cl...
Demand aware network function placement
Service description in the nfv revolution trends, challenges and a way forward
Spatial related traffic sign inspection for inventory purposes using mobile l...
Standards for hybrid clouds
Rfhoc a random forest approach to auto-tuning hadoop's configuration
Resource and instance hour minimization for deadline constrained dag applicat...
Reliable and confidential cloud storage with efficient data forwarding functi...
Rebuttal to “comments on ‘control cloud data access privilege and anonymity w...
Scalable cloud–sensor architecture for the internet of things
Scalable algorithms for nearest neighbor joins on big trajectory data
Robust workload and energy management for sustainable data centers
Privacy preserving deep computation model on cloud for big data feature learning
Pricing the cloud ieee projects, ieee projects chennai, ieee projects 2016,ie...
Protection of big data privacy
Power optimization with bler constraint for wireless fronthauls in c ran
Performance aware cloud resource allocation via fitness-enabled auction
Performance limitations of a text search application running in cloud instances
Performance analysis and optimal cooperative cluster size for randomly distri...
Predictive control for energy aware consolidation in cloud datacenters
Over flow multi site aware big data management for scientific workflows on cl...

Recently uploaded (20)

PDF
Pre independence Education in Inndia.pdf
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
TR - Agricultural Crops Production NC III.pdf
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
Computing-Curriculum for Schools in Ghana
PPTX
GDM (1) (1).pptx small presentation for students
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PPTX
Institutional Correction lecture only . . .
PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PPTX
Cell Structure & Organelles in detailed.
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
Pre independence Education in Inndia.pdf
Supply Chain Operations Speaking Notes -ICLT Program
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
TR - Agricultural Crops Production NC III.pdf
2.FourierTransform-ShortQuestionswithAnswers.pdf
Final Presentation General Medicine 03-08-2024.pptx
Computing-Curriculum for Schools in Ghana
GDM (1) (1).pptx small presentation for students
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Institutional Correction lecture only . . .
O7-L3 Supply Chain Operations - ICLT Program
Abdominal Access Techniques with Prof. Dr. R K Mishra
O5-L3 Freight Transport Ops (International) V1.pdf
human mycosis Human fungal infections are called human mycosis..pptx
FourierSeries-QuestionsWithAnswers(Part-A).pdf
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
Cell Structure & Organelles in detailed.
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx

Distributed in memory processing of all k nearest neighbor queries

  • 1. Distributed In-Memory Processing of All k Nearest Neighbor Queries Abstract: A wide spectrum of Internet-scale mobile applications, ranging from social networking, gaming and entertainment to emergency response and crisis management, all require efficient and scalable All k Nearest Neighbor (AkNN) computations over millions of moving objects every few seconds to be operational. Most traditional techniques for computing AkNN queries are centralized, lacking both scalability and efficiency. Only recently, distributed techniques for shared-nothing cloud infrastructures have been proposed to achieve scalability for large datasets. These batch-oriented algorithms are sub- optimal due to inefficient data space partitioning and data replication among processing units. In this paper, we present Spitfire, a distributed algorithm that provides a scalable and high-performance AkNN processing framework. Our proposed algorithm deploys a fast load-balanced partitioning scheme along with an efficient replication-set selection algorithm, to provide fast main-memory computations of the exact AkNN results in a batch-oriented manner. We evaluate, both analytically and experimentally, how the pruning efficiency of the Spitfire algorithm plays a pivotal role in reducing communication and response time up to an order of magnitude, compared to three other state-of-the-art distributed AkNN algorithms executed in distributed main-memory.