SlideShare a Scribd company logo
Energy-Efficient Reduce-and-Rank Using Input-Adaptive
Approximations
ABSTRACT:
Approximate computing is an emerging design paradigm that exploits the intrinsic
ability of applications to produce acceptable outputs even when their computations
are executed approximately. In this paper, we explore approximate computing for a
key computation pattern, reduce-and rank (RnR), which is prevalent in a wide
range of workloads, including video processing, recognition, search, and data
mining. An RnR kernel performs a reduction operation (e.g., distance computation,
dot product, and L1-norm) between an input vector and each of a set of reference
vectors, and ranks the reduction outputs to select the top reference vectors for the
current input. We propose three complementary approximation strategies for the
RnR computation pattern. The first is interleaved reduction and-ranking, wherein
the vector reductions are decomposed into multiple partial reductions and
interleaved with the rank computation. Leveraging this transformation, we propose
the use of intermediate reduction results and ranks to identify future computations
that are likely to have a low impact on the output, and can, hence, be
approximated. The second strategy, input similarity-based approximation, exploits
the spatial or temporal correlation of inputs (e.g., pixels of an image or frames of a
video) to identify computations that are amenable to approximation. The third
strategy, reference vector reordering, rearranges the order in which the reference
vectors are processed such that vectors that are relatively more critical in
evaluating the correct output, are processed at the beginning of RnR operation. The
number of these critical reference vectors is usually small, which renders a
substantial portion of the total computation to be amenable to approximation.
These strategies address a key challenge in approximate computing—identification
of which computations to approximate—and may be used to drive any
approximation mechanism, such as computation skipping or precision scaling to
realize performance and energy improvements. A second key challenge in
approximate computing is that the extent to which computations can be
approximated varies significantly from application to application, and across inputs
for even a single application. Hence, input-adaptive approximation, or the ability to
automatically modulate the degree of approximation based on the nature of each
individual input, is essential for obtaining optimal energy savings. In addition, to
enable quality configurability in RnR kernels, we propose a kernel-level quality
metric that correlates well to application-level quality, and identify key parameters
that can be used to tune the proposed approximation strategies dynamically. We
develop a runtime framework that modulates the identified parameters during the
execution of RnR kernels to minimize their energy while meeting a given target
quality. To evaluate the proposed concepts, we designed quality-configurable
hardware implementations of six RnR-based applications from the recognition,
mining, search, and video processing application domains in 45-nm technology.
The proposed architecture of this paper analysis the logic size, area and power
consumption using Xilinx 14.2.
SOFTWARE IMPLEMENTATION:
 Modelsim
 Xilinx ISE

More Related Content

PDF
Keep Calm and React with Foresight: Strategies for Low-Latency and Energy-Eff...
PDF
Power Consumption and Energy Estimation in Smartphones
PDF
Flowchart design for algorithms
PDF
IRJET- A Comparative Forecasting Analysis of ARIMA Model Vs Random Forest Alg...
PDF
Implementation of Radix-4 Booth Multiplier by VHDL
PDF
Designing of multilevel dpfc to improve power quality
PDF
Comparative Study on the Performance of A Coherency-based Simple Dynamic Equi...
Keep Calm and React with Foresight: Strategies for Low-Latency and Energy-Eff...
Power Consumption and Energy Estimation in Smartphones
Flowchart design for algorithms
IRJET- A Comparative Forecasting Analysis of ARIMA Model Vs Random Forest Alg...
Implementation of Radix-4 Booth Multiplier by VHDL
Designing of multilevel dpfc to improve power quality
Comparative Study on the Performance of A Coherency-based Simple Dynamic Equi...

What's hot (18)

PDF
IRJET-Load Forecasting using Fuzzy Logic
PPTX
K10620 digvijay
PDF
Power system state estimation using teaching learning-based optimization algo...
PPTX
ALGORITHM LEVEL ANALYSIS and DATA CORRELATION
PDF
A Brief Survey of Current Power Limiting Strategies
PDF
Reduced carrier PWM scheme with unified logical expressions for reduced switc...
DOCX
Two step optimization approach for the design of multiplierless linear-phase ...
DOCX
A novel control method for transformerless H bridge cascaded STATCOM with sta...
PDF
A verilog based simulation methodology for estimating statistical test for th...
PPT
MPC Tuning Based On Desired Frequency Domain Closed Loop Response
PDF
Gfs crane dc measuring pu at multiple levels
PDF
paper11
PDF
A Research on Optimal Power Flow Solutions For Variable Loa
PPTX
Time domain harmonic state estimation using filtered measurements based on fo...
PPT
Short-term Load Forecasting based on Neural network and Local RegressionJie
DOCX
SECURE OPTIMIZATION COMPUTATION OUTSOURCING IN CLOUD COMPUTING: A CASE STUDY ...
PDF
A MULTI-OBJECTIVE PERSPECTIVE FOR OPERATOR SCHEDULING USING FINEGRAINED DVS A...
PDF
Model Selection Using Conformal Predictors
IRJET-Load Forecasting using Fuzzy Logic
K10620 digvijay
Power system state estimation using teaching learning-based optimization algo...
ALGORITHM LEVEL ANALYSIS and DATA CORRELATION
A Brief Survey of Current Power Limiting Strategies
Reduced carrier PWM scheme with unified logical expressions for reduced switc...
Two step optimization approach for the design of multiplierless linear-phase ...
A novel control method for transformerless H bridge cascaded STATCOM with sta...
A verilog based simulation methodology for estimating statistical test for th...
MPC Tuning Based On Desired Frequency Domain Closed Loop Response
Gfs crane dc measuring pu at multiple levels
paper11
A Research on Optimal Power Flow Solutions For Variable Loa
Time domain harmonic state estimation using filtered measurements based on fo...
Short-term Load Forecasting based on Neural network and Local RegressionJie
SECURE OPTIMIZATION COMPUTATION OUTSOURCING IN CLOUD COMPUTING: A CASE STUDY ...
A MULTI-OBJECTIVE PERSPECTIVE FOR OPERATOR SCHEDULING USING FINEGRAINED DVS A...
Model Selection Using Conformal Predictors
Ad

Similar to Energy-Efficient Reduce-and-Rank Using Input-Adaptive Approximations (20)

PDF
APPROXIMATE ARITHMETIC CIRCUIT DESIGN FOR ERROR RESILIENT APPLICATIONS
PDF
APPROXIMATE ARITHMETIC CIRCUIT DESIGN FOR ERROR RESILIENT APPLICATIONS
PDF
APPROXIMATE ARITHMETIC CIRCUIT DESIGN FOR ERROR RESILIENT APPLICATIONS
PDF
Approximation techniques used for general purpose algorithms
PDF
Download full ebook of Approximate Computing Weiqiang Liu instant download pdf
PDF
Low Power Context Aware Hierarchical System Design
PDF
Thesis small
DOCX
Input-Based Dynamic Reconfiguration of Approximate Arithmetic Units for Video...
PDF
Tutorial-on-DNN-09A-Co-design-Sparsity.pdf
PDF
Final Report - Major Project - MAP
PDF
JJ_Thesis
PDF
A Dynamic Programming Approach to Energy-Efficient Scheduling on Multi-FPGA b...
PPT
"An adaptive modular approach to the mining of sensor network ...
PDF
Daniela thesis
PDF
Tutorial-on-DNN-07-Co-design-Precision.pdf
PDF
Architectureaware Optimization Strategies In Realtime Image Processing Ballaa...
PDF
AI optimizing HPC simulations (presentation from 6th EULAG Workshop)
PDF
matconvnet-manual.pdf
PDF
Deep learning_ adaptive computation and machine learning ( PDFDrive ).pdf
PDF
Matconvnet manual
APPROXIMATE ARITHMETIC CIRCUIT DESIGN FOR ERROR RESILIENT APPLICATIONS
APPROXIMATE ARITHMETIC CIRCUIT DESIGN FOR ERROR RESILIENT APPLICATIONS
APPROXIMATE ARITHMETIC CIRCUIT DESIGN FOR ERROR RESILIENT APPLICATIONS
Approximation techniques used for general purpose algorithms
Download full ebook of Approximate Computing Weiqiang Liu instant download pdf
Low Power Context Aware Hierarchical System Design
Thesis small
Input-Based Dynamic Reconfiguration of Approximate Arithmetic Units for Video...
Tutorial-on-DNN-09A-Co-design-Sparsity.pdf
Final Report - Major Project - MAP
JJ_Thesis
A Dynamic Programming Approach to Energy-Efficient Scheduling on Multi-FPGA b...
"An adaptive modular approach to the mining of sensor network ...
Daniela thesis
Tutorial-on-DNN-07-Co-design-Precision.pdf
Architectureaware Optimization Strategies In Realtime Image Processing Ballaa...
AI optimizing HPC simulations (presentation from 6th EULAG Workshop)
matconvnet-manual.pdf
Deep learning_ adaptive computation and machine learning ( PDFDrive ).pdf
Matconvnet manual
Ad

More from JAYAPRAKASH JPINFOTECH (20)

PDF
Java Web Application Project Titles 2023-2024.pdf
PDF
Dot Net Final Year IEEE Project Titles.pdf
PDF
MATLAB Final Year IEEE Project Titles 2023 - 2024.pdf
PDF
Python IEEE Project Titles 2023 - 2024.pdf
PDF
Python ieee project titles 2021 - 2022 | Machine Learning Final Year Project...
DOCX
Spammer detection and fake user Identification on Social Networks
DOCX
Sentiment Classification using N-gram IDF and Automated Machine Learning
DOCX
Privacy-Preserving Social Media DataPublishing for Personalized Ranking-Based...
DOCX
FunkR-pDAE: Personalized Project Recommendation Using Deep Learning
DOCX
Discovering the Type 2 Diabetes in Electronic Health Records using the Sparse...
DOCX
Crop Yield Prediction and Efficient use of Fertilizers
DOCX
Collaborative Filtering-based Electricity Plan Recommender System
DOCX
Achieving Data Truthfulness and Privacy Preservation in Data Markets
DOCX
V2V Routing in a VANET Based on the Auto regressive Integrated Moving Average...
DOCX
Towards Fast and Reliable Multi-hop Routing in VANETs
DOCX
Selective Authentication Based Geographic Opportunistic Routing in Wireless S...
DOCX
Robust Defense Scheme Against Selective DropAttack in Wireless Ad Hoc Networks
DOCX
Privacy-Preserving Cloud-based Road Condition Monitoring with Source Authenti...
DOCX
Novel Intrusion Detection and Prevention for Mobile Ad Hoc Networks
DOCX
Node-Level Trust Evaluation in Wireless Sensor Networks
Java Web Application Project Titles 2023-2024.pdf
Dot Net Final Year IEEE Project Titles.pdf
MATLAB Final Year IEEE Project Titles 2023 - 2024.pdf
Python IEEE Project Titles 2023 - 2024.pdf
Python ieee project titles 2021 - 2022 | Machine Learning Final Year Project...
Spammer detection and fake user Identification on Social Networks
Sentiment Classification using N-gram IDF and Automated Machine Learning
Privacy-Preserving Social Media DataPublishing for Personalized Ranking-Based...
FunkR-pDAE: Personalized Project Recommendation Using Deep Learning
Discovering the Type 2 Diabetes in Electronic Health Records using the Sparse...
Crop Yield Prediction and Efficient use of Fertilizers
Collaborative Filtering-based Electricity Plan Recommender System
Achieving Data Truthfulness and Privacy Preservation in Data Markets
V2V Routing in a VANET Based on the Auto regressive Integrated Moving Average...
Towards Fast and Reliable Multi-hop Routing in VANETs
Selective Authentication Based Geographic Opportunistic Routing in Wireless S...
Robust Defense Scheme Against Selective DropAttack in Wireless Ad Hoc Networks
Privacy-Preserving Cloud-based Road Condition Monitoring with Source Authenti...
Novel Intrusion Detection and Prevention for Mobile Ad Hoc Networks
Node-Level Trust Evaluation in Wireless Sensor Networks

Recently uploaded (20)

PDF
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PPTX
202450812 BayCHI UCSC-SV 20250812 v17.pptx
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PPTX
Institutional Correction lecture only . . .
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PDF
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PPTX
GDM (1) (1).pptx small presentation for students
PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PPTX
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
PDF
A systematic review of self-coping strategies used by university students to ...
PDF
VCE English Exam - Section C Student Revision Booklet
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PPTX
Lesson notes of climatology university.
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
202450812 BayCHI UCSC-SV 20250812 v17.pptx
STATICS OF THE RIGID BODIES Hibbelers.pdf
Institutional Correction lecture only . . .
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
GDM (1) (1).pptx small presentation for students
O7-L3 Supply Chain Operations - ICLT Program
Module 4: Burden of Disease Tutorial Slides S2 2025
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
A systematic review of self-coping strategies used by university students to ...
VCE English Exam - Section C Student Revision Booklet
O5-L3 Freight Transport Ops (International) V1.pdf
Lesson notes of climatology university.
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf

Energy-Efficient Reduce-and-Rank Using Input-Adaptive Approximations

  • 1. Energy-Efficient Reduce-and-Rank Using Input-Adaptive Approximations ABSTRACT: Approximate computing is an emerging design paradigm that exploits the intrinsic ability of applications to produce acceptable outputs even when their computations are executed approximately. In this paper, we explore approximate computing for a key computation pattern, reduce-and rank (RnR), which is prevalent in a wide range of workloads, including video processing, recognition, search, and data mining. An RnR kernel performs a reduction operation (e.g., distance computation, dot product, and L1-norm) between an input vector and each of a set of reference vectors, and ranks the reduction outputs to select the top reference vectors for the current input. We propose three complementary approximation strategies for the RnR computation pattern. The first is interleaved reduction and-ranking, wherein the vector reductions are decomposed into multiple partial reductions and interleaved with the rank computation. Leveraging this transformation, we propose the use of intermediate reduction results and ranks to identify future computations that are likely to have a low impact on the output, and can, hence, be approximated. The second strategy, input similarity-based approximation, exploits the spatial or temporal correlation of inputs (e.g., pixels of an image or frames of a video) to identify computations that are amenable to approximation. The third
  • 2. strategy, reference vector reordering, rearranges the order in which the reference vectors are processed such that vectors that are relatively more critical in evaluating the correct output, are processed at the beginning of RnR operation. The number of these critical reference vectors is usually small, which renders a substantial portion of the total computation to be amenable to approximation. These strategies address a key challenge in approximate computing—identification of which computations to approximate—and may be used to drive any approximation mechanism, such as computation skipping or precision scaling to realize performance and energy improvements. A second key challenge in approximate computing is that the extent to which computations can be approximated varies significantly from application to application, and across inputs for even a single application. Hence, input-adaptive approximation, or the ability to automatically modulate the degree of approximation based on the nature of each individual input, is essential for obtaining optimal energy savings. In addition, to enable quality configurability in RnR kernels, we propose a kernel-level quality metric that correlates well to application-level quality, and identify key parameters that can be used to tune the proposed approximation strategies dynamically. We develop a runtime framework that modulates the identified parameters during the execution of RnR kernels to minimize their energy while meeting a given target quality. To evaluate the proposed concepts, we designed quality-configurable hardware implementations of six RnR-based applications from the recognition,
  • 3. mining, search, and video processing application domains in 45-nm technology. The proposed architecture of this paper analysis the logic size, area and power consumption using Xilinx 14.2. SOFTWARE IMPLEMENTATION:  Modelsim  Xilinx ISE