SlideShare a Scribd company logo
1
andrea.solazzo@mail.polimi.it
CNN2ECST
Andrea Solazzo
Matteo De Silvestri
Irene De Rose
matteo.desilvestri@mail.polimi.it
irene.derose@mail.polimi.it
Thursday, March 17, 2016
XOHW16 Meeting
2
HW acceleration
Convolutional Neural Networks have a data-flow
computation pattern that results to be highly suitable for
hardware acceleration
Zhang, Chen, et al. "Optimizing fpga-based accelerator design for deep convolutional neural networks." Proceedings of the 2015 ACM/SIGDA
International Symposium on Field-Programmable Gate Arrays. ACM, 2015.
3
Why FPGA?
● CNNs have a huge design space
● Finding the “optimal” model requires some tuning
● Many “degrees of freedom” (#layers, #neurons, …)
3
Why FPGA?
✓ Reconfigurability allows to implement
different models and select the best one
directly in hardware
● CNNs have a huge design space
● Finding the “optimal” model requires some tuning
● Many “degrees of freedom” (#layers, #neurons, …)
4
A. Dundar, J. Jin, V. Gokhale, B. Krishnamurthy, A. Canziani, B. Martini, and E. Culurciello. Accelerating Deep Neural Networks on Mobile
Processor with Embedded Programmable Logic. In Proc. NIPS’13, 2013.
Why FPGA?
5
A. Dundar, J. Jin, V. Gokhale, B. Krishnamurthy, A. Canziani, B. Martini, and E. Culurciello. Accelerating Deep Neural Networks on Mobile
Processor with Embedded Programmable Logic. In Proc. NIPS’13, 2013.
The proper trade-off
between performances
and power consumption
reflects on a high
embeddable factor,
calculated as
performance over Watt
(GOP/s /W)
Why FPGA?
7
CNN2ECST
CNNECST-Convolutional Neural Network
(www.facebook.com/cnn2ecst)
@cnn2ecst
(www.twitter.com/cnn2ecst)

More Related Content

PDF
"Efficient Implementation of Convolutional Neural Networks using OpenCL on FP...
PDF
"Trade-offs in Implementing Deep Neural Networks on FPGAs," a Presentation fr...
PDF
"Fast Deployment of Low-power Deep Learning on CEVA Vision Processors," a Pre...
PDF
"Accelerating Deep Learning Using Altera FPGAs," a Presentation from Intel
PDF
"Using SGEMM and FFTs to Accelerate Deep Learning," a Presentation from ARM
PDF
Accelerate Machine Learning Software on Intel Architecture
PDF
Performance Optimization of Deep Learning Frameworks Caffe* and Tensorflow* f...
PDF
Deep learning: Hardware Landscape
"Efficient Implementation of Convolutional Neural Networks using OpenCL on FP...
"Trade-offs in Implementing Deep Neural Networks on FPGAs," a Presentation fr...
"Fast Deployment of Low-power Deep Learning on CEVA Vision Processors," a Pre...
"Accelerating Deep Learning Using Altera FPGAs," a Presentation from Intel
"Using SGEMM and FFTs to Accelerate Deep Learning," a Presentation from ARM
Accelerate Machine Learning Software on Intel Architecture
Performance Optimization of Deep Learning Frameworks Caffe* and Tensorflow* f...
Deep learning: Hardware Landscape

What's hot (20)

PDF
Deep learning with FPGA
PDF
Deep Learning Accelerator Design Techniques
PPTX
Multicore Intel Processors Performance Evaluation
PDF
AI is Impacting HPC Everywhere
PDF
A Platform for Accelerating Machine Learning Applications
PPTX
Serving BERT Models in Production with TorchServe
PDF
Hardware Acceleration for Machine Learning
PDF
Use C++ and Intel® Threading Building Blocks (Intel® TBB) for Hardware Progra...
PDF
NVIDIA 深度學習教育機構 (DLI): Approaches to object detection
PPTX
AI Hardware Landscape 2021
PDF
Massively Parallel K-Nearest Neighbor Computation on Distributed Architectures
PDF
“Introduction to the TVM Open Source Deep Learning Compiler Stack,” a Present...
PDF
MIT's experience on OpenPOWER/POWER 9 platform
PDF
OpenCL - The Open Standard for Heterogeneous Parallel Programming
PDF
NVIDIA 深度學習教育機構 (DLI): Neural network deployment
PDF
HPC Accelerating Combustion Engine Design
PDF
Rethinking computation: A processor architecture for machine intelligence
PDF
White Paper - CEVA-XM4 Intelligent Vision Processor
PDF
ECP Application Development
PDF
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
Deep learning with FPGA
Deep Learning Accelerator Design Techniques
Multicore Intel Processors Performance Evaluation
AI is Impacting HPC Everywhere
A Platform for Accelerating Machine Learning Applications
Serving BERT Models in Production with TorchServe
Hardware Acceleration for Machine Learning
Use C++ and Intel® Threading Building Blocks (Intel® TBB) for Hardware Progra...
NVIDIA 深度學習教育機構 (DLI): Approaches to object detection
AI Hardware Landscape 2021
Massively Parallel K-Nearest Neighbor Computation on Distributed Architectures
“Introduction to the TVM Open Source Deep Learning Compiler Stack,” a Present...
MIT's experience on OpenPOWER/POWER 9 platform
OpenCL - The Open Standard for Heterogeneous Parallel Programming
NVIDIA 深度學習教育機構 (DLI): Neural network deployment
HPC Accelerating Combustion Engine Design
Rethinking computation: A processor architecture for machine intelligence
White Paper - CEVA-XM4 Intelligent Vision Processor
ECP Application Development
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
Ad

Viewers also liked (20)

PDF
The Truth About Metal Music
PPTX
Market Research Efx
PDF
Xây dựng giao diện website dựa trên mã nguồn joomla(tiếp theo)
PPTX
Apresentação da COESCOLA - Aprendizagem Livre e Colaborativa
DOCX
OEE Canyon Guide Training Checklist (1)
PPT
Storia degli scorpions
PPTX
How to deal with cs work
PDF
Adverteren op Facebook: Geavanceerde campagne-optimalisatie en analyse
PDF
Xub magis republic day edition vol1
PDF
Introducción a la biología - Célula
PDF
GeospatialDataAnalysis
PDF
Klikkrant GO! - 20100309
DOCX
Guía pensandolo bien
PPTX
EEON103 Хичээл 13
PPTX
Forrester & Perficient on SharePoint as a Social Business Platform
PDF
Understanding Veeam Methodologies and impact on Storage I/O - in persian
PPTX
7/27/16 Deep Learning Top 5
PPT
Head hunter 23.09.2010
PPSX
How to deal with deadlines
DOC
Market research case indian paints limited
The Truth About Metal Music
Market Research Efx
Xây dựng giao diện website dựa trên mã nguồn joomla(tiếp theo)
Apresentação da COESCOLA - Aprendizagem Livre e Colaborativa
OEE Canyon Guide Training Checklist (1)
Storia degli scorpions
How to deal with cs work
Adverteren op Facebook: Geavanceerde campagne-optimalisatie en analyse
Xub magis republic day edition vol1
Introducción a la biología - Célula
GeospatialDataAnalysis
Klikkrant GO! - 20100309
Guía pensandolo bien
EEON103 Хичээл 13
Forrester & Perficient on SharePoint as a Social Business Platform
Understanding Veeam Methodologies and impact on Storage I/O - in persian
7/27/16 Deep Learning Top 5
Head hunter 23.09.2010
How to deal with deadlines
Market research case indian paints limited
Ad

Similar to 2. Cnnecst-Why the use of FPGA? (20)

PDF
Deep Learning Initiative @ NECSTLab
PPTX
CNNECST: an FPGA-based approach for the hardware acceleration of Convolutiona...
PPTX
CNNECST: an FPGA-based approach for the hardware acceleration of Convolutiona...
PPTX
NNECST: an FPGA-based approach for the hardware acceleration of Convolutional...
PPTX
CNN Dataflow implementation on FPGAs
PPTX
CNN Dataflow Implementation on FPGAs
PPTX
CNN Dataflow Implementation on FPGAs
PDF
Professional Project - C++ OpenCL - Platform agnostic hardware acceleration ...
PDF
ACCELERATED DEEP LEARNING INFERENCE FROM CONSTRAINED EMBEDDED DEVICES
PDF
Challenges and Opportunities of FPGA Acceleration in Big Data
PDF
FPGA Hardware Accelerator for Machine Learning
PPTX
realtime_ai_systems_academia.pptx
PPTX
Morph : a novel accelerator
PDF
Smart Data Slides: Emerging Hardware Choices for Modern AI Data Management
PDF
Imaging automotive 2015 addfor v002
PDF
Imaging automotive 2015 addfor v002
PDF
Deep Learning Update May 2016
PDF
electronics-11-03883.pdf
PDF
A Framework with Cloud Integration for CNN Acceleration on FPGA Devices
PDF
FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...
Deep Learning Initiative @ NECSTLab
CNNECST: an FPGA-based approach for the hardware acceleration of Convolutiona...
CNNECST: an FPGA-based approach for the hardware acceleration of Convolutiona...
NNECST: an FPGA-based approach for the hardware acceleration of Convolutional...
CNN Dataflow implementation on FPGAs
CNN Dataflow Implementation on FPGAs
CNN Dataflow Implementation on FPGAs
Professional Project - C++ OpenCL - Platform agnostic hardware acceleration ...
ACCELERATED DEEP LEARNING INFERENCE FROM CONSTRAINED EMBEDDED DEVICES
Challenges and Opportunities of FPGA Acceleration in Big Data
FPGA Hardware Accelerator for Machine Learning
realtime_ai_systems_academia.pptx
Morph : a novel accelerator
Smart Data Slides: Emerging Hardware Choices for Modern AI Data Management
Imaging automotive 2015 addfor v002
Imaging automotive 2015 addfor v002
Deep Learning Update May 2016
electronics-11-03883.pdf
A Framework with Cloud Integration for CNN Acceleration on FPGA Devices
FCCM2020: High-Throughput Convolutional Neural Network on an FPGA by Customiz...

Recently uploaded (20)

PPTX
STEEL- intro-1.pptxhejwjenwnwnenemwmwmwm
PPTX
Wireless and Mobile Backhaul Market.pptx
PPTX
title _yeOPC_Poisoning_Presentation.pptx
PPTX
quadraticequations-111211090004-phpapp02.pptx
PPTX
Embedded for Artificial Intelligence 1.pptx
PDF
Smarter Security: How Door Access Control Works with Alarms & CCTV
DOCX
fsdffdghjjgfxfdghjvhjvgfdfcbchghgghgcbjghf
PDF
How NGOs Save Costs with Affordable IT Rentals
PPTX
Computers and mobile device: Evaluating options for home and work
PPT
Hypersensitivity Namisha1111111111-WPS.ppt
PPT
Lines and angles cbse class 9 math chemistry
PDF
Prescription1 which to be used for periodo
PPTX
making presentation that do no stick.pptx
DOCX
A PROPOSAL ON IoT climate sensor 2.docx
PPTX
PROGRAMMING-QUARTER-2-PYTHON.pptxnsnsndn
PPTX
Nanokeyer nano keyekr kano ketkker nano keyer
PPT
chapter_1_a.ppthduushshwhwbshshshsbbsbsbsbsh
PPTX
Prograce_Present.....ggation_Simple.pptx
PPTX
KVL KCL ppt electrical electronics eee tiet
PPTX
sdn_based_controller_for_mobile_network_traffic_management1.pptx
STEEL- intro-1.pptxhejwjenwnwnenemwmwmwm
Wireless and Mobile Backhaul Market.pptx
title _yeOPC_Poisoning_Presentation.pptx
quadraticequations-111211090004-phpapp02.pptx
Embedded for Artificial Intelligence 1.pptx
Smarter Security: How Door Access Control Works with Alarms & CCTV
fsdffdghjjgfxfdghjvhjvgfdfcbchghgghgcbjghf
How NGOs Save Costs with Affordable IT Rentals
Computers and mobile device: Evaluating options for home and work
Hypersensitivity Namisha1111111111-WPS.ppt
Lines and angles cbse class 9 math chemistry
Prescription1 which to be used for periodo
making presentation that do no stick.pptx
A PROPOSAL ON IoT climate sensor 2.docx
PROGRAMMING-QUARTER-2-PYTHON.pptxnsnsndn
Nanokeyer nano keyekr kano ketkker nano keyer
chapter_1_a.ppthduushshwhwbshshshsbbsbsbsbsh
Prograce_Present.....ggation_Simple.pptx
KVL KCL ppt electrical electronics eee tiet
sdn_based_controller_for_mobile_network_traffic_management1.pptx

2. Cnnecst-Why the use of FPGA?

  • 1. 1 andrea.solazzo@mail.polimi.it CNN2ECST Andrea Solazzo Matteo De Silvestri Irene De Rose matteo.desilvestri@mail.polimi.it irene.derose@mail.polimi.it Thursday, March 17, 2016 XOHW16 Meeting
  • 2. 2 HW acceleration Convolutional Neural Networks have a data-flow computation pattern that results to be highly suitable for hardware acceleration Zhang, Chen, et al. "Optimizing fpga-based accelerator design for deep convolutional neural networks." Proceedings of the 2015 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays. ACM, 2015.
  • 3. 3 Why FPGA? ● CNNs have a huge design space ● Finding the “optimal” model requires some tuning ● Many “degrees of freedom” (#layers, #neurons, …)
  • 4. 3 Why FPGA? ✓ Reconfigurability allows to implement different models and select the best one directly in hardware ● CNNs have a huge design space ● Finding the “optimal” model requires some tuning ● Many “degrees of freedom” (#layers, #neurons, …)
  • 5. 4 A. Dundar, J. Jin, V. Gokhale, B. Krishnamurthy, A. Canziani, B. Martini, and E. Culurciello. Accelerating Deep Neural Networks on Mobile Processor with Embedded Programmable Logic. In Proc. NIPS’13, 2013. Why FPGA?
  • 6. 5 A. Dundar, J. Jin, V. Gokhale, B. Krishnamurthy, A. Canziani, B. Martini, and E. Culurciello. Accelerating Deep Neural Networks on Mobile Processor with Embedded Programmable Logic. In Proc. NIPS’13, 2013. The proper trade-off between performances and power consumption reflects on a high embeddable factor, calculated as performance over Watt (GOP/s /W) Why FPGA?