SlideShare a Scribd company logo
www.flextiles.eu 
FlexTiles 
Workshop at FPL’2014 conference: FlexTiles FP7 project Low-Power DSP Accelerator Embedded in a Heterogeneous Many-Core Architecture 
Marc MORGAN 
CSEM – Swiss Center for Electronics and Microtechnology
1 / 
The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 
CSEM overview on a single slide 
•private company, founded in the 1980’s, not for profit 
•approx. 450 employees on 5 sites in Switzerland (HQ in Neuchatel) and a site Brazil 
•5 research programs: 
1.ultra-low power integrated systems (SoC, Vision, Wireless) 
2.systems engineering (med tech, instrumentation, automation) 
3.MEMS 
4.surface engineering (nano, bio, printable electronics) 
5.photovoltaic 
•approx. 70 MCHF annual budget 
•over 20 start-ups and spin-offs since 1995
2 / 
The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 
Many-core architecture: GPPs + accelerators 
An array of general purpose processors (GPP) 
Connected via a Network-on-Chip (NoC) 
Complemented with accelerators to optimize speed and power: 
DSP processors or specialized logic implemented in embedded-FPGA 
Plus memory nodes and I/O
3 / 
The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 
Many-core architecture: GPPs + accelerators (cont’d) 
Several IPs are available for the building blocks 
both in the consortium and on the market 
architectural choices attempt to retain genericity of the platform CSEM provides an ultra-low power DSP processor for the DSP accelerator It plugs into a generic accelerator interface (AI)
4 / 
The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 
Accelerator interface (AI) 
Interfaces the NoC’s NI to the accelerator by providing services: 
programming, control/status, data in, data out, debug 
DMA access, word FIFOs, notification
5 / 
The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 
DSP accelerator architecture 
Choices for the DSP accelerator avoid DSP specific features 
the DSP will not run an OS or kernel 
the DSP will not use (or at least not require) interruptions 
Note: CSEM’s icyflex4 ULP DSP could support both of the above Implement a FIFO manager to handle input and output tokens from/to the accelerator interface (AI) Implement debug and tracing facilities 
Debug: JTAG 1149.1 TAP 
Tracing: programmable tracing unit
6 / 
The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 
DSP accelerator architecture (cont’d)
7 / 
The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 
Management of the DSP accelerator 
Each accelerator is managed by software running on GPPs 
virtualization manager: attribution of the accelerator 
resource manager: control of the accelerator These managers are in charge of: 
transfer of the application (ELF) to the accelerator 
signaling the accelerator when to start and when to stop 
recovering statistics on usage of the accelerator to optimize the execution of the application on the many-core platform The tracing unit can be managed from the processor or from the JTAG interface
8 / 
The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 
Application (C code) 
C to SpearDE representation Conversion (Thales) 
Data parallelisation Mapping (Thales) 
Graphic input (manual) + C kernels 
Streaming optimisation 
(ACE) 
Compilation & Link 
(ACE) 
architecture representation 
Master Cores 
GPP 
Slave cores eFPGA, DSP 
Library of IPs 
Tool flow and Model of Computation 
Binaries 
Acc compiler or C2VHDL tools 
(CSEM / UR1 / RUB) 
Masters control slaves 
Architecture 
configuration 
GUI (KIT)
9 / 
The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 
icyflex software development kit 
GNU C compiler (gcc) v 4.6.3 
icyflex instruction parallelism supported by latest releases of gcc 
libc and libm from RedHat’s NewLib 
software implementation of IEEE floating-point standard GNU assembler / linker (binutils), v 2.20 
BFD / ELF32 object file format 
Binary, SREC, IHEX memory image file formats GNU debugger (gdb), v 6.7.1 
Mode 1: instruction set simulator of the icyflex core 
Mode 2: On-Chip Debug (OCD) through a JTAG interface icyflex instruction set simulator (ISS), written in C++ 
Phase-accurate, pipelined 
Wrappers to SystemC, VHDL (Modelsim), Matlab/Simulink Eclipse integrated development environment, v Helios 
CDT C/C++ IDE plug-in 
icyflex plug-in 
.c 
.o 
.exe 
.log 
gcc 
ld 
gdb 
gdb
10 / 
The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 
icyflex family of ultra-low power processors 
icyflex2 
Control 
Computing 
Power 
DSP 
icyflex1 
icyflex4 
1 MUL 2 MAC 4 MAC … 36 MAC 
Application 
6 μW/MHz 
25 μW/MHz 
10-150 μW/MHz 
12 MAC 
power indicated for TSMC 65 nm CMOS
11 / 
The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 
icyflex2 vs icyflex4 
Feature 
icyflex2 
icyflex4 VPS=2 
Optimized for 
Control 
DSP 
P, X, Y memory buses, 
ISA, HW loops, saturation, … 
Instruction word [bits] 
32 (1 or 2 sub) 
64 (1, 2 or 3 sub) 
Memory access [bits] 
8, 16 or 32 
2x (8, 16, 32, 64, 128) 
Data processing [bits] 
16 or 32, trunc 
2x (16 or 32 or 64), full 
Single Instr. Multiple Data (SIMD) 
No 
Yes, up to 8 MAC 
Instruction set is reconfigurable 
on the fly 
No 
Yes 
Software Development Kit (SDK) 
GNU-based tool suite (gcc, gdb) + cycle- accurate instruction set simulator (ISS) 
Hardware Devt Kit (HDK) 
FPGA-based, customizable 
VPS = Vector Processing Slices in the Vector Processing Unit of the DSP
12 / 
The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 
blank instructions 
configured at run-time 
icyflex: reconfigurable instructions and addressing modes 
Instruction set 
ADD 
MUL 
SHR 
MAC 
JUMP 
configurable 
configurable 
SHIFT 
MUX 
ALU 
ACC 
ACC 
SHIFT 
MUX 
ALU 
ACC 
ACC 
instruction decoding 
cycle N: config MOP 
cycle N+1: use MOP
13 / 
The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 
DSP in FlexTiles emulators 
Emulator 1 (software): 
Using Open Virtual Platform (OVP) 
Not cycle accurate 
The icyflex4 DSP is emulated by a GPP running at a higher frequency Emulator 2 (hardware): 
Using an FPGA board with two Xilinx Virtex6 FPGAs 
Uses a DFF version of the DSP accelerator
14 / 
The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 
Exploitation of FlexTiles results at CSEM 
CSEM specializes in low power solutions A well-balanced multi-processor design can optimize energy consumption by reducing voltage and frequency For multi-core: we offer CSEM solutions For many-core: CSEM collaborates with 1 or more of our partners 
including e.g. a follow up project to produce FlexTiles chips
15 / 
The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 
FlexTiles FP7 project 
For more information regarding the FlexTiles project, visit: 
http://www.flextiles.eu 
Please take 5 minutes to fill out the survey 
on the project web site under the Contact menu 
The FlexTiles project is funded in part by FP7, the seventh framework programme of the European Commission.
www.flextiles.eu 
FlexTiles 
Thank you for your attention! For more information: http://www.csem.ch Questions? mailto:marc.morgan@csem.ch

More Related Content

PDF
FPL'2014 - FlexTiles Workshop - 7 - FlexTiles Emulation platform
PDF
FPL'2014 - FlexTiles Workshop - 6 - FlexTiles Embedded FPGA Accelerators
PDF
FPL'2014 - FlexTiles Workshop - 8 - FlexTiles Demo
PDF
FPL'2014 - FlexTiles Workshop - 4 - FlexTiles Virtual Platform
PDF
System Design on Zynq using SDSoC
PPTX
FlexTiles Platform integrated in 19" Rack Enclosure
PPT
OSGi: Best Tool In Your Embedded Systems Toolbox
FPL'2014 - FlexTiles Workshop - 7 - FlexTiles Emulation platform
FPL'2014 - FlexTiles Workshop - 6 - FlexTiles Embedded FPGA Accelerators
FPL'2014 - FlexTiles Workshop - 8 - FlexTiles Demo
FPL'2014 - FlexTiles Workshop - 4 - FlexTiles Virtual Platform
System Design on Zynq using SDSoC
FlexTiles Platform integrated in 19" Rack Enclosure
OSGi: Best Tool In Your Embedded Systems Toolbox

What's hot (20)

PPTX
An open flow for dn ns on ultra low-power RISC-V cores
DOC
Ankit sarin
PDF
Ethercat.org industrial ethernet technologies
PDF
RISC-V & SoC Architectural Exploration for AI and ML Accelerators
PPTX
SpaceVPX Tutorial 06282016
PDF
FROM FPGA TO ASIC IMPLEMENTATION OF AN OPENRISC BASED SOC FOR VOIP APPLICATION
PPTX
EMC2 Xilinx SDSoC presentation
PDF
Preparing to program Aurora at Exascale - Early experiences and future direct...
PDF
VPX Profiles To Platforms, SIE Computing Solutions
PDF
New Trends with VME and OpenVPX - Part 2
PPT
Eclipse RT Day
PPTX
RISC-V NOEL-V - A new high performance RISC-V Processor Family
PPTX
RISC-V 30907 summit 2020 joint picocom_mentor
PDF
Energy Efficient Computing using Dynamic Tuning
PPTX
Klessydra t - designing vector coprocessors for multi-threaded edge-computing...
PDF
ARM HPC Ecosystem
PPT
BFSK RT In FPGA Thesis Pres Jps
PDF
Andes RISC-V processor solutions
PDF
DOME 64-bit μDataCenter
PDF
Building efficient 5G NR base stations with Intel® Xeon® Scalable Processors
An open flow for dn ns on ultra low-power RISC-V cores
Ankit sarin
Ethercat.org industrial ethernet technologies
RISC-V & SoC Architectural Exploration for AI and ML Accelerators
SpaceVPX Tutorial 06282016
FROM FPGA TO ASIC IMPLEMENTATION OF AN OPENRISC BASED SOC FOR VOIP APPLICATION
EMC2 Xilinx SDSoC presentation
Preparing to program Aurora at Exascale - Early experiences and future direct...
VPX Profiles To Platforms, SIE Computing Solutions
New Trends with VME and OpenVPX - Part 2
Eclipse RT Day
RISC-V NOEL-V - A new high performance RISC-V Processor Family
RISC-V 30907 summit 2020 joint picocom_mentor
Energy Efficient Computing using Dynamic Tuning
Klessydra t - designing vector coprocessors for multi-threaded edge-computing...
ARM HPC Ecosystem
BFSK RT In FPGA Thesis Pres Jps
Andes RISC-V processor solutions
DOME 64-bit μDataCenter
Building efficient 5G NR base stations with Intel® Xeon® Scalable Processors
Ad

Similar to FPL'2014 - FlexTiles Workshop - 3 - FlexTiles DSP Accelerators (20)

PPTX
Conference on Adaptive Hardware and Systems (AHS'14) - The DSP for FlexTiles
PDF
FPL'2014 - FlexTiles Workshop - 5 - FlexTiles Simulation Platform
PPT
Conference on Adaptive Hardware and Systems (AHS'14) - FlexTiles FPGA Emulation
PPTX
Conference on Adaptive Hardware and Systems (AHS'14) - Why FlexTiles uses OVP...
PDF
FPL'2014 - FlexTiles Workshop - 1 - FlexTiles Overview
PPTX
Conference on Adaptive Hardware and Systems (AHS'14) - What is FlexTiles?
PDF
Microcontrollers 8051 MSP430 notes
PDF
Int 1010 Tcp Offload
DOCX
Flexible dsp accelerator architecture exploiting carry save arithmetic
PDF
Creating an Embedded System Lab
PPTX
Gpu archi
PDF
Software Embedded 10903
PDF
Eee iv-microcontrollers [10 es42]-notes
PPTX
Project Slides for Website 2020-22.pptx
PDF
FPGA @ UPB-BGA
PDF
Resume_DigitalIC_1
PDF
Training Lecture
PDF
Cuda tutorial
PPTX
Summer training embedded system and its scope
Conference on Adaptive Hardware and Systems (AHS'14) - The DSP for FlexTiles
FPL'2014 - FlexTiles Workshop - 5 - FlexTiles Simulation Platform
Conference on Adaptive Hardware and Systems (AHS'14) - FlexTiles FPGA Emulation
Conference on Adaptive Hardware and Systems (AHS'14) - Why FlexTiles uses OVP...
FPL'2014 - FlexTiles Workshop - 1 - FlexTiles Overview
Conference on Adaptive Hardware and Systems (AHS'14) - What is FlexTiles?
Microcontrollers 8051 MSP430 notes
Int 1010 Tcp Offload
Flexible dsp accelerator architecture exploiting carry save arithmetic
Creating an Embedded System Lab
Gpu archi
Software Embedded 10903
Eee iv-microcontrollers [10 es42]-notes
Project Slides for Website 2020-22.pptx
FPGA @ UPB-BGA
Resume_DigitalIC_1
Training Lecture
Cuda tutorial
Summer training embedded system and its scope
Ad

More from FlexTiles Team (17)

PPTX
Conference on Adaptive Hardware and Systems (AHS'14) - The FlexTiles Embedded...
PDF
Conference on Adaptive Hardware and Systems (AHS'14) - The 3D FlexTiles Concept
PPTX
Adaptive Hardware and Systems (AHS'14) - FlexTiles OVP Demo
PPT
Conference on Adaptive Hardware and Systems (AHS'14) - FlexTiles Introductions
PPT
Reconfigurable 3D MultiCore Concept by Prof. Michael Hübner @ ARC 2013
PPTX
The FlexTiles Development Platform offers Dual FPGA for 3D SoC Prototyping
PDF
FlexTiles Platform
PDF
FlexTiles Development Platform
PDF
FlexTiles Platform - Xilinx Virtex-6 DUO
PPTX
INA OCMC 2012
PPTX
DATE 2012
PPTX
DAC 2012
PPTX
SAMOS 2012
PPTX
RAW 2012
PPTX
ISVLSI 2012
PPTX
Fall School on Programming Paradigms for Multi-core Embedded Systems 2012
PPTX
HiPEAC 2012
Conference on Adaptive Hardware and Systems (AHS'14) - The FlexTiles Embedded...
Conference on Adaptive Hardware and Systems (AHS'14) - The 3D FlexTiles Concept
Adaptive Hardware and Systems (AHS'14) - FlexTiles OVP Demo
Conference on Adaptive Hardware and Systems (AHS'14) - FlexTiles Introductions
Reconfigurable 3D MultiCore Concept by Prof. Michael Hübner @ ARC 2013
The FlexTiles Development Platform offers Dual FPGA for 3D SoC Prototyping
FlexTiles Platform
FlexTiles Development Platform
FlexTiles Platform - Xilinx Virtex-6 DUO
INA OCMC 2012
DATE 2012
DAC 2012
SAMOS 2012
RAW 2012
ISVLSI 2012
Fall School on Programming Paradigms for Multi-core Embedded Systems 2012
HiPEAC 2012

Recently uploaded (20)

PDF
PREDICTION OF DIABETES FROM ELECTRONIC HEALTH RECORDS
PDF
BIO-INSPIRED HORMONAL MODULATION AND ADAPTIVE ORCHESTRATION IN S-AI-GPT
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PPTX
Fundamentals of Mechanical Engineering.pptx
PDF
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PPTX
Fundamentals of safety and accident prevention -final (1).pptx
PDF
BIO-INSPIRED ARCHITECTURE FOR PARSIMONIOUS CONVERSATIONAL INTELLIGENCE : THE ...
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PDF
COURSE DESCRIPTOR OF SURVEYING R24 SYLLABUS
PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
PPTX
Current and future trends in Computer Vision.pptx
PPTX
communication and presentation skills 01
PDF
EXPLORING LEARNING ENGAGEMENT FACTORS INFLUENCING BEHAVIORAL, COGNITIVE, AND ...
PPT
Occupational Health and Safety Management System
PPTX
UNIT - 3 Total quality Management .pptx
PPTX
6ME3A-Unit-II-Sensors and Actuators_Handouts.pptx
PPTX
introduction to high performance computing
PDF
Integrating Fractal Dimension and Time Series Analysis for Optimized Hyperspe...
PREDICTION OF DIABETES FROM ELECTRONIC HEALTH RECORDS
BIO-INSPIRED HORMONAL MODULATION AND ADAPTIVE ORCHESTRATION IN S-AI-GPT
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
Fundamentals of Mechanical Engineering.pptx
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
Fundamentals of safety and accident prevention -final (1).pptx
BIO-INSPIRED ARCHITECTURE FOR PARSIMONIOUS CONVERSATIONAL INTELLIGENCE : THE ...
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
COURSE DESCRIPTOR OF SURVEYING R24 SYLLABUS
Automation-in-Manufacturing-Chapter-Introduction.pdf
Current and future trends in Computer Vision.pptx
communication and presentation skills 01
EXPLORING LEARNING ENGAGEMENT FACTORS INFLUENCING BEHAVIORAL, COGNITIVE, AND ...
Occupational Health and Safety Management System
UNIT - 3 Total quality Management .pptx
6ME3A-Unit-II-Sensors and Actuators_Handouts.pptx
introduction to high performance computing
Integrating Fractal Dimension and Time Series Analysis for Optimized Hyperspe...

FPL'2014 - FlexTiles Workshop - 3 - FlexTiles DSP Accelerators

  • 1. www.flextiles.eu FlexTiles Workshop at FPL’2014 conference: FlexTiles FP7 project Low-Power DSP Accelerator Embedded in a Heterogeneous Many-Core Architecture Marc MORGAN CSEM – Swiss Center for Electronics and Microtechnology
  • 2. 1 / The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 CSEM overview on a single slide •private company, founded in the 1980’s, not for profit •approx. 450 employees on 5 sites in Switzerland (HQ in Neuchatel) and a site Brazil •5 research programs: 1.ultra-low power integrated systems (SoC, Vision, Wireless) 2.systems engineering (med tech, instrumentation, automation) 3.MEMS 4.surface engineering (nano, bio, printable electronics) 5.photovoltaic •approx. 70 MCHF annual budget •over 20 start-ups and spin-offs since 1995
  • 3. 2 / The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 Many-core architecture: GPPs + accelerators An array of general purpose processors (GPP) Connected via a Network-on-Chip (NoC) Complemented with accelerators to optimize speed and power: DSP processors or specialized logic implemented in embedded-FPGA Plus memory nodes and I/O
  • 4. 3 / The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 Many-core architecture: GPPs + accelerators (cont’d) Several IPs are available for the building blocks both in the consortium and on the market architectural choices attempt to retain genericity of the platform CSEM provides an ultra-low power DSP processor for the DSP accelerator It plugs into a generic accelerator interface (AI)
  • 5. 4 / The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 Accelerator interface (AI) Interfaces the NoC’s NI to the accelerator by providing services: programming, control/status, data in, data out, debug DMA access, word FIFOs, notification
  • 6. 5 / The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 DSP accelerator architecture Choices for the DSP accelerator avoid DSP specific features the DSP will not run an OS or kernel the DSP will not use (or at least not require) interruptions Note: CSEM’s icyflex4 ULP DSP could support both of the above Implement a FIFO manager to handle input and output tokens from/to the accelerator interface (AI) Implement debug and tracing facilities Debug: JTAG 1149.1 TAP Tracing: programmable tracing unit
  • 7. 6 / The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 DSP accelerator architecture (cont’d)
  • 8. 7 / The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 Management of the DSP accelerator Each accelerator is managed by software running on GPPs virtualization manager: attribution of the accelerator resource manager: control of the accelerator These managers are in charge of: transfer of the application (ELF) to the accelerator signaling the accelerator when to start and when to stop recovering statistics on usage of the accelerator to optimize the execution of the application on the many-core platform The tracing unit can be managed from the processor or from the JTAG interface
  • 9. 8 / The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 Application (C code) C to SpearDE representation Conversion (Thales) Data parallelisation Mapping (Thales) Graphic input (manual) + C kernels Streaming optimisation (ACE) Compilation & Link (ACE) architecture representation Master Cores GPP Slave cores eFPGA, DSP Library of IPs Tool flow and Model of Computation Binaries Acc compiler or C2VHDL tools (CSEM / UR1 / RUB) Masters control slaves Architecture configuration GUI (KIT)
  • 10. 9 / The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 icyflex software development kit GNU C compiler (gcc) v 4.6.3 icyflex instruction parallelism supported by latest releases of gcc libc and libm from RedHat’s NewLib software implementation of IEEE floating-point standard GNU assembler / linker (binutils), v 2.20 BFD / ELF32 object file format Binary, SREC, IHEX memory image file formats GNU debugger (gdb), v 6.7.1 Mode 1: instruction set simulator of the icyflex core Mode 2: On-Chip Debug (OCD) through a JTAG interface icyflex instruction set simulator (ISS), written in C++ Phase-accurate, pipelined Wrappers to SystemC, VHDL (Modelsim), Matlab/Simulink Eclipse integrated development environment, v Helios CDT C/C++ IDE plug-in icyflex plug-in .c .o .exe .log gcc ld gdb gdb
  • 11. 10 / The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 icyflex family of ultra-low power processors icyflex2 Control Computing Power DSP icyflex1 icyflex4 1 MUL 2 MAC 4 MAC … 36 MAC Application 6 μW/MHz 25 μW/MHz 10-150 μW/MHz 12 MAC power indicated for TSMC 65 nm CMOS
  • 12. 11 / The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 icyflex2 vs icyflex4 Feature icyflex2 icyflex4 VPS=2 Optimized for Control DSP P, X, Y memory buses, ISA, HW loops, saturation, … Instruction word [bits] 32 (1 or 2 sub) 64 (1, 2 or 3 sub) Memory access [bits] 8, 16 or 32 2x (8, 16, 32, 64, 128) Data processing [bits] 16 or 32, trunc 2x (16 or 32 or 64), full Single Instr. Multiple Data (SIMD) No Yes, up to 8 MAC Instruction set is reconfigurable on the fly No Yes Software Development Kit (SDK) GNU-based tool suite (gcc, gdb) + cycle- accurate instruction set simulator (ISS) Hardware Devt Kit (HDK) FPGA-based, customizable VPS = Vector Processing Slices in the Vector Processing Unit of the DSP
  • 13. 12 / The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 blank instructions configured at run-time icyflex: reconfigurable instructions and addressing modes Instruction set ADD MUL SHR MAC JUMP configurable configurable SHIFT MUX ALU ACC ACC SHIFT MUX ALU ACC ACC instruction decoding cycle N: config MOP cycle N+1: use MOP
  • 14. 13 / The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 DSP in FlexTiles emulators Emulator 1 (software): Using Open Virtual Platform (OVP) Not cycle accurate The icyflex4 DSP is emulated by a GPP running at a higher frequency Emulator 2 (hardware): Using an FPGA board with two Xilinx Virtex6 FPGAs Uses a DFF version of the DSP accelerator
  • 15. 14 / The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 Exploitation of FlexTiles results at CSEM CSEM specializes in low power solutions A well-balanced multi-processor design can optimize energy consumption by reducing voltage and frequency For multi-core: we offer CSEM solutions For many-core: CSEM collaborates with 1 or more of our partners including e.g. a follow up project to produce FlexTiles chips
  • 16. 15 / The information contained in this document and any attachments are the property of FlexTiles consortium. You are hereby notified that any review, dissemination, distribution, copying or otherwise use of this document must be done in accordance with the CA of the project (TRT/DJ/624412785.2011). Template version 1.0 FlexTiles FP7 project For more information regarding the FlexTiles project, visit: http://www.flextiles.eu Please take 5 minutes to fill out the survey on the project web site under the Contact menu The FlexTiles project is funded in part by FP7, the seventh framework programme of the European Commission.
  • 17. www.flextiles.eu FlexTiles Thank you for your attention! For more information: http://www.csem.ch Questions? mailto:marc.morgan@csem.ch