SlideShare a Scribd company logo
Polaris
A workflow to manage allocation and relocation of tasks in a reconfigurable architecture Final goal: complete architecture (bitstreams) generation Polaris
Management of 2D Reconfiguration in a Reconfigurable System Massimo Morandi [email_address]
Outline Introduction Problem description  Project Goals and Contributions Project in details Phases Results Future Work
Problem Description New Generation of FPGAs Virtex-4 and Virtex-5 Allow bi-dimensional reconfiguration This permits to: Better exploit reconfigurable area Obtain modules performance optimizations More complex management: Handle one more degree of freedom Avoid more fragmentation Perform good placement choices to keep low TRR Keep acceptable intra-module routing paths
Project Goals and Contributions Analyze effects of 2D reconfiguration New advantages New problems Examine possible solutions to new problems Explore literature to find promising ideas Evaluate those solutions in various scenarios Propose a new solution Combining ideas from literature with new ones Obtaining good cost-quality tradeoff
Setting and Advantages Definition Definition of the setting: 2D self partial dynamical run-time reconfiguration Analysis of the advantages of 2D Reconfiguration In area usage and performance
2D Fragmentation Problem Analysis of the 2D-fragmentation problem Area generally more fragmented Can nullify the area optimizations obtained
Placement Decisions Analysis of 2D placement choices effects: Again, bad choices can lead to performance loss
Allocation manager Definition of allocation manager desired features: Low TRR Low management overhead High routing efficiency Low fragmentation Definition of allocation manager structure: Empty space manager Complete space  Heuristic selection Fitter General (FF,BL,BF,WF…) Focused (FA,RA… )
Most relevant works Maintain complete information on empty space: KAMER: Keep All Maximally Empty Rectangles Apply a general fitting strategy CUR: Maintain the Countour of a Union of Rectangles Apply a focused fitting strategy Heuristically prune part of the information: KNER: Keep Non-overlapping Empty Rectangles Apply a general fitting strategy 2D-HASHING: Keep Non-ov. Empty Rectangles in optimized data structure Apply (exclusively) a general fitting strategy
Evaluation and Proposed Approach Proposed Approach Heuristic (KNER-like) empty space manager, to keep low complexity for use in a self-reconfigurable system Fitting strategy focused on minimizing routing paths, to maintain high performance of the reconfigurable system (chosen metric to minimize Manhattan distance) High placement quality => high complexity Lowest compl. => no focused fitting (bad especially for routing)
Structure of the allocation manager Task, defined by: Arrival time, ASAP, (ALAP), H, W, Latency, Communicating Tasks Hosted in a queue which also adds a pointer to the rectangle where it is placed Reconfigurable Device, represented as: Binary Tree structure, each node is a Rectangle, each leaf is an empty Rectangle. Navigation trough pointers to left child, right child, next leaf and a function to find previous leaf (for bookkeeping after split or merge) Rectangle, defined by: X, Y, H, W Initially one, (X,Y)=(0,0), H=FPGA Rows, W=FPGA Cols
The Placement Algorithm
Experimental Results Benchmark of 100 randomly generated tasks: Size (5% to 25% of FPGA), randomly interconnected Execution time: 3x less than CUR, close to KNER Communication cost: 3x less than KNER, close to CUR Task Rejection Rate: all solutions quite close
Future Work Apply the proposed solution to self reconfiguration: Adapt the algorithm to run on the internal processor Create a validation reconfigurable architecture Integrate the architecture with relocation Tune the algorithm to improve results: Experiment techniques to reduce TRR Try to optimize the code to have an algorithm with lower running time Evaluate other fitting strategies
Questions?
Relocation for 2D Reconfigurable Systems Marco Novati [email_address]
Project Outline Introduction Problem description Project Goals  Project in details Phases  Results What’s next
Problem   Description Self Dynamical Runtime 2D Reconfiguration Xilinx Virtex-4 and Virtex-5 Relocation, different solutions Software Hardware We chose an hardware solution BiRF Square
Project Goals Study of the new FPGA Families Examination of Xilinx documentation on V4 and V5 Analysis of the new bitstream structure Generation of V4 and V5 bitstream Development of the new version of BiRF Implementation Validation
New Frame Addressing: Possibility of addressing rows and columns Frame Addressing (1/2)
Frame Addressing (2/2)
New Parser
CRC Calculation Particular CRC value, used by Xilinx tools Two version of BiRF Square: By using the “predefined” value With actual CRC calculation An optimized algorithm has been used
Synthesis results On a Virtex-4 with speed grade -12 General purpose version: max frequency of 160 MHz Specific version:  max frequency of 290 Mhz
Target Device
Validation Architecture
Results (1/2) BiRF Square Permits apply relocation in a  self partially and dynamically 2D-reconfigurable system The occupation ratio is relatively small Frequency more than acceptable Reduction of internal memory requirements
Results (2/2) Throughput  of 7,3 MB/s: A total configuration file size is about 1 MB Considering an architecture: 1/3 of the area as fixed part  2/3 as reconfigurable part with 6 slots With such hypothesis Size of a partial bitstream will be about 110 KB Relocation time of about 15 ms
What’s Next Future improvements: Direct access to the memory (DMA)  Direct manipulation of the bitstream Portability Integration with ICAP Elimination of the relocation overhead  Relocation time << reconfiguration time Future work: Provide a simulation framework to monitor the reconfigurable system evolution and to evaluate different choices The final goal: Creation of a real architecture that exploits self partial and dynamical 2D-reconfiguration,with relocation
Questions

More Related Content

PPT
Distributed systems scheduling
PPT
Chap2 slides
PPTX
Grds conferences icst and icbelsh (5)
PPT
Os5
PPTX
Feedback queuing models for time shared systems
PDF
Reduced Complexity Transfer Function Computation for Complex Indoor Channels ...
PPTX
Feedback Queueing Models for Time Shared Systems
PPTX
Hadoop deconstructing map reduce job step by step
Distributed systems scheduling
Chap2 slides
Grds conferences icst and icbelsh (5)
Os5
Feedback queuing models for time shared systems
Reduced Complexity Transfer Function Computation for Complex Indoor Channels ...
Feedback Queueing Models for Time Shared Systems
Hadoop deconstructing map reduce job step by step

What's hot (20)

PDF
Keep Calm and React with Foresight: Strategies for Low-Latency and Energy-Eff...
PPTX
Adaptive Execution Support for Malleable Computation
PPT
Chap1 slides
PPTX
Model compression
PPTX
(Slides) Task scheduling algorithm for multicore processor system for minimiz...
PPT
Communication
PDF
CFD Project
PDF
Hadoop secondary sort and a custom comparator
PPTX
DFA minimization algorithms in map reduce
PDF
PRETZEL: Opening the Black Box of Machine Learning Prediction Serving Systems
PPT
How CFD can help gain the most efficient data centre designs
PDF
slot_shifting
PPTX
Aggarwal Draft
PDF
Cfd simulation of flow heat and mass transfer
PPTX
MapReduce Paradigm
PDF
Self-adaptive container monitoring with performance-aware Load-Shedding policies
PDF
MapReduce: Simplified Data Processing On Large Clusters
PDF
4838281 operating-system-scheduling-on-multicore-architectures
PPTX
층류 익형의 설계 최적화
PDF
Airspace configuration using_air_traffic_complexity_metrics
Keep Calm and React with Foresight: Strategies for Low-Latency and Energy-Eff...
Adaptive Execution Support for Malleable Computation
Chap1 slides
Model compression
(Slides) Task scheduling algorithm for multicore processor system for minimiz...
Communication
CFD Project
Hadoop secondary sort and a custom comparator
DFA minimization algorithms in map reduce
PRETZEL: Opening the Black Box of Machine Learning Prediction Serving Systems
How CFD can help gain the most efficient data centre designs
slot_shifting
Aggarwal Draft
Cfd simulation of flow heat and mass transfer
MapReduce Paradigm
Self-adaptive container monitoring with performance-aware Load-Shedding policies
MapReduce: Simplified Data Processing On Large Clusters
4838281 operating-system-scheduling-on-multicore-architectures
층류 익형의 설계 최적화
Airspace configuration using_air_traffic_complexity_metrics
Ad

Viewers also liked (6)

PDF
Ley 594 de 2000
PDF
Reverse i xtreme3d
DOCX
Mely sis 0005+informatica+forense
PPTX
Intrucciones para....
PPT
El Vagón Del Amor (Carabajal, cáceres, Vasquez y Rovaretti)
PDF
Old Stanford Farm Branch Palo Alto, CA After
Ley 594 de 2000
Reverse i xtreme3d
Mely sis 0005+informatica+forense
Intrucciones para....
El Vagón Del Amor (Carabajal, cáceres, Vasquez y Rovaretti)
Old Stanford Farm Branch Palo Alto, CA After
Ad

Similar to 3D-DRESD Polaris (20)

PPT
3rd 3DDRESD: BiRF
PPT
UIC Thesis Novati
PPT
UIC Thesis Morandi
PPT
3rd 3DDRESD: Polaris
PPT
UIC Thesis Montone
PPT
3rd 3DDRESD: Floorplacer
PPT
UIC Panella Thesis
PPT
3rd 3DDRESD: DReAMS
PPT
HPPS - Final - 06/14/2007
PPT
UIC Thesis Candiloro
PPT
MPHD RC Overview
PDF
Performance Evaluation of FPGA Based Runtime Dynamic Partial Reconfiguration ...
PDF
IRJET- Review on Dynamic Reconfiguration of Filters for Signal Processing
PDF
Performance Evaluation of FPGA Based Runtime Dynamic Partial Reconfiguration ...
PPT
3D-DRESD DReAMS
PPT
RCW@DEI - Design Flow 4 SoPc
PPTX
Research challenges in Reconfigurable Computing
PDF
Selective fitting strategy based real time placement algorithm for dynamicall...
PPS
Interconnect Architectures
PPT
HPPS 2007 Projects Presentation
3rd 3DDRESD: BiRF
UIC Thesis Novati
UIC Thesis Morandi
3rd 3DDRESD: Polaris
UIC Thesis Montone
3rd 3DDRESD: Floorplacer
UIC Panella Thesis
3rd 3DDRESD: DReAMS
HPPS - Final - 06/14/2007
UIC Thesis Candiloro
MPHD RC Overview
Performance Evaluation of FPGA Based Runtime Dynamic Partial Reconfiguration ...
IRJET- Review on Dynamic Reconfiguration of Filters for Signal Processing
Performance Evaluation of FPGA Based Runtime Dynamic Partial Reconfiguration ...
3D-DRESD DReAMS
RCW@DEI - Design Flow 4 SoPc
Research challenges in Reconfigurable Computing
Selective fitting strategy based real time placement algorithm for dynamicall...
Interconnect Architectures
HPPS 2007 Projects Presentation

More from Marco Santambrogio (20)

PPT
RCIM 2008 - - hArtes Atmel
PDF
RCIM 2008 - - UniCal
PDF
RCIM 2008 - - ALTERA
PDF
DHow2 - L6 VHDL
PDF
DHow2 - L6 Ant
PDF
DHow2 - L5
PDF
RCIM 2008 - - ALaRI
PDF
RCIM 2008 - Modello Scheduling
PDF
RCIM 2008 - HLR
PDF
RCIM 2008 -- EHW
PDF
RCIM 2008 - Modello Generale
PDF
RCIM 2008 - Allocation Relocation
PPT
RCIM 2008 - - hArtes_Ferrara
PDF
RCIM 2008 - Janus
PDF
RCIM 2008 - Intro
PPT
DHow2 - L2
PPT
DHow2 - L4
PPT
DHow2 - L1
PPT
RCW@DEI - Treasure hunt
PDF
RCW@DEI - ADL
RCIM 2008 - - hArtes Atmel
RCIM 2008 - - UniCal
RCIM 2008 - - ALTERA
DHow2 - L6 VHDL
DHow2 - L6 Ant
DHow2 - L5
RCIM 2008 - - ALaRI
RCIM 2008 - Modello Scheduling
RCIM 2008 - HLR
RCIM 2008 -- EHW
RCIM 2008 - Modello Generale
RCIM 2008 - Allocation Relocation
RCIM 2008 - - hArtes_Ferrara
RCIM 2008 - Janus
RCIM 2008 - Intro
DHow2 - L2
DHow2 - L4
DHow2 - L1
RCW@DEI - Treasure hunt
RCW@DEI - ADL

Recently uploaded (20)

PDF
Buy Verified Stripe Accounts for Sale - Secure and.pdf
PPTX
The discussion on the Economic in transportation .pptx
PDF
ECONOMICS AND ENTREPRENEURS LESSONSS AND
PDF
THE EFFECT OF FOREIGN AID ON ECONOMIC GROWTH IN ETHIOPIA
PDF
Understanding University Research Expenditures (1)_compressed.pdf
PPTX
Session 3. Time Value of Money.pptx_finance
PPTX
Antihypertensive_Drugs_Presentation_Poonam_Painkra.pptx
PPTX
introuction to banking- Types of Payment Methods
PDF
Copia de Minimal 3D Technology Consulting Presentation.pdf
PDF
Chapter 9 IFRS Ed-Ed4_2020 Intermediate Accounting
PDF
Q2 2025 :Lundin Gold Conference Call Presentation_Final.pdf
PDF
ABriefOverviewComparisonUCP600_ISP8_URDG_758.pdf
PPTX
Session 14-16. Capital Structure Theories.pptx
PPTX
FL INTRODUCTION TO AGRIBUSINESS CHAPTER 1
PDF
Dr Tran Quoc Bao the first Vietnamese speaker at GITEX DigiHealth Conference ...
PPTX
Session 11-13. Working Capital Management and Cash Budget.pptx
PDF
Spending, Allocation Choices, and Aging THROUGH Retirement. Are all of these ...
PDF
caregiving tools.pdf...........................
PPTX
Basic Concepts of Economics.pvhjkl;vbjkl;ptx
PPTX
How best to drive Metrics, Ratios, and Key Performance Indicators
Buy Verified Stripe Accounts for Sale - Secure and.pdf
The discussion on the Economic in transportation .pptx
ECONOMICS AND ENTREPRENEURS LESSONSS AND
THE EFFECT OF FOREIGN AID ON ECONOMIC GROWTH IN ETHIOPIA
Understanding University Research Expenditures (1)_compressed.pdf
Session 3. Time Value of Money.pptx_finance
Antihypertensive_Drugs_Presentation_Poonam_Painkra.pptx
introuction to banking- Types of Payment Methods
Copia de Minimal 3D Technology Consulting Presentation.pdf
Chapter 9 IFRS Ed-Ed4_2020 Intermediate Accounting
Q2 2025 :Lundin Gold Conference Call Presentation_Final.pdf
ABriefOverviewComparisonUCP600_ISP8_URDG_758.pdf
Session 14-16. Capital Structure Theories.pptx
FL INTRODUCTION TO AGRIBUSINESS CHAPTER 1
Dr Tran Quoc Bao the first Vietnamese speaker at GITEX DigiHealth Conference ...
Session 11-13. Working Capital Management and Cash Budget.pptx
Spending, Allocation Choices, and Aging THROUGH Retirement. Are all of these ...
caregiving tools.pdf...........................
Basic Concepts of Economics.pvhjkl;vbjkl;ptx
How best to drive Metrics, Ratios, and Key Performance Indicators

3D-DRESD Polaris

  • 2. A workflow to manage allocation and relocation of tasks in a reconfigurable architecture Final goal: complete architecture (bitstreams) generation Polaris
  • 3. Management of 2D Reconfiguration in a Reconfigurable System Massimo Morandi [email_address]
  • 4. Outline Introduction Problem description Project Goals and Contributions Project in details Phases Results Future Work
  • 5. Problem Description New Generation of FPGAs Virtex-4 and Virtex-5 Allow bi-dimensional reconfiguration This permits to: Better exploit reconfigurable area Obtain modules performance optimizations More complex management: Handle one more degree of freedom Avoid more fragmentation Perform good placement choices to keep low TRR Keep acceptable intra-module routing paths
  • 6. Project Goals and Contributions Analyze effects of 2D reconfiguration New advantages New problems Examine possible solutions to new problems Explore literature to find promising ideas Evaluate those solutions in various scenarios Propose a new solution Combining ideas from literature with new ones Obtaining good cost-quality tradeoff
  • 7. Setting and Advantages Definition Definition of the setting: 2D self partial dynamical run-time reconfiguration Analysis of the advantages of 2D Reconfiguration In area usage and performance
  • 8. 2D Fragmentation Problem Analysis of the 2D-fragmentation problem Area generally more fragmented Can nullify the area optimizations obtained
  • 9. Placement Decisions Analysis of 2D placement choices effects: Again, bad choices can lead to performance loss
  • 10. Allocation manager Definition of allocation manager desired features: Low TRR Low management overhead High routing efficiency Low fragmentation Definition of allocation manager structure: Empty space manager Complete space Heuristic selection Fitter General (FF,BL,BF,WF…) Focused (FA,RA… )
  • 11. Most relevant works Maintain complete information on empty space: KAMER: Keep All Maximally Empty Rectangles Apply a general fitting strategy CUR: Maintain the Countour of a Union of Rectangles Apply a focused fitting strategy Heuristically prune part of the information: KNER: Keep Non-overlapping Empty Rectangles Apply a general fitting strategy 2D-HASHING: Keep Non-ov. Empty Rectangles in optimized data structure Apply (exclusively) a general fitting strategy
  • 12. Evaluation and Proposed Approach Proposed Approach Heuristic (KNER-like) empty space manager, to keep low complexity for use in a self-reconfigurable system Fitting strategy focused on minimizing routing paths, to maintain high performance of the reconfigurable system (chosen metric to minimize Manhattan distance) High placement quality => high complexity Lowest compl. => no focused fitting (bad especially for routing)
  • 13. Structure of the allocation manager Task, defined by: Arrival time, ASAP, (ALAP), H, W, Latency, Communicating Tasks Hosted in a queue which also adds a pointer to the rectangle where it is placed Reconfigurable Device, represented as: Binary Tree structure, each node is a Rectangle, each leaf is an empty Rectangle. Navigation trough pointers to left child, right child, next leaf and a function to find previous leaf (for bookkeeping after split or merge) Rectangle, defined by: X, Y, H, W Initially one, (X,Y)=(0,0), H=FPGA Rows, W=FPGA Cols
  • 15. Experimental Results Benchmark of 100 randomly generated tasks: Size (5% to 25% of FPGA), randomly interconnected Execution time: 3x less than CUR, close to KNER Communication cost: 3x less than KNER, close to CUR Task Rejection Rate: all solutions quite close
  • 16. Future Work Apply the proposed solution to self reconfiguration: Adapt the algorithm to run on the internal processor Create a validation reconfigurable architecture Integrate the architecture with relocation Tune the algorithm to improve results: Experiment techniques to reduce TRR Try to optimize the code to have an algorithm with lower running time Evaluate other fitting strategies
  • 18. Relocation for 2D Reconfigurable Systems Marco Novati [email_address]
  • 19. Project Outline Introduction Problem description Project Goals Project in details Phases Results What’s next
  • 20. Problem Description Self Dynamical Runtime 2D Reconfiguration Xilinx Virtex-4 and Virtex-5 Relocation, different solutions Software Hardware We chose an hardware solution BiRF Square
  • 21. Project Goals Study of the new FPGA Families Examination of Xilinx documentation on V4 and V5 Analysis of the new bitstream structure Generation of V4 and V5 bitstream Development of the new version of BiRF Implementation Validation
  • 22. New Frame Addressing: Possibility of addressing rows and columns Frame Addressing (1/2)
  • 25. CRC Calculation Particular CRC value, used by Xilinx tools Two version of BiRF Square: By using the “predefined” value With actual CRC calculation An optimized algorithm has been used
  • 26. Synthesis results On a Virtex-4 with speed grade -12 General purpose version: max frequency of 160 MHz Specific version: max frequency of 290 Mhz
  • 29. Results (1/2) BiRF Square Permits apply relocation in a self partially and dynamically 2D-reconfigurable system The occupation ratio is relatively small Frequency more than acceptable Reduction of internal memory requirements
  • 30. Results (2/2) Throughput of 7,3 MB/s: A total configuration file size is about 1 MB Considering an architecture: 1/3 of the area as fixed part 2/3 as reconfigurable part with 6 slots With such hypothesis Size of a partial bitstream will be about 110 KB Relocation time of about 15 ms
  • 31. What’s Next Future improvements: Direct access to the memory (DMA) Direct manipulation of the bitstream Portability Integration with ICAP Elimination of the relocation overhead Relocation time << reconfiguration time Future work: Provide a simulation framework to monitor the reconfigurable system evolution and to evaluate different choices The final goal: Creation of a real architecture that exploits self partial and dynamical 2D-reconfiguration,with relocation