COE 504 – Heterogeneous Computing
        Project Presentation




             Presenter:
  Mohammed Omer Assayony(201102150)




                                      1
   Pipelined Computation
   Pipelined Computation Applications
    ◦ Type 1 : Multiple instances of the complete problem
    ◦ Type 2 : Overlapping Pipeline Stages
   DDS Architecture for Pipelining Computation




                                                            2
 Pipelined computation is a parallel technique
  used when the problem could be divided into
  series of tasks that have to be completed in
  sequential order.
 One form of functional decomposition:
    ◦ Each task (function) is implemented as a
      process, called a stage.
    ◦ Each stage contributes to the overall solution of the
      problem and passes on the required data for
      subsequent stage.

                                                              3
4
   Though the execution of pipelining seems
    sequential, it can provide reasonable speedup
    in TWO TYPES of computations:

     Type1: When more than one instance of the
     complete problem is to be executed.
     Type 2: When the data required starting the next
     state can be passed forward before the current stage
     has completed all its internal operation.


                                                            5
   Pipelining increases the throughput (number of
    instances completed per time unit).
   Typical Application Examples:
    1. Pipelining Addition.
    2. Frequency Filtering.
    3. Sound Digitizing……...




                                                     6
Time for m instances = (pipeline fill + number of instances) x stage delay
                      = ( p- 1        +       m              ) x stage delay


             Throughput = m/((p- 1 +m) x stage delay)
             If m >> p
                 Throughput = 1/stage delay
                                                                               7
   In this type of computation, pipelining improves the
    execution time of a single instance due to
    computation overlapping.
   The gained improvement depends on communication

The fast the communication, the better the performance

   Typical Application Examples:
    ◦ Linear Algebra Algorithm:
       Gauss Elimination, Back-substitution …….

                                                           8
9
Report Result



Subscriber               Publisher             Subscriber                Publisher   Subscriber                Publisher


             Processin
                 g                                          Processing                            Processing



         Process 1                                      Process 1


                          General DDS Architecture for Pipelining Computation




                                                                                                                           10
Q&A
      11

More Related Content

PDF
Gf3511031106
PDF
High Speed and Time Efficient 1-D DWT on Xilinx Virtex4 DWT Using 9/7 Filter ...
PDF
Pretzel: optimized Machine Learning framework for low-latency and high throug...
PDF
Solution(1)
PDF
Pretzel: optimized Machine Learning framework for low-latency and high throug...
PDF
A Detailed Survey on VLSI Architectures for Lifting based DWT for efficient h...
PDF
Implementation and evaluation of novel scheduler of UC/OS (RTOS)
PPT
Chap4 slides
Gf3511031106
High Speed and Time Efficient 1-D DWT on Xilinx Virtex4 DWT Using 9/7 Filter ...
Pretzel: optimized Machine Learning framework for low-latency and high throug...
Solution(1)
Pretzel: optimized Machine Learning framework for low-latency and high throug...
A Detailed Survey on VLSI Architectures for Lifting based DWT for efficient h...
Implementation and evaluation of novel scheduler of UC/OS (RTOS)
Chap4 slides

What's hot (19)

PPT
Basic Communication
PDF
Using the black-box approach with machine learning methods in ...
PDF
Transfer Learning for Performance Analysis of Configurable Systems: A Causal ...
PDF
Improvement of Congestion window and Link utilization of High Speed Protocols...
PDF
Traffic Class Assignment for Mixed-Criticality Frames in TTEthernet
PDF
Performance Analysis of OFDM Transceiver with Folded FFT and LMS Filter
PPT
Collective Communications in MPI
PPT
Chap11 slides
PDF
VaMoS 2022 - Transfer Learning across Distinct Software Systems
PDF
VTU 5TH SEM CSE OPERATING SYSTEMS SOLVED PAPERS
PDF
FPGA Implementation of 2-D DCT & DWT Engines for Vision Based Tracking of Dyn...
PDF
Automated Parameterization of Performance Models from Measurements
PPT
program partitioning and scheduling IN Advanced Computer Architecture
PDF
Highly Parallel Pipelined VLSI Implementation of Lifting Based 2D Discrete Wa...
PPT
Distributed System by Pratik Tambekar
PDF
Fairness in Transfer Control Protocol for Congestion Control in Multiplicativ...
PPT
Chap4 slides
PPT
Chap12 slides
ODP
Mobicents Media Server theory, practice, cloud considerations, design discuss...
Basic Communication
Using the black-box approach with machine learning methods in ...
Transfer Learning for Performance Analysis of Configurable Systems: A Causal ...
Improvement of Congestion window and Link utilization of High Speed Protocols...
Traffic Class Assignment for Mixed-Criticality Frames in TTEthernet
Performance Analysis of OFDM Transceiver with Folded FFT and LMS Filter
Collective Communications in MPI
Chap11 slides
VaMoS 2022 - Transfer Learning across Distinct Software Systems
VTU 5TH SEM CSE OPERATING SYSTEMS SOLVED PAPERS
FPGA Implementation of 2-D DCT & DWT Engines for Vision Based Tracking of Dyn...
Automated Parameterization of Performance Models from Measurements
program partitioning and scheduling IN Advanced Computer Architecture
Highly Parallel Pipelined VLSI Implementation of Lifting Based 2D Discrete Wa...
Distributed System by Pratik Tambekar
Fairness in Transfer Control Protocol for Congestion Control in Multiplicativ...
Chap4 slides
Chap12 slides
Mobicents Media Server theory, practice, cloud considerations, design discuss...
Ad

Viewers also liked (17)

PPTX
3 Pipelining
PDF
Instruction pipelining (ii)
PPT
Instruction pipelining
PDF
Aca2 06 new
PPT
Pipelinig hazardous
PPT
Pipeline
PPT
Performance Enhancement with Pipelining
PPT
pipeline and vector processing
PDF
Array Processor
PPT
Lec18 pipeline
PPTX
pipelining
PDF
Pipeline and data hazard
PPTX
Instruction pipeline: Computer Architecture
PPTX
Instruction pipelining
PPTX
Pipelining and vector processing
PPT
pipelining
PDF
A Guide to SlideShare Analytics - Excerpts from Hubspot's Step by Step Guide ...
3 Pipelining
Instruction pipelining (ii)
Instruction pipelining
Aca2 06 new
Pipelinig hazardous
Pipeline
Performance Enhancement with Pipelining
pipeline and vector processing
Array Processor
Lec18 pipeline
pipelining
Pipeline and data hazard
Instruction pipeline: Computer Architecture
Instruction pipelining
Pipelining and vector processing
pipelining
A Guide to SlideShare Analytics - Excerpts from Hubspot's Step by Step Guide ...
Ad

Similar to Pipelining Coputing (20)

PPTX
Computer arithmetic in computer architecture
PDF
High Performance & High Throughput Computing - EUDAT Summer School (Giuseppe ...
PPTX
Pipelining in Computer System Achitecture
PPTX
Unit 4 COA.pptx
PDF
pipelining ppt.pdf
PPTX
Optimization of Continuous Queries in Federated Database and Stream Processin...
PDF
Framework for Inter-Model Analysis of Cyber-Physical Systems
PPTX
pipeline in computer architecture design
PPT
Computer Organozation
PPT
UDT
PDF
High Performance Engineering - 01-intro.pdf
PPT
Unit 6 of OS in computer science and engineering
PPT
UDT
PPTX
BTCS501_MM_Ch9.pptx
PDF
CS304PC:Computer Organization and Architecture Session 33 demo 1 ppt.pdf
PDF
Parallel Processing Techniques Pipelining
PPT
Pipelining (COA)okokokokokokokokokokok.ppt
PPTX
CPU Pipelining and Hazards - An Introduction
PPT
Unit 3-pipelining & vector processing
PPT
Computer_Architecture_3rd_Edition_by_Moris_Mano_Ch_09.ppt
Computer arithmetic in computer architecture
High Performance & High Throughput Computing - EUDAT Summer School (Giuseppe ...
Pipelining in Computer System Achitecture
Unit 4 COA.pptx
pipelining ppt.pdf
Optimization of Continuous Queries in Federated Database and Stream Processin...
Framework for Inter-Model Analysis of Cyber-Physical Systems
pipeline in computer architecture design
Computer Organozation
UDT
High Performance Engineering - 01-intro.pdf
Unit 6 of OS in computer science and engineering
UDT
BTCS501_MM_Ch9.pptx
CS304PC:Computer Organization and Architecture Session 33 demo 1 ppt.pdf
Parallel Processing Techniques Pipelining
Pipelining (COA)okokokokokokokokokokok.ppt
CPU Pipelining and Hazards - An Introduction
Unit 3-pipelining & vector processing
Computer_Architecture_3rd_Edition_by_Moris_Mano_Ch_09.ppt

Pipelining Coputing

  • 1. COE 504 – Heterogeneous Computing Project Presentation Presenter: Mohammed Omer Assayony(201102150) 1
  • 2. Pipelined Computation  Pipelined Computation Applications ◦ Type 1 : Multiple instances of the complete problem ◦ Type 2 : Overlapping Pipeline Stages  DDS Architecture for Pipelining Computation 2
  • 3.  Pipelined computation is a parallel technique used when the problem could be divided into series of tasks that have to be completed in sequential order.  One form of functional decomposition: ◦ Each task (function) is implemented as a process, called a stage. ◦ Each stage contributes to the overall solution of the problem and passes on the required data for subsequent stage. 3
  • 4. 4
  • 5. Though the execution of pipelining seems sequential, it can provide reasonable speedup in TWO TYPES of computations: Type1: When more than one instance of the complete problem is to be executed. Type 2: When the data required starting the next state can be passed forward before the current stage has completed all its internal operation. 5
  • 6. Pipelining increases the throughput (number of instances completed per time unit).  Typical Application Examples: 1. Pipelining Addition. 2. Frequency Filtering. 3. Sound Digitizing……... 6
  • 7. Time for m instances = (pipeline fill + number of instances) x stage delay = ( p- 1 + m ) x stage delay Throughput = m/((p- 1 +m) x stage delay) If m >> p Throughput = 1/stage delay 7
  • 8. In this type of computation, pipelining improves the execution time of a single instance due to computation overlapping.  The gained improvement depends on communication The fast the communication, the better the performance  Typical Application Examples: ◦ Linear Algebra Algorithm:  Gauss Elimination, Back-substitution ……. 8
  • 9. 9
  • 10. Report Result Subscriber Publisher Subscriber Publisher Subscriber Publisher Processin g Processing Processing Process 1 Process 1 General DDS Architecture for Pipelining Computation 10
  • 11. Q&A 11