SlideShare a Scribd company logo
Parallel Processing
Chapter 7
1
Introduction
 A traditional way to increase computer system performance is to use
multiple processors that can execute in parallel to support a given
workload.
 The most common multiple-processor organizations is Symmetric
Multiprocessors (SMPs).
 SMP consists of multiple similar processors within the same computer,
interconnected by a bus or some sort of switching arrangement.
 Each processor has its own cache and so it is possible for a given line of
data to be present in more than one cache.
 When more than one processor is implemented on a single chip, the
configuration is referred to as multiprocessor system.
 This design scheme is used to replicate some of the components of a
single processor so that the processor can execute multiple
threads/processes/tasks concurrently.
2
Cont…d
 This view of the computer is upgraded to the micro-
operation levels, multiple control signals are generated at
the same time.
 Instruction pipelining, at least to the extent of
overlapping fetch and execute operations, has been
around for a long time. Both of these are examples of
performing functions in parallel. This approach is taken
further with superscalar organization, which exploits
instruction-level parallelism.
 With a superscalar machine, there are multiple execution
units within a single processor, and these may execute
multiple instructions from the same program in parallel.
3
Introd… cont…d
 Parallel processing/computing, uses multiple processing
elements simultaneously to solve a problem. This is accomplished
by breaking the problem/task into independent parts so that each
processing element can execute its part of the algorithm
simultaneously with the others.
 The processing elements can be diverse and include resources such
as a single computer with multiple processors, several networked
computers, specialized hardware, or any combination of the above.
 Parallel processing or parallel computing is a form of
computation in which many calculations are carried out
simultaneously, operating on the principle that large problems can
often be divided into smaller ones, which are then solved
concurrently ("in parallel").
4
Introd… cont…d
 A parallel processing system is able to perform concurrent
data processing to achieve faster execution time
 The system may have two or more ALUs and be able to
execute two or more instructions at the same time
 Also, the system may have two or more processors
operating concurrently , the goal is to increase the
throughput – the amount of processing that can be
accomplished during a given interval of time
 A multifunctional organization is usually associated with a
complex control unit to coordinate all the activities among
the various components.
5
Functional units of parallel processing
 Parallel processing can be classified from:
 The internal organization of the processors
 The interconnection structure between processors
 The flow of information through the system
 The number of instructions and data items that are
manipulated simultaneously
 The sequence of instructions read from memory is the
instruction stream
 The operations performed on the data in the processor is
the data stream
• Parallel processing may occur in the instruction stream, the
data stream, or both
6
Eight functional units of CPU
7
Computer classification
8
 A computer can be classified as:
i. Single Instruction Stream, Single Data Stream – SISD
ii. Single Instruction Stream, Multiple Data Stream – SIMD
iii. Multiple Instruction Stream, Single Data Stream – MISD
iv. Multiple Instruction Stream, Multiple Data Stream – MIMD
 SISD -Instructions are executed sequentially. Parallel processing may
be achieved by means of multiple functional units or by pipeline
processing
 SIMD – Includes multiple processing units with a single control unit.All
processors receive the same instruction, but operate on different data.
 MISD-- The same data stream flows through a linear array of processors
executing different instruction streams.
 MIMD – A computer system capable of processing several programs at
the same time.
Pipelining
9
 Pipelining is an implementation technique where multiple
instructions are overlapped in execution.
 The computer pipeline is divided into stages. Each stage
completes a part of an instruction in parallel. The stages are
connected one to the next to form a pipe - instructions enter at
one end, progress through the stages, and exit at the other end.
 Pipelining refers to the technique in which a given task is
divided into a number of subtasks that need to be performed in
sequence. Each subtask is performed by a given functional unit.
The units are connected in a serial fashion and all of them operate
simultaneously. The use of pipelining improves the performance
compared to the traditional sequential execution of tasks.
Pipelining cont…d
10
 In general, pipelining is a technique of decomposing a
sequential process into sub operations, with each sub process
being executed in a special dedicated segment that operates
concurrently with all other segments. Each segment performs
partial processing dictated by the way the task is partitioned.
The result obtained from the computation in each segment is
transferred to the next segment in the pipeline.The final result
is obtained after the data have passed through all segments.
Example
11
Suppose that we want to perform the combined multiply
and add operations with a stream of numbers.
Ai  Bi + Ci , for i = 1, 2, 3, …, 7
The sub-operations performed in each segment are:
R1 A
← i , R2 B
← i Input Ai and Bi
R3 R1
←  R2, R4 C
← i Multiply and input Ci
R5 R3 + R4
← Add Ci to product
Examples of pipelining process
12
Contents of registers in the above example
13
14
Good Luck !
Q?

More Related Content

PPTX
Parallel processing
PPTX
PP - CH01 (2).pptxhhsjoshhshhshhhshhshsbx
PPTX
Parallel computing and its applications
PDF
Parallel and Distributed Computing chapter 1
PPTX
PPTX
pipelining
PPTX
Parallel Processing.pptx
PPTX
Cloud computing and distributed systems.
Parallel processing
PP - CH01 (2).pptxhhsjoshhshhshhhshhshsbx
Parallel computing and its applications
Parallel and Distributed Computing chapter 1
pipelining
Parallel Processing.pptx
Cloud computing and distributed systems.

Similar to Ch-7.pptx about architecture and computer (20)

PPTX
Cloud computing: Parallel and distributed processing.
PPT
1.prallelism
PPT
1.prallelism
PPTX
Pipelining and vector processing
PDF
Module 2.pdf
PPTX
paradigms cloud.pptx
PPTX
DLCA Flynns Classification Introduction to multicore arch
PPTX
PPTX
PDF
Computer organization and architectures chapter8
PDF
Process Control Block (PCB) print 4.pdf
PPT
Real-Time Scheduling Algorithms
PDF
A Parallel Computing-a Paradigm to achieve High Performance
PPTX
UNIT 6 in computer organization cse.pptx
DOCX
Parallel computing persentation
DOC
PARALLEL ARCHITECTURE AND COMPUTING - SHORT NOTES
PPTX
Parallel Processing Presentation2
PDF
CS6801-MULTI-CORE-ARCHITECTURE-AND-PROGRAMMING_watermark.pdf
PDF
Lec+3-Introduction-to-Distributed-Systems.pdf
Cloud computing: Parallel and distributed processing.
1.prallelism
1.prallelism
Pipelining and vector processing
Module 2.pdf
paradigms cloud.pptx
DLCA Flynns Classification Introduction to multicore arch
Computer organization and architectures chapter8
Process Control Block (PCB) print 4.pdf
Real-Time Scheduling Algorithms
A Parallel Computing-a Paradigm to achieve High Performance
UNIT 6 in computer organization cse.pptx
Parallel computing persentation
PARALLEL ARCHITECTURE AND COMPUTING - SHORT NOTES
Parallel Processing Presentation2
CS6801-MULTI-CORE-ARCHITECTURE-AND-PROGRAMMING_watermark.pdf
Lec+3-Introduction-to-Distributed-Systems.pdf
Ad

More from Toyba2 (9)

PPT
ch 3_The CPU_modified.ppt of central processing unit
PPTX
ch 2_Component and function of computer .pptx
PPTX
ch 1_Evolution of computer architecture.pptx
PPTX
Maxhine learning rec02 - MLP and BP.pptx
PPTX
Maintenance of computer hard warelab.pptx
PPTX
Machine learning rec02 - MLP and BP.pptx
PPTX
Maintenance of computer hard warelab.pptx
PPTX
This is about session rec02 - MLP and BP.pptx
PPTX
Computer hard ware and Maintenance lab.pptx
ch 3_The CPU_modified.ppt of central processing unit
ch 2_Component and function of computer .pptx
ch 1_Evolution of computer architecture.pptx
Maxhine learning rec02 - MLP and BP.pptx
Maintenance of computer hard warelab.pptx
Machine learning rec02 - MLP and BP.pptx
Maintenance of computer hard warelab.pptx
This is about session rec02 - MLP and BP.pptx
Computer hard ware and Maintenance lab.pptx
Ad

Recently uploaded (20)

PPTX
Your Guide to a Winning Interview Aug 2025.
PPTX
A slide for students with the advantagea
PPTX
PE3-WEEK-3sdsadsadasdadadwadwdsdddddd.pptx
PPTX
The Stock at arrangement the stock and product.pptx
PPTX
Theory of Change. AFH-FRDP OCEAN ToCpptx
PPT
NO000387 (1).pptsbsnsnsnsnsnsnsmsnnsnsnsjsnnsnsnsnnsnnansnwjwnshshshs
PPTX
_Dispute Resolution_July 2022.pptxmhhghhhh
PPTX
DPT-MAY24.pptx for review and ucploading
PPTX
STS CHAP 4 human development as reflected
PDF
313302 DBMS UNIT 1 PPT for diploma Computer Eng Unit 2
PDF
Sheri Ann Lowe Compliance Strategist Resume
DOCX
mcsp232projectguidelinesjan2023 (1).docx
PPT
Gsisgdkddkvdgjsjdvdbdbdbdghjkhgcvvkkfcxxfg
PPTX
Sports and Dance -lesson 3 powerpoint presentation
PPTX
Surgical thesis protocol formation ppt.pptx
PPTX
The-Scope-of-Food-Quality-and-Safety.pptx managemement
PDF
APNCET2025RESULT Result Result 2025 2025
PDF
Understanding the Rhetorical Situation Presentation in Blue Orange Muted Il_2...
PPTX
Prokaryotes v Eukaryotes PowerPoint.pptx
PPTX
1751884730-Visual Basic -Unitj CS B.pptx
Your Guide to a Winning Interview Aug 2025.
A slide for students with the advantagea
PE3-WEEK-3sdsadsadasdadadwadwdsdddddd.pptx
The Stock at arrangement the stock and product.pptx
Theory of Change. AFH-FRDP OCEAN ToCpptx
NO000387 (1).pptsbsnsnsnsnsnsnsmsnnsnsnsjsnnsnsnsnnsnnansnwjwnshshshs
_Dispute Resolution_July 2022.pptxmhhghhhh
DPT-MAY24.pptx for review and ucploading
STS CHAP 4 human development as reflected
313302 DBMS UNIT 1 PPT for diploma Computer Eng Unit 2
Sheri Ann Lowe Compliance Strategist Resume
mcsp232projectguidelinesjan2023 (1).docx
Gsisgdkddkvdgjsjdvdbdbdbdghjkhgcvvkkfcxxfg
Sports and Dance -lesson 3 powerpoint presentation
Surgical thesis protocol formation ppt.pptx
The-Scope-of-Food-Quality-and-Safety.pptx managemement
APNCET2025RESULT Result Result 2025 2025
Understanding the Rhetorical Situation Presentation in Blue Orange Muted Il_2...
Prokaryotes v Eukaryotes PowerPoint.pptx
1751884730-Visual Basic -Unitj CS B.pptx

Ch-7.pptx about architecture and computer

  • 2. Introduction  A traditional way to increase computer system performance is to use multiple processors that can execute in parallel to support a given workload.  The most common multiple-processor organizations is Symmetric Multiprocessors (SMPs).  SMP consists of multiple similar processors within the same computer, interconnected by a bus or some sort of switching arrangement.  Each processor has its own cache and so it is possible for a given line of data to be present in more than one cache.  When more than one processor is implemented on a single chip, the configuration is referred to as multiprocessor system.  This design scheme is used to replicate some of the components of a single processor so that the processor can execute multiple threads/processes/tasks concurrently. 2
  • 3. Cont…d  This view of the computer is upgraded to the micro- operation levels, multiple control signals are generated at the same time.  Instruction pipelining, at least to the extent of overlapping fetch and execute operations, has been around for a long time. Both of these are examples of performing functions in parallel. This approach is taken further with superscalar organization, which exploits instruction-level parallelism.  With a superscalar machine, there are multiple execution units within a single processor, and these may execute multiple instructions from the same program in parallel. 3
  • 4. Introd… cont…d  Parallel processing/computing, uses multiple processing elements simultaneously to solve a problem. This is accomplished by breaking the problem/task into independent parts so that each processing element can execute its part of the algorithm simultaneously with the others.  The processing elements can be diverse and include resources such as a single computer with multiple processors, several networked computers, specialized hardware, or any combination of the above.  Parallel processing or parallel computing is a form of computation in which many calculations are carried out simultaneously, operating on the principle that large problems can often be divided into smaller ones, which are then solved concurrently ("in parallel"). 4
  • 5. Introd… cont…d  A parallel processing system is able to perform concurrent data processing to achieve faster execution time  The system may have two or more ALUs and be able to execute two or more instructions at the same time  Also, the system may have two or more processors operating concurrently , the goal is to increase the throughput – the amount of processing that can be accomplished during a given interval of time  A multifunctional organization is usually associated with a complex control unit to coordinate all the activities among the various components. 5
  • 6. Functional units of parallel processing  Parallel processing can be classified from:  The internal organization of the processors  The interconnection structure between processors  The flow of information through the system  The number of instructions and data items that are manipulated simultaneously  The sequence of instructions read from memory is the instruction stream  The operations performed on the data in the processor is the data stream • Parallel processing may occur in the instruction stream, the data stream, or both 6
  • 8. Computer classification 8  A computer can be classified as: i. Single Instruction Stream, Single Data Stream – SISD ii. Single Instruction Stream, Multiple Data Stream – SIMD iii. Multiple Instruction Stream, Single Data Stream – MISD iv. Multiple Instruction Stream, Multiple Data Stream – MIMD  SISD -Instructions are executed sequentially. Parallel processing may be achieved by means of multiple functional units or by pipeline processing  SIMD – Includes multiple processing units with a single control unit.All processors receive the same instruction, but operate on different data.  MISD-- The same data stream flows through a linear array of processors executing different instruction streams.  MIMD – A computer system capable of processing several programs at the same time.
  • 9. Pipelining 9  Pipelining is an implementation technique where multiple instructions are overlapped in execution.  The computer pipeline is divided into stages. Each stage completes a part of an instruction in parallel. The stages are connected one to the next to form a pipe - instructions enter at one end, progress through the stages, and exit at the other end.  Pipelining refers to the technique in which a given task is divided into a number of subtasks that need to be performed in sequence. Each subtask is performed by a given functional unit. The units are connected in a serial fashion and all of them operate simultaneously. The use of pipelining improves the performance compared to the traditional sequential execution of tasks.
  • 10. Pipelining cont…d 10  In general, pipelining is a technique of decomposing a sequential process into sub operations, with each sub process being executed in a special dedicated segment that operates concurrently with all other segments. Each segment performs partial processing dictated by the way the task is partitioned. The result obtained from the computation in each segment is transferred to the next segment in the pipeline.The final result is obtained after the data have passed through all segments.
  • 11. Example 11 Suppose that we want to perform the combined multiply and add operations with a stream of numbers. Ai  Bi + Ci , for i = 1, 2, 3, …, 7 The sub-operations performed in each segment are: R1 A ← i , R2 B ← i Input Ai and Bi R3 R1 ←  R2, R4 C ← i Multiply and input Ci R5 R3 + R4 ← Add Ci to product
  • 13. Contents of registers in the above example 13