SlideShare a Scribd company logo
General Aspects of Computer Organization
(Lecture-4)
R S Ananda Murthy
Associate Professor and Head
Department of Electrical & Electronics Engineering,
Sri Jayachamarajendra College of Engineering,
Mysore 570 006
R S Ananda Murthy General Aspects of Computer Organization
Data Path In Side CPU
A
B
A+B
Registers ALU Input
Register
ALU
ALU Output
Register
ALU
Input Bus
BA A+B
Feeding two operands to the ALU and storing the output of
ALU in an internal register is called data path cycle.
Faster data path cycle results in faster program execution.
Multiple ALUs operating in parallel results in faster data
path cycle.
R S Ananda Murthy General Aspects of Computer Organization
RISC Design Speeds Up Program Execution
Most manufacturers today implement the following features in
their processors to improve performance –
All instructions are directly executed by hardware instead
of being interpreted by a microprogram.
Maximize the rate at which instructions are issued by
adopting instruction-level parallelism.
Use simple fixed-lengh instructions to speed-up decoding.
Avoid performing arithmetic and logical operations directly
on data present in the memory i.e., only LOAD and STORE
instructions should be executed with reference to memory.
Provide plently of registers in side the CPU.
R S Ananda Murthy General Aspects of Computer Organization
Pipelining for High Performance
Number of stages in a pipeline varies depending upon the
hardware design of the CPU.
Each stage in a pipeline is executed by a dedicated
hardware unit in side the CU.
Each stage in a pipeline takes the same amount of time to
complete its task.
Hardware units of different stages in a pipeline can work
concurrently.
Operation of hardware units is synchronized by the clock
signal.
To implement pipelining instructions must be of fixed length
and same instruction cycle time.
Pipelining requires sophisticated compiling techniques to
be implemented in the compiler.
R S Ananda Murthy General Aspects of Computer Organization
A 4-Stage Pipeline
Clock Cycle
W1E1D1F1
Instruction
I1
W2E2D2F2I2
W3E3D3F3I3
W4E4D4F4I4
1 2 3 4 5 6 7
Time
Hardware Stages in Pipeline
F: Fetch instruction
D: Decode and get operands
E: Execute the instruction
W: Write result at destination
Period of clock signal
No. of stages in pipeline
F: Fetch
Instruction
D: Decode
and get
operands
E: Execute
operation
W: Write
results
B1 B2 B3
B1, B2, and B3 are storage buffers.
Information is passed from one stage to the next through
storage buffers.
Time taken to execute each instruction is nT.
Processor Band Width is 1/(T106) MIPS (Million
Instructions Per Second).
R S Ananda Murthy General Aspects of Computer Organization
Superscalar Architecture
Instruction
decode
unit
Operand
fetch
unit
Instruction
execution
unit
Write
back
unit
S2 S3 S4 S5
Instruction
decode
unit
Operand
fetch
unit
Instruction
execution
unit
Write
back
unit
Instruction
fetch
unit
S1
Superscalar architecture has multiple pipelines as shown
above.
In the above example, a single fetch unit fetches a pair of
instructions together and puts each one into its own
pipeline, complete with its own ALU for parallel operation.
Compiler must ensure that the two instructions fetched do
not conflict over resource usage.
R S Ananda Murthy General Aspects of Computer Organization
Superscalar Architecture with Five Functional Units
Instruction
decode
unit
Operand
fetch
unit
LOAD
Write
back
unit
S2 S3
S4
S5S1
Instruction
fetch
unit
ALU
ALU
STORE
Floating
Point
Now-a-days the word “superscalar” is used to describe
processors that issue multiple instructions – often four to
six – in a single clock cycle.
Superscalar processors generally have one pipeline with
multiple functional units as shown above.
R S Ananda Murthy General Aspects of Computer Organization
License
This work is licensed under a
Creative Commons Attribution 4.0 International License.
R S Ananda Murthy General Aspects of Computer Organization

More Related Content

PDF
L7 starting-to-use-mcu
PDF
L1 intro-to-mpu-mcu
PDF
L10 assembly-language-programming-of-atmega328 p
PDF
L5 data-parallel-computers
PDF
L12 c-language-programming-of-atmega328 p
PDF
Realization of high performance run time loadable mips soft-core processor
PPTX
EC8791 designing with computing platform
PPTX
Computer Architecture – An Introduction
L7 starting-to-use-mcu
L1 intro-to-mpu-mcu
L10 assembly-language-programming-of-atmega328 p
L5 data-parallel-computers
L12 c-language-programming-of-atmega328 p
Realization of high performance run time loadable mips soft-core processor
EC8791 designing with computing platform
Computer Architecture – An Introduction

What's hot (20)

PPTX
Lecture 11
PPTX
Pipelining and vector processing
PDF
Bca examination 2015 csa
PPT
Program control
PPTX
SYBSC IT SEM IV EMBEDDED SYSTEMS UNIT II Embeded Hardware
PDF
Computer architecture
PPTX
Instruction set and instruction execution cycle
PDF
Instruction cycle
PPTX
Ec8791 arm 9 processor
PDF
Basic Computer Organization and Design
PDF
Embedded Systems (18EC62) - ARM Cortex-M3 Instruction Set and Programming (Mo...
PDF
Computer organiztion5
DOCX
Applications - embedded systems
PPT
Unit3 input
PPTX
Input & Output
PPT
Bca 2nd sem-u-2.2-overview of register transfer, micro operations and basic c...
PPTX
Embedded Systems Introdution
PDF
Computer architecture
PPTX
SYBSC IT SEM IV EMBEDDED SYSTEMS UNIT III The 8051 Microcontrollers
PPTX
Instruction cycle with interrupts
Lecture 11
Pipelining and vector processing
Bca examination 2015 csa
Program control
SYBSC IT SEM IV EMBEDDED SYSTEMS UNIT II Embeded Hardware
Computer architecture
Instruction set and instruction execution cycle
Instruction cycle
Ec8791 arm 9 processor
Basic Computer Organization and Design
Embedded Systems (18EC62) - ARM Cortex-M3 Instruction Set and Programming (Mo...
Computer organiztion5
Applications - embedded systems
Unit3 input
Input & Output
Bca 2nd sem-u-2.2-overview of register transfer, micro operations and basic c...
Embedded Systems Introdution
Computer architecture
SYBSC IT SEM IV EMBEDDED SYSTEMS UNIT III The 8051 Microcontrollers
Instruction cycle with interrupts
Ad

Viewers also liked (20)

PDF
L6 primary-memory
PDF
Introduction to-Tex-and-LaTeX
PDF
Transformers
PDF
Lecture-5 : Semiconductor Power Switching Devices-2
PDF
L3 instruction-execution-steps
PDF
L13 interrupts-in-atmega328 p
PDF
Lecture-3 : More Applications of Power Electronics
PDF
Lecture-7 : Semiconductor Power Switching Devices-4
PDF
Lecture-2 : Applications of Power Electronics
PDF
L14 kb-lcd-interfacing-with-atmega328 p
PDF
Three phase-circuits
PDF
L9 understanding-atmega328 p-2
PDF
L16 usart-atmega328 p
PDF
L8 understanding-atmega328 p-1
PDF
Synchronous generators
PDF
L11 assembly-language-programming-of-atmega328 p
PDF
L15 timers-counters-in-atmega328 p
PDF
Lecture-4 : Semiconductor Power Switching Devices-1
PDF
Lecture-1 : Introduction to Power Electronics
PDF
Trends in-power-electronics
L6 primary-memory
Introduction to-Tex-and-LaTeX
Transformers
Lecture-5 : Semiconductor Power Switching Devices-2
L3 instruction-execution-steps
L13 interrupts-in-atmega328 p
Lecture-3 : More Applications of Power Electronics
Lecture-7 : Semiconductor Power Switching Devices-4
Lecture-2 : Applications of Power Electronics
L14 kb-lcd-interfacing-with-atmega328 p
Three phase-circuits
L9 understanding-atmega328 p-2
L16 usart-atmega328 p
L8 understanding-atmega328 p-1
Synchronous generators
L11 assembly-language-programming-of-atmega328 p
L15 timers-counters-in-atmega328 p
Lecture-4 : Semiconductor Power Switching Devices-1
Lecture-1 : Introduction to Power Electronics
Trends in-power-electronics
Ad

Similar to L4 speeding-up-execution (20)

PPT
Basic structure of computers by aniket bhute
PPT
Basic structure of computers
PPT
Basic structure of computers
PPT
chapter 1 -Basic Structure of Computers (1).ppt
PPT
chapter1-basic-structure-of-computers.ppt
PPT
chapter1-basic-structure-of-computers.ppt
PDF
Design and development of a 5-stage Pipelined RISC processor based on MIPS
PPT
chapter1-basic-structure-of-computers.ppt
PPTX
Chapter 1 basic structure of computers
PPT
chapter 1 -Basic Structure of Computers.ppt
PPT
UNIT I.ppt
PPT
chapter1-basicstructureofcomputers.ppt
PDF
W04505116121
PPT
Chapter1 basic structure of computers
PPTX
Pipeline & Nonpipeline Processor
PPT
chapter1 -Basic co.pptjsjjsjdjxjdjdjdjjsjsjd
PDF
A044050107
PPTX
Lecture 46
PPTX
pipelining
PPT
Pipeline hazard
Basic structure of computers by aniket bhute
Basic structure of computers
Basic structure of computers
chapter 1 -Basic Structure of Computers (1).ppt
chapter1-basic-structure-of-computers.ppt
chapter1-basic-structure-of-computers.ppt
Design and development of a 5-stage Pipelined RISC processor based on MIPS
chapter1-basic-structure-of-computers.ppt
Chapter 1 basic structure of computers
chapter 1 -Basic Structure of Computers.ppt
UNIT I.ppt
chapter1-basicstructureofcomputers.ppt
W04505116121
Chapter1 basic structure of computers
Pipeline & Nonpipeline Processor
chapter1 -Basic co.pptjsjjsjdjxjdjdjdjjsjsjd
A044050107
Lecture 46
pipelining
Pipeline hazard

Recently uploaded (20)

PDF
Design Guidelines and solutions for Plastics parts
PPTX
Feature types and data preprocessing steps
PPTX
Artificial Intelligence
PDF
Influence of Green Infrastructure on Residents’ Endorsement of the New Ecolog...
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PPTX
AUTOMOTIVE ENGINE MANAGEMENT (MECHATRONICS).pptx
PDF
22EC502-MICROCONTROLLER AND INTERFACING-8051 MICROCONTROLLER.pdf
PPTX
CURRICULAM DESIGN engineering FOR CSE 2025.pptx
PDF
Visual Aids for Exploratory Data Analysis.pdf
PPTX
6ME3A-Unit-II-Sensors and Actuators_Handouts.pptx
PPTX
Information Storage and Retrieval Techniques Unit III
PPTX
Sorting and Hashing in Data Structures with Algorithms, Techniques, Implement...
PDF
A SYSTEMATIC REVIEW OF APPLICATIONS IN FRAUD DETECTION
PPTX
"Array and Linked List in Data Structures with Types, Operations, Implementat...
PDF
Exploratory_Data_Analysis_Fundamentals.pdf
PDF
Soil Improvement Techniques Note - Rabbi
PDF
Artificial Superintelligence (ASI) Alliance Vision Paper.pdf
PDF
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
PDF
August 2025 - Top 10 Read Articles in Network Security & Its Applications
PDF
UNIT no 1 INTRODUCTION TO DBMS NOTES.pdf
Design Guidelines and solutions for Plastics parts
Feature types and data preprocessing steps
Artificial Intelligence
Influence of Green Infrastructure on Residents’ Endorsement of the New Ecolog...
R24 SURVEYING LAB MANUAL for civil enggi
AUTOMOTIVE ENGINE MANAGEMENT (MECHATRONICS).pptx
22EC502-MICROCONTROLLER AND INTERFACING-8051 MICROCONTROLLER.pdf
CURRICULAM DESIGN engineering FOR CSE 2025.pptx
Visual Aids for Exploratory Data Analysis.pdf
6ME3A-Unit-II-Sensors and Actuators_Handouts.pptx
Information Storage and Retrieval Techniques Unit III
Sorting and Hashing in Data Structures with Algorithms, Techniques, Implement...
A SYSTEMATIC REVIEW OF APPLICATIONS IN FRAUD DETECTION
"Array and Linked List in Data Structures with Types, Operations, Implementat...
Exploratory_Data_Analysis_Fundamentals.pdf
Soil Improvement Techniques Note - Rabbi
Artificial Superintelligence (ASI) Alliance Vision Paper.pdf
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
August 2025 - Top 10 Read Articles in Network Security & Its Applications
UNIT no 1 INTRODUCTION TO DBMS NOTES.pdf

L4 speeding-up-execution

  • 1. General Aspects of Computer Organization (Lecture-4) R S Ananda Murthy Associate Professor and Head Department of Electrical & Electronics Engineering, Sri Jayachamarajendra College of Engineering, Mysore 570 006 R S Ananda Murthy General Aspects of Computer Organization
  • 2. Data Path In Side CPU A B A+B Registers ALU Input Register ALU ALU Output Register ALU Input Bus BA A+B Feeding two operands to the ALU and storing the output of ALU in an internal register is called data path cycle. Faster data path cycle results in faster program execution. Multiple ALUs operating in parallel results in faster data path cycle. R S Ananda Murthy General Aspects of Computer Organization
  • 3. RISC Design Speeds Up Program Execution Most manufacturers today implement the following features in their processors to improve performance – All instructions are directly executed by hardware instead of being interpreted by a microprogram. Maximize the rate at which instructions are issued by adopting instruction-level parallelism. Use simple fixed-lengh instructions to speed-up decoding. Avoid performing arithmetic and logical operations directly on data present in the memory i.e., only LOAD and STORE instructions should be executed with reference to memory. Provide plently of registers in side the CPU. R S Ananda Murthy General Aspects of Computer Organization
  • 4. Pipelining for High Performance Number of stages in a pipeline varies depending upon the hardware design of the CPU. Each stage in a pipeline is executed by a dedicated hardware unit in side the CU. Each stage in a pipeline takes the same amount of time to complete its task. Hardware units of different stages in a pipeline can work concurrently. Operation of hardware units is synchronized by the clock signal. To implement pipelining instructions must be of fixed length and same instruction cycle time. Pipelining requires sophisticated compiling techniques to be implemented in the compiler. R S Ananda Murthy General Aspects of Computer Organization
  • 5. A 4-Stage Pipeline Clock Cycle W1E1D1F1 Instruction I1 W2E2D2F2I2 W3E3D3F3I3 W4E4D4F4I4 1 2 3 4 5 6 7 Time Hardware Stages in Pipeline F: Fetch instruction D: Decode and get operands E: Execute the instruction W: Write result at destination Period of clock signal No. of stages in pipeline F: Fetch Instruction D: Decode and get operands E: Execute operation W: Write results B1 B2 B3 B1, B2, and B3 are storage buffers. Information is passed from one stage to the next through storage buffers. Time taken to execute each instruction is nT. Processor Band Width is 1/(T106) MIPS (Million Instructions Per Second). R S Ananda Murthy General Aspects of Computer Organization
  • 6. Superscalar Architecture Instruction decode unit Operand fetch unit Instruction execution unit Write back unit S2 S3 S4 S5 Instruction decode unit Operand fetch unit Instruction execution unit Write back unit Instruction fetch unit S1 Superscalar architecture has multiple pipelines as shown above. In the above example, a single fetch unit fetches a pair of instructions together and puts each one into its own pipeline, complete with its own ALU for parallel operation. Compiler must ensure that the two instructions fetched do not conflict over resource usage. R S Ananda Murthy General Aspects of Computer Organization
  • 7. Superscalar Architecture with Five Functional Units Instruction decode unit Operand fetch unit LOAD Write back unit S2 S3 S4 S5S1 Instruction fetch unit ALU ALU STORE Floating Point Now-a-days the word “superscalar” is used to describe processors that issue multiple instructions – often four to six – in a single clock cycle. Superscalar processors generally have one pipeline with multiple functional units as shown above. R S Ananda Murthy General Aspects of Computer Organization
  • 8. License This work is licensed under a Creative Commons Attribution 4.0 International License. R S Ananda Murthy General Aspects of Computer Organization