SlideShare a Scribd company logo
Multi-threaded RTOS
How Multi-threading can increase
on-chip parallelism
Outline
 Introduction
 Multi-threading models
 Architectures of multi-threaded processors
 Simultaneous multi-threading and multi-
processors
 Cache design
 Examples of Multi-threaded environments
 Conclusions
Introduction
 Two forms of parallelism
 instruction-level parallelism (ILP)
 thread-level parallelism (TLP)
 Both identify independent instructions that can execute in parallel
 Wide-issue superscalar processors exploit ILP by executing multiple
instructions from a single program in a single cycle.
 Multiprocessors exploit TLP by executing different threads in parallel
on different processors.
 The first multi-threaded processor approaches in the 1970s and
1980s applied multi-threading at user-thread-level to solve the
memory access latency problem.
Introduction
 Motivations for multi-threaded processor architecture development
include chip area , cost and complexity.
 Simultaneous Multi-threading (SMT),
 Single chip multiprocessing (CMP),
 SMT VLIW architecture,
 Multithreaded Vector (SMV) architecture
 DSP applications inherently benefit from the following architectural
characteristics:
 Parallelization at multiple levels of hierarchy:
 - Instruction - separate instruction memory space
 - Data – separate date memory space
 - Thread- multiple functional units
 - Data transfer – multiple wide data buses
Vertical and Horizontal Waste
 Vertical waste is
introduced when the
processor issues no
instructions in a cycle
 Horizontal waste when
not all issue slots can
be filled in a cycle.
Vertical and Horizontal Waste
Multi-threaded Models
 Fine-Grain Multithreading
 Only one thread issues instructions
each cycle, but it can use the entire
issue width of the processor.
 SM: full Simultaneous Issue
 Single
 Dual
 Four
 SM: limited Connection
 Hardware context is connected
directly one of each type of
functional units.
 Less dynamic
Performance
SMT VLIW Architecture
Simultaneous Vector Multi-threaded Architecture (SVMT)
SMT vs. Multiprocessing
Cache design
Examples Multi-threaded RTOS
 Analog Devices VDK
 uClinux
 The RTXC Quadros RTOS
 RTCX/ss
 RTXC/ss
 ThreadX
Conclusions
 A simultaneous multithreaded architecture is superior in
performance to a multiple-issue multiprocessor (multi-issue CMP).
 SMT boost utilization by dynamically scheduling functional units
among multiple threads.
 SMT also increases hardware design flexibility.
 Simultaneous multithreading increases the complexity of instruction
scheduling.
 Increased parallelism offered makes multi-threading ideal for DSP
applications where each application can run as a separate thread.

More Related Content

PPT
Data race
PPTX
Graphics processing uni computer archiecture
PPTX
Multithreading computer architecture
PPT
Hardware multithreading
PPT
PDF
What is simultaneous multithreading
PDF
Multithreaded processors ppt
Data race
Graphics processing uni computer archiecture
Multithreading computer architecture
Hardware multithreading
What is simultaneous multithreading
Multithreaded processors ppt

What's hot (20)

PPT
PPTX
Parallel processing (simd and mimd)
PPT
Chapter 19 - Real Time Systems
PPTX
Real Time Kernels
PDF
Real Time Operating System Concepts
PPT
PPTX
Real time Operating System
PPT
Parallel processing Concepts
PDF
RTOS implementation
PPTX
Superscalar & superpipeline processor
PPTX
Real Time Operating System
PPTX
Process, Threads, Symmetric Multiprocessing and Microkernels in Operating System
PDF
Real-time soultion
PPTX
Pipelining, processors, risc and cisc
PDF
Array Processor
PDF
Multiprocessor
PPTX
PDF
Rtos princples adn case study
PPT
Flynns classification
Parallel processing (simd and mimd)
Chapter 19 - Real Time Systems
Real Time Kernels
Real Time Operating System Concepts
Real time Operating System
Parallel processing Concepts
RTOS implementation
Superscalar & superpipeline processor
Real Time Operating System
Process, Threads, Symmetric Multiprocessing and Microkernels in Operating System
Real-time soultion
Pipelining, processors, risc and cisc
Array Processor
Multiprocessor
Rtos princples adn case study
Flynns classification
Ad

Similar to Multi threaded rtos (20)

PPTX
Ca alternative architecture
DOCX
Multi-Core on Chip Architecture *doc - IK
PPTX
Topic 4- processes.pptx
PDF
Flynn taxonomies
PDF
Ef35745749
PDF
Module 1 of apj Abdul kablam university hpc.pdf
PPTX
PPTX
Clustering
PDF
Multithreading
PPT
Computer Architecture: A quantitative approach - Cap4 - Section 1
PPTX
Cloud computing and distributed systems.
PPTX
Cloud computing: Parallel and distributed processing.
PPT
Os Threads
PDF
threads (1).pdfmjlkjfwjgliwiufuaiusyroayr
PDF
CS304PC:Computer Organization and Architecture UNIT V_merged_merged.pdf
PPTX
Advanced processor principles
PPT
Operating Systems - "Chapter 4: Multithreaded Programming"
PPTX
processor struct
PPTX
Modern processor art
PDF
CH04.pdf
Ca alternative architecture
Multi-Core on Chip Architecture *doc - IK
Topic 4- processes.pptx
Flynn taxonomies
Ef35745749
Module 1 of apj Abdul kablam university hpc.pdf
Clustering
Multithreading
Computer Architecture: A quantitative approach - Cap4 - Section 1
Cloud computing and distributed systems.
Cloud computing: Parallel and distributed processing.
Os Threads
threads (1).pdfmjlkjfwjgliwiufuaiusyroayr
CS304PC:Computer Organization and Architecture UNIT V_merged_merged.pdf
Advanced processor principles
Operating Systems - "Chapter 4: Multithreaded Programming"
processor struct
Modern processor art
CH04.pdf
Ad

More from James Wong (20)

PPT
Recursion
PPTX
Business analytics and data mining
PPTX
Data mining and knowledge discovery
PPTX
Cache recap
PPTX
Big picture of data mining
PPTX
How analysis services caching works
PPTX
Optimizing shared caches in chip multiprocessors
PPTX
Directory based cache coherence
PPT
Abstract data types
PPTX
Abstraction file
PPTX
Hardware managed cache
PPTX
Object model
PPT
Abstract class
PPTX
Object oriented analysis
PPTX
Concurrency with java
PPTX
Data structures and algorithms
PPTX
Cobol, lisp, and python
PPTX
Inheritance
PPTX
Api crash
PPTX
Learning python
Recursion
Business analytics and data mining
Data mining and knowledge discovery
Cache recap
Big picture of data mining
How analysis services caching works
Optimizing shared caches in chip multiprocessors
Directory based cache coherence
Abstract data types
Abstraction file
Hardware managed cache
Object model
Abstract class
Object oriented analysis
Concurrency with java
Data structures and algorithms
Cobol, lisp, and python
Inheritance
Api crash
Learning python

Recently uploaded (20)

PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Mobile App Security Testing_ A Comprehensive Guide.pdf
The AUB Centre for AI in Media Proposal.docx
Advanced methodologies resolving dimensionality complications for autism neur...
Review of recent advances in non-invasive hemoglobin estimation
The Rise and Fall of 3GPP – Time for a Sabbatical?
Encapsulation_ Review paper, used for researhc scholars
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Spectral efficient network and resource selection model in 5G networks
Per capita expenditure prediction using model stacking based on satellite ima...
Network Security Unit 5.pdf for BCA BBA.
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Building Integrated photovoltaic BIPV_UPV.pdf
Empathic Computing: Creating Shared Understanding
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...

Multi threaded rtos

  • 1. Multi-threaded RTOS How Multi-threading can increase on-chip parallelism
  • 2. Outline  Introduction  Multi-threading models  Architectures of multi-threaded processors  Simultaneous multi-threading and multi- processors  Cache design  Examples of Multi-threaded environments  Conclusions
  • 3. Introduction  Two forms of parallelism  instruction-level parallelism (ILP)  thread-level parallelism (TLP)  Both identify independent instructions that can execute in parallel  Wide-issue superscalar processors exploit ILP by executing multiple instructions from a single program in a single cycle.  Multiprocessors exploit TLP by executing different threads in parallel on different processors.  The first multi-threaded processor approaches in the 1970s and 1980s applied multi-threading at user-thread-level to solve the memory access latency problem.
  • 4. Introduction  Motivations for multi-threaded processor architecture development include chip area , cost and complexity.  Simultaneous Multi-threading (SMT),  Single chip multiprocessing (CMP),  SMT VLIW architecture,  Multithreaded Vector (SMV) architecture  DSP applications inherently benefit from the following architectural characteristics:  Parallelization at multiple levels of hierarchy:  - Instruction - separate instruction memory space  - Data – separate date memory space  - Thread- multiple functional units  - Data transfer – multiple wide data buses
  • 5. Vertical and Horizontal Waste  Vertical waste is introduced when the processor issues no instructions in a cycle  Horizontal waste when not all issue slots can be filled in a cycle.
  • 7. Multi-threaded Models  Fine-Grain Multithreading  Only one thread issues instructions each cycle, but it can use the entire issue width of the processor.  SM: full Simultaneous Issue  Single  Dual  Four  SM: limited Connection  Hardware context is connected directly one of each type of functional units.  Less dynamic
  • 10. Simultaneous Vector Multi-threaded Architecture (SVMT)
  • 13. Examples Multi-threaded RTOS  Analog Devices VDK  uClinux  The RTXC Quadros RTOS  RTCX/ss  RTXC/ss  ThreadX
  • 14. Conclusions  A simultaneous multithreaded architecture is superior in performance to a multiple-issue multiprocessor (multi-issue CMP).  SMT boost utilization by dynamically scheduling functional units among multiple threads.  SMT also increases hardware design flexibility.  Simultaneous multithreading increases the complexity of instruction scheduling.  Increased parallelism offered makes multi-threading ideal for DSP applications where each application can run as a separate thread.