SlideShare a Scribd company logo
HIGH PERFORMANCE
COMPUTING
LECT_03
Convergence of Parallel
Architectures-I
BATCH: 11BS(IT)
PREPARED BY: MUKHTIAR AHMED
Asst. Prof. I.T Department
2
History
Application Software
System
Software SIMD
Message Passing
Shared Memory
Dataflow
Systolic
Arrays
Architecture
• Uncertainty of direction paralyzed parallel software development!
Historically, parallel architectures tied to programming models
• Divergent architectures, with no predictable pattern of growth.
3
Today
Extension of “computer architecture” to support communication
and cooperation
• OLD: Instruction Set Architecture
• NEW: Communication Architecture
Defines
• Critical abstractions, boundaries, and primitives (interfaces)
• Organizational structures that implement interfaces (hw or sw)
Compilers, libraries and OS are important bridges today
4
Modern Layered Framework
CAD
Multiprogramming Shared
address
Message
passing
Data
parallel
Database Scientific modeling Parallel applications
Programming models
Communication abstraction
User/system boundary
Compilation
or library
Operating systems support
Communication hardware
Physical communication medium
Hardware/software boundary
5
Programming Model
What programmer uses in coding applications
Specifies communication and synchronization
Examples:
• Multiprogramming: no communication or synch. at program level
• Shared address space: like bulletin board
• Message passing: like letters or phone calls, explicit point to point
• Data parallel: more regimented, global actions on data
– Implemented with shared address space or message passing
6
Communication Abstraction
User level communication primitives provided
• Realizes the programming model
• Mapping exists between language primitives of programming model
and these primitives
Supported directly by hw, or via OS, or via user sw
Lot of debate about what to support in sw and gap between layers
Today:
• Hw/sw interface tends to be flat, i.e. complexity roughly uniform
• Compilers and software play important roles as bridges today
• Technology trends exert strong influence
Result is convergence in organizational structure
• Relatively simple, general purpose communication primitives
7
Communication Architecture
= User/System Interface + Implementation
User/System Interface:
• Comm. primitives exposed to user-level by hw and system-level sw
Implementation:
• Organizational structures that implement the primitives: hw or OS
• How optimized are they? How integrated into processing node?
• Structure of network
Goals:
• Performance
• Broad applicability
• Programmability
• Scalability
• Low Cost
8
Evolution of Architectural Models
Historically machines tailored to programming models
• Prog. model, comm. abstraction, and machine organization lumped
together as the “architecture”
Evolution helps understand convergence
• Identify core concepts
• Shared Address Space
• Message Passing
• Data Parallel
Others:
• Dataflow
• Systolic Arrays
Examine programming model, motivation, intended applications, and
contributions to convergence
9
Shared Address Space Architectures
Any processor can directly reference any memory location
• Communication occurs implicitly as result of loads and stores
Convenient:
• Location transparency
• Similar programming model to time-sharing on uniprocessors
– Except processes run on different processors
– Good throughput on multiprogrammed workloads
Naturally provided on wide range of platforms
• History dates at least to precursors of mainframes in early 60s
• Wide range of scale: few to hundreds of processors
Popularly known as shared memory machines or model
• Ambiguous: memory may be physically distributed among processors
10
Shared Address Space Model
Process: virtual address space plus one or more threads of control
Portions of address spaces of processes are shared
•Writes to shared address visible to other threads (in other processes too)
•Natural extension of uniprocessors model: conventional memory
operations for comm.; special atomic operations for synchronization
•OS uses shared memory to coordinate processes
St or e
P1
P2
Pn
P0
Load
P0 pr i vat e
P1 pr i vat e
P2 pr i vat e
Pn pr i vat e
Virtual address spaces for a
collection of processes communicating
via shared addresses
Machine physical address space
Shared portion
of address space
Private portion
of address space
Common physical
addresses
11
Communication Hardware
Also natural extension of uniprocessor
Already have processor, one or more memory modules and I/O
controllers connected by hardware interconnect of some sort
Memory capacity increased by adding modules, I/O by controllers
•Add processors for processing!
•For higher-throughput multiprogramming, or parallel programs
I/O ctrlMem Mem Mem
Interconnect
Mem I/O ctrl
Processor Processor
Interconnect
I/O
dev ices

More Related Content

PPT
3 Tier Architecture
PPTX
Trinity: A Distributed Graph Engine on a Memory Cloud
DOC
Computing notes
PPT
System models in distributed system
PPTX
Software Architecture Patterns
PDF
Lecture 1 introduction to parallel and distributed computing
PPTX
ditributed databases
PPTX
Hpc 4 5
3 Tier Architecture
Trinity: A Distributed Graph Engine on a Memory Cloud
Computing notes
System models in distributed system
Software Architecture Patterns
Lecture 1 introduction to parallel and distributed computing
ditributed databases
Hpc 4 5

Viewers also liked (20)

PPS
Snoopy
PPT
Parallel Programming Primer
PDF
Implementation of FIFO in Linux
PDF
Thread dumps
PDF
Basic Multithreading using Posix Threads
PPTX
Basic Thread Knowledge
PPT
What is [Open] MPI?
PPTX
Thread scheduling in Operating Systems
PPT
Lecture 6
PPTX
message passing vs shared memory
PPT
Operating System-Threads-Galvin
PPT
Chapter 5 pc
PPT
Chapter 4 pc
PPT
Collective Communications in MPI
PPT
Introduction to MPI
PPT
Chapter 1 pc
PPT
Point-to-Point Communicationsin MPI
PPT
Chapter 6 pc
PPT
Chapter 3 pc
PPT
OS Process and Thread Concepts
Snoopy
Parallel Programming Primer
Implementation of FIFO in Linux
Thread dumps
Basic Multithreading using Posix Threads
Basic Thread Knowledge
What is [Open] MPI?
Thread scheduling in Operating Systems
Lecture 6
message passing vs shared memory
Operating System-Threads-Galvin
Chapter 5 pc
Chapter 4 pc
Collective Communications in MPI
Introduction to MPI
Chapter 1 pc
Point-to-Point Communicationsin MPI
Chapter 6 pc
Chapter 3 pc
OS Process and Thread Concepts
Ad

Similar to Hpc 3 (20)

PPTX
Hpc 6 7
PDF
IBM InterConnect 2015 - IIB Effective Application Development
PPT
parallel programming models
PPT
Floating Point Operations , Memory Chip Organization , Serial Bus Architectur...
PPT
Floating Point Operations , Memory Chip Organization , Serial Bus Architectur...
PDF
Lecture 2 more about parallel computing
PPTX
Tech presentation (part 1)
PDF
Elements of systems design
PPTX
Concurrency Programming in Java - 01 - Introduction to Concurrency Programming
PPTX
Distributed Systems.pptx
PDF
Pthread
PPTX
Lect 1 Distributed System.pptx
PDF
Cloud Computing BCS601 Notef of Viswesvaraya University
PPTX
Apos week 1 4
PDF
Azure Cosmos DB - The Swiss Army NoSQL Cloud Database
PPTX
Dedicated fully parallel architecture
ODP
HCE project brief
PDF
Inter-Process Communication in distributed systems
PPT
Elements of Systems Design.ppt
PPTX
Lec 2 (parallel design and programming)
Hpc 6 7
IBM InterConnect 2015 - IIB Effective Application Development
parallel programming models
Floating Point Operations , Memory Chip Organization , Serial Bus Architectur...
Floating Point Operations , Memory Chip Organization , Serial Bus Architectur...
Lecture 2 more about parallel computing
Tech presentation (part 1)
Elements of systems design
Concurrency Programming in Java - 01 - Introduction to Concurrency Programming
Distributed Systems.pptx
Pthread
Lect 1 Distributed System.pptx
Cloud Computing BCS601 Notef of Viswesvaraya University
Apos week 1 4
Azure Cosmos DB - The Swiss Army NoSQL Cloud Database
Dedicated fully parallel architecture
HCE project brief
Inter-Process Communication in distributed systems
Elements of Systems Design.ppt
Lec 2 (parallel design and programming)
Ad

More from Yasir Khan (20)

PPT
Lecture 6
PPT
Lecture 4
PPT
Lecture 3
PPT
Lecture 2
PPT
Lec#1
PPT
Ch10 (1)
PPT
PPT
PPT
Snooping protocols 3
PPT
Snooping 2
PPT
Introduction 1
PPT
Hpc sys
PPTX
Hpc 2
PPTX
Hpc 1
PPT
Flynns classification
PPT
Dir based imp_5
PPT
Natural Language Processing
PPT
Uncertainity
PPT
Logic
PPT
M6 game
Lecture 6
Lecture 4
Lecture 3
Lecture 2
Lec#1
Ch10 (1)
Snooping protocols 3
Snooping 2
Introduction 1
Hpc sys
Hpc 2
Hpc 1
Flynns classification
Dir based imp_5
Natural Language Processing
Uncertainity
Logic
M6 game

Recently uploaded (20)

PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PDF
01-Introduction-to-Information-Management.pdf
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PPTX
Pharma ospi slides which help in ospi learning
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
Basic Mud Logging Guide for educational purpose
PPTX
PPH.pptx obstetrics and gynecology in nursing
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PPTX
master seminar digital applications in india
PDF
Computing-Curriculum for Schools in Ghana
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
VCE English Exam - Section C Student Revision Booklet
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PDF
Pre independence Education in Inndia.pdf
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
O5-L3 Freight Transport Ops (International) V1.pdf
01-Introduction-to-Information-Management.pdf
102 student loan defaulters named and shamed – Is someone you know on the list?
Pharma ospi slides which help in ospi learning
Pharmacology of Heart Failure /Pharmacotherapy of CHF
Basic Mud Logging Guide for educational purpose
PPH.pptx obstetrics and gynecology in nursing
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
master seminar digital applications in india
Computing-Curriculum for Schools in Ghana
2.FourierTransform-ShortQuestionswithAnswers.pdf
STATICS OF THE RIGID BODIES Hibbelers.pdf
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Anesthesia in Laparoscopic Surgery in India
VCE English Exam - Section C Student Revision Booklet
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Pre independence Education in Inndia.pdf
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student

Hpc 3

  • 1. HIGH PERFORMANCE COMPUTING LECT_03 Convergence of Parallel Architectures-I BATCH: 11BS(IT) PREPARED BY: MUKHTIAR AHMED Asst. Prof. I.T Department
  • 2. 2 History Application Software System Software SIMD Message Passing Shared Memory Dataflow Systolic Arrays Architecture • Uncertainty of direction paralyzed parallel software development! Historically, parallel architectures tied to programming models • Divergent architectures, with no predictable pattern of growth.
  • 3. 3 Today Extension of “computer architecture” to support communication and cooperation • OLD: Instruction Set Architecture • NEW: Communication Architecture Defines • Critical abstractions, boundaries, and primitives (interfaces) • Organizational structures that implement interfaces (hw or sw) Compilers, libraries and OS are important bridges today
  • 4. 4 Modern Layered Framework CAD Multiprogramming Shared address Message passing Data parallel Database Scientific modeling Parallel applications Programming models Communication abstraction User/system boundary Compilation or library Operating systems support Communication hardware Physical communication medium Hardware/software boundary
  • 5. 5 Programming Model What programmer uses in coding applications Specifies communication and synchronization Examples: • Multiprogramming: no communication or synch. at program level • Shared address space: like bulletin board • Message passing: like letters or phone calls, explicit point to point • Data parallel: more regimented, global actions on data – Implemented with shared address space or message passing
  • 6. 6 Communication Abstraction User level communication primitives provided • Realizes the programming model • Mapping exists between language primitives of programming model and these primitives Supported directly by hw, or via OS, or via user sw Lot of debate about what to support in sw and gap between layers Today: • Hw/sw interface tends to be flat, i.e. complexity roughly uniform • Compilers and software play important roles as bridges today • Technology trends exert strong influence Result is convergence in organizational structure • Relatively simple, general purpose communication primitives
  • 7. 7 Communication Architecture = User/System Interface + Implementation User/System Interface: • Comm. primitives exposed to user-level by hw and system-level sw Implementation: • Organizational structures that implement the primitives: hw or OS • How optimized are they? How integrated into processing node? • Structure of network Goals: • Performance • Broad applicability • Programmability • Scalability • Low Cost
  • 8. 8 Evolution of Architectural Models Historically machines tailored to programming models • Prog. model, comm. abstraction, and machine organization lumped together as the “architecture” Evolution helps understand convergence • Identify core concepts • Shared Address Space • Message Passing • Data Parallel Others: • Dataflow • Systolic Arrays Examine programming model, motivation, intended applications, and contributions to convergence
  • 9. 9 Shared Address Space Architectures Any processor can directly reference any memory location • Communication occurs implicitly as result of loads and stores Convenient: • Location transparency • Similar programming model to time-sharing on uniprocessors – Except processes run on different processors – Good throughput on multiprogrammed workloads Naturally provided on wide range of platforms • History dates at least to precursors of mainframes in early 60s • Wide range of scale: few to hundreds of processors Popularly known as shared memory machines or model • Ambiguous: memory may be physically distributed among processors
  • 10. 10 Shared Address Space Model Process: virtual address space plus one or more threads of control Portions of address spaces of processes are shared •Writes to shared address visible to other threads (in other processes too) •Natural extension of uniprocessors model: conventional memory operations for comm.; special atomic operations for synchronization •OS uses shared memory to coordinate processes St or e P1 P2 Pn P0 Load P0 pr i vat e P1 pr i vat e P2 pr i vat e Pn pr i vat e Virtual address spaces for a collection of processes communicating via shared addresses Machine physical address space Shared portion of address space Private portion of address space Common physical addresses
  • 11. 11 Communication Hardware Also natural extension of uniprocessor Already have processor, one or more memory modules and I/O controllers connected by hardware interconnect of some sort Memory capacity increased by adding modules, I/O by controllers •Add processors for processing! •For higher-throughput multiprogramming, or parallel programs I/O ctrlMem Mem Mem Interconnect Mem I/O ctrl Processor Processor Interconnect I/O dev ices