SlideShare a Scribd company logo
PMIx: Process Management for
Exascale Environments
Ralph H. Castain, David Solt, Joshua Hursey, Aurelien Bouteiller
EuroMPI/USA 2017, Chicago, IL
OMPI
Spectrum
OSHMEM
SOS
PGAS
others
What is PMIx?
PMI-1 PMI-2
wireup support
dynamic spawn
keyval publish/lookup
MPICH
years go by…
SLURM
ALPS
RM
PGAS
others
2015
Exascale systems
on horizon
Launch times long
New paradigms
2016
Exascale launch
in < 10s
Orchestration
PMIx v1.2
SLURM
JSM
RM
OMPI
Spectrum
OSHMEM
2017
Exascale launch
in < 30s
PMIx v2.x
SLURM
JSM
others
RM
Three Distinct Entities
• PMIx Standard
 Defined set of APIs, attribute strings
 Nothing about implementation
• PMIx Reference Library
 A full-featured implementation of the Standard
 Intended to ease adoption
• PMIx Reference Server
 Full-featured “shim” to a non-PMIx RM
The Community
https://pmix.github.io/pmix
https://github.com/pmix
Job
Script WLM WLM RM
Launch
Cmd
Spawn
Procs
GO
Global
Xchg
Proc
Fabric
NIC
Proc
NIC
Proc
Barrier
FS
Traditional Launch Sequence
Wait for files
& libs
Topo Topo Topo
Fabric
NIC
Fabric
Pro
c
Pro
c
Pro
c
Job
Script WLM WLM RM
Launch
Cmd
Spawn
Procs
GO
Global
Xchg
Proc
Fabric
NIC
Proxy
Proc
Fabric
NIC
Proxy
Proc
Proxy
Barrier
FS
Newer Launch Sequence
Wait for files
& libs
Topo Topo
Fabric
NIC
Topo
PMIx-SMS Interactions
RM
PMIx
Client
FS
Fabric
RAS
APP
Orchestration
Requests
Responses
NIC
Fabric
Mgr
PMIx
Server
MPI
OpenMP
Job
Script
System
Management Stack
Tool Support
PMIx Launch Sequence
*RM daemon, mpirun-daemon, etc.
PMIx/SLURM*
#nodes
MPI_Init(sec)
*LANL/Buffy cluster, 1ppn
PRS**
**PMIx Reference Server v2.0, direct-fetch/async
srun/PMI2
Performance papers coming in 2018!
Similar Requirements
• Notifications/response
 Errors, resource changes
 Negotiated response
• Request allocation changes
 shrink/expand
• Workflow management
 Steered/conditional execution
• QoS requests
 Power, file system, fabric
Multiple,
use-
specific
libs?
(difficult for RM
community to
support)
Single,
multi-
purpose
lib?
PMIx “Standards” Process
• Modifications/additions
 Proposed as RFC
 Include prototype implementation
• Pull request to reference library
 Notification sent to mailing list
• Reviews conducted
 RFC and implementation
 Continues until consensus emerges
• Approval given
 Developer telecon (weekly)
Standards Doc
under
development!
Philosophy
• Generalized APIs
 Few hard parameters
 “Info” arrays to pass information, specify directives
• Easily extended
 Add “keys” instead of modifying API
• Async operations
• Thread safe
• SMS always has right to say “not supported”
 Allow each backend to evaluate what and when to
support something
• Generalized APIs
 Few hard parameters
 “Info” arrays to pass information, specify directives
• Easily extended
 Add “keys” instead of modifying API
• Async operations
• Thread safe
• SMS always has right to say “not supported”
 Allow each backend to evaluate what and when to
support something
Messenger not Doer
APPSMS
Tool
Current Support
• Typical startup operations
 Put, get, commit, barrier,
spawn, [dis]connect,
publish/lookup
• Tool connections
 Debugger, job submission,
query
• Generalized query
support
 Job status, layout, system
data, resource availability
• Event notification
 App, system generated
 Subscribe, chained
 Pre-emption, failures,
timeout warning, …
• Logging (job record)
 Status reports, error output
• Flexible allocations
 Release resources, request
resources
Event Notification Use Case
• Fault detection and reporting
w/ULFM MPI
 ULFM MPI is a fault tolerant
flavor of Open MPI
• Failures may be detected from
the SMS, RAS, or directly by
MPI communications
• Components produce a PMIx
event when detecting an error
• Fault Tolerant components
register for the fault event
• Components propagate fault
events which are then
delivered to registered clients
MPI MPI
PMIx
Server
PMIx
Server
RAS
PMIx
In Pipeline
• Network support
 Security keys, pre-spawn local
driver setup, fabric topology
and status, traffic reports,
fabric manager interaction
• Obsolescence protection
 Automatic cross-version
compatibility
 Container support
• Job control
 Pause, kill, signal, heartbeat,
resilience support
• Generalized data store
• File system support
 Dependency detection
 Tiered storage caching strategies
• Debugger/tool support++
 Automatic rendezvous
 Single interface to all launchers
 Co-launch daemons
 Access fabric info, etc.
• Cross-library interoperation
Summary
We now have an interface library RMs will
support for application-directed requests
Need to collaboratively define
what we want to do with it
Project: https://pmix.github.io/pmix
Reference Implementation: https://github.com/pmix/pmix
Reference Server: https://github.com/pmix/pmix-reference-server

More Related Content

PDF
Algosec 5 more_things_you_can_do_with_a_security_policy_management_solution
PPTX
Splitting the Check on Compliance and Security
PDF
Putting the Sec into DevOps
PDF
Enforce compliance policy with model-driven automation
PDF
AlgoSec Application Migration Webinar
PDF
Migrating and Managing Security in an AWS Environment- Best Practices
PDF
DevSecOps: Putting the Sec into the DevOps
PPTX
Cisco ACI & Hybrid Networks - Breaking Down Silos with Central Policy Management
Algosec 5 more_things_you_can_do_with_a_security_policy_management_solution
Splitting the Check on Compliance and Security
Putting the Sec into DevOps
Enforce compliance policy with model-driven automation
AlgoSec Application Migration Webinar
Migrating and Managing Security in an AWS Environment- Best Practices
DevSecOps: Putting the Sec into the DevOps
Cisco ACI & Hybrid Networks - Breaking Down Silos with Central Policy Management

What's hot (20)

PDF
OSMC 2015: The Assimilation Project by Alan Robertson
PDF
how to simulate ACI
PDF
Multi Layer Monitoring V1
PPT
Create and Manage a Micro-Segmented Data Center – Best Practices
PDF
Best Practices for Workload Security: Securing Servers in Modern Data Center ...
PPTX
Migrating Application Connectivity and Network Security to AWS
PDF
5 things you didn't know you could do with security policy management
PPTX
Reaching PCI Nirvana: Ensure a Successful Audit & Maintain Continuous Compliance
PDF
Embracing the Rise of SecDevOps
PDF
SevOne Scalability
PPTX
SDN's managing security across the virtual network final
PPTX
Tying cyber attacks to business processes, for faster mitigation
PPTX
Drone Hijacking
PPTX
Cisco Firepower Migration | Cisco and AlgoSec Joint Webinar
PPT
Accelerate Application Deployment Across Cisco ACI Fabric, On-Premise Firewal...
PPTX
Vulnerability Discovery in the Cloud
PPTX
Perforce on Tour 2015 - Securing the Helix Platform at Citrix
PPTX
2018 07-24 network security at the speed of dev ops - webinar
PPTX
Sasa milic, cisco advanced malware protection
PPTX
Spirent: The Internet of Things: The Expanded Security Perimeter
OSMC 2015: The Assimilation Project by Alan Robertson
how to simulate ACI
Multi Layer Monitoring V1
Create and Manage a Micro-Segmented Data Center – Best Practices
Best Practices for Workload Security: Securing Servers in Modern Data Center ...
Migrating Application Connectivity and Network Security to AWS
5 things you didn't know you could do with security policy management
Reaching PCI Nirvana: Ensure a Successful Audit & Maintain Continuous Compliance
Embracing the Rise of SecDevOps
SevOne Scalability
SDN's managing security across the virtual network final
Tying cyber attacks to business processes, for faster mitigation
Drone Hijacking
Cisco Firepower Migration | Cisco and AlgoSec Joint Webinar
Accelerate Application Deployment Across Cisco ACI Fabric, On-Premise Firewal...
Vulnerability Discovery in the Cloud
Perforce on Tour 2015 - Securing the Helix Platform at Citrix
2018 07-24 network security at the speed of dev ops - webinar
Sasa milic, cisco advanced malware protection
Spirent: The Internet of Things: The Expanded Security Perimeter
Ad

Similar to EuroMPI 2017 PMIx presentation (20)

PPTX
PMIx Tiered Storage Support
PPTX
SC'17 BoF Presentation
PPTX
PMIx: Bridging the Container Boundary
PPTX
SC'16 PMIx BoF Presentation
PPTX
SC'18 BoF Presentation
PPTX
Live Application and Infrastructure Monitoring and Root Cause Log Analysis wi...
PPTX
SC15 PMIx Birds-of-a-Feather
PDF
20140708 - Jeremy Edberg: How Netflix Delivers Software
PDF
Threat intelligence solution
PPTX
Netflix Cloud Architecture and Open Source
PPTX
Exploring the Final Frontier of Data Center Orchestration: Network Elements -...
PDF
Microservices
PPTX
Event Bus as Backbone for Decoupled Microservice Choreography - Lecture and W...
PDF
Pm ix tutorial-june2019-pub (1)
PDF
Music city data Hail Hydrate! from stream to lake
PDF
2008-03-06 Harris Corp Security Seminar
PPTX
Event Bus as Backbone for Decoupled Microservice Choreography (JFall 2017)
PDF
Netflix Cloud Platform and Open Source
PDF
Monitoring in Big Data Platform - Albert Lewandowski, GetInData
PPTX
Take control of your DevOps Dumping Ground; Melissa Sussmann
PMIx Tiered Storage Support
SC'17 BoF Presentation
PMIx: Bridging the Container Boundary
SC'16 PMIx BoF Presentation
SC'18 BoF Presentation
Live Application and Infrastructure Monitoring and Root Cause Log Analysis wi...
SC15 PMIx Birds-of-a-Feather
20140708 - Jeremy Edberg: How Netflix Delivers Software
Threat intelligence solution
Netflix Cloud Architecture and Open Source
Exploring the Final Frontier of Data Center Orchestration: Network Elements -...
Microservices
Event Bus as Backbone for Decoupled Microservice Choreography - Lecture and W...
Pm ix tutorial-june2019-pub (1)
Music city data Hail Hydrate! from stream to lake
2008-03-06 Harris Corp Security Seminar
Event Bus as Backbone for Decoupled Microservice Choreography (JFall 2017)
Netflix Cloud Platform and Open Source
Monitoring in Big Data Platform - Albert Lewandowski, GetInData
Take control of your DevOps Dumping Ground; Melissa Sussmann
Ad

Recently uploaded (20)

PDF
KodekX | Application Modernization Development
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Modernizing your data center with Dell and AMD
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Machine learning based COVID-19 study performance prediction
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPT
Teaching material agriculture food technology
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
KodekX | Application Modernization Development
Building Integrated photovoltaic BIPV_UPV.pdf
Network Security Unit 5.pdf for BCA BBA.
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Diabetes mellitus diagnosis method based random forest with bat algorithm
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Modernizing your data center with Dell and AMD
NewMind AI Weekly Chronicles - August'25 Week I
Machine learning based COVID-19 study performance prediction
20250228 LYD VKU AI Blended-Learning.pptx
The AUB Centre for AI in Media Proposal.docx
Review of recent advances in non-invasive hemoglobin estimation
Dropbox Q2 2025 Financial Results & Investor Presentation
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Per capita expenditure prediction using model stacking based on satellite ima...
Teaching material agriculture food technology
Agricultural_Statistics_at_a_Glance_2022_0.pdf

EuroMPI 2017 PMIx presentation

  • 1. PMIx: Process Management for Exascale Environments Ralph H. Castain, David Solt, Joshua Hursey, Aurelien Bouteiller EuroMPI/USA 2017, Chicago, IL
  • 2. OMPI Spectrum OSHMEM SOS PGAS others What is PMIx? PMI-1 PMI-2 wireup support dynamic spawn keyval publish/lookup MPICH years go by… SLURM ALPS RM PGAS others 2015 Exascale systems on horizon Launch times long New paradigms 2016 Exascale launch in < 10s Orchestration PMIx v1.2 SLURM JSM RM OMPI Spectrum OSHMEM 2017 Exascale launch in < 30s PMIx v2.x SLURM JSM others RM
  • 3. Three Distinct Entities • PMIx Standard  Defined set of APIs, attribute strings  Nothing about implementation • PMIx Reference Library  A full-featured implementation of the Standard  Intended to ease adoption • PMIx Reference Server  Full-featured “shim” to a non-PMIx RM
  • 5. Job Script WLM WLM RM Launch Cmd Spawn Procs GO Global Xchg Proc Fabric NIC Proc NIC Proc Barrier FS Traditional Launch Sequence Wait for files & libs Topo Topo Topo Fabric NIC Fabric
  • 6. Pro c Pro c Pro c Job Script WLM WLM RM Launch Cmd Spawn Procs GO Global Xchg Proc Fabric NIC Proxy Proc Fabric NIC Proxy Proc Proxy Barrier FS Newer Launch Sequence Wait for files & libs Topo Topo Fabric NIC Topo
  • 8. PMIx Launch Sequence *RM daemon, mpirun-daemon, etc.
  • 9. PMIx/SLURM* #nodes MPI_Init(sec) *LANL/Buffy cluster, 1ppn PRS** **PMIx Reference Server v2.0, direct-fetch/async srun/PMI2 Performance papers coming in 2018!
  • 10. Similar Requirements • Notifications/response  Errors, resource changes  Negotiated response • Request allocation changes  shrink/expand • Workflow management  Steered/conditional execution • QoS requests  Power, file system, fabric Multiple, use- specific libs? (difficult for RM community to support) Single, multi- purpose lib?
  • 11. PMIx “Standards” Process • Modifications/additions  Proposed as RFC  Include prototype implementation • Pull request to reference library  Notification sent to mailing list • Reviews conducted  RFC and implementation  Continues until consensus emerges • Approval given  Developer telecon (weekly) Standards Doc under development!
  • 12. Philosophy • Generalized APIs  Few hard parameters  “Info” arrays to pass information, specify directives • Easily extended  Add “keys” instead of modifying API • Async operations • Thread safe • SMS always has right to say “not supported”  Allow each backend to evaluate what and when to support something
  • 13. • Generalized APIs  Few hard parameters  “Info” arrays to pass information, specify directives • Easily extended  Add “keys” instead of modifying API • Async operations • Thread safe • SMS always has right to say “not supported”  Allow each backend to evaluate what and when to support something Messenger not Doer APPSMS Tool
  • 14. Current Support • Typical startup operations  Put, get, commit, barrier, spawn, [dis]connect, publish/lookup • Tool connections  Debugger, job submission, query • Generalized query support  Job status, layout, system data, resource availability • Event notification  App, system generated  Subscribe, chained  Pre-emption, failures, timeout warning, … • Logging (job record)  Status reports, error output • Flexible allocations  Release resources, request resources
  • 15. Event Notification Use Case • Fault detection and reporting w/ULFM MPI  ULFM MPI is a fault tolerant flavor of Open MPI • Failures may be detected from the SMS, RAS, or directly by MPI communications • Components produce a PMIx event when detecting an error • Fault Tolerant components register for the fault event • Components propagate fault events which are then delivered to registered clients MPI MPI PMIx Server PMIx Server RAS PMIx
  • 16. In Pipeline • Network support  Security keys, pre-spawn local driver setup, fabric topology and status, traffic reports, fabric manager interaction • Obsolescence protection  Automatic cross-version compatibility  Container support • Job control  Pause, kill, signal, heartbeat, resilience support • Generalized data store • File system support  Dependency detection  Tiered storage caching strategies • Debugger/tool support++  Automatic rendezvous  Single interface to all launchers  Co-launch daemons  Access fabric info, etc. • Cross-library interoperation
  • 17. Summary We now have an interface library RMs will support for application-directed requests Need to collaboratively define what we want to do with it Project: https://pmix.github.io/pmix Reference Implementation: https://github.com/pmix/pmix Reference Server: https://github.com/pmix/pmix-reference-server