SlideShare a Scribd company logo
Confidential
Better Faster Greener™ © 2022 Supermicro
Confidential
Supermicro’s Universal GPU: Modular, Standards
Based and Built for the Future
Josh Grossman,
Principal Product Manager
April, 2022
Better Faster Greener™ © 2022 Supermicro
Confidential
Agenda
• Introduction to AI Market
• Universal GPU Systems with MI250
• Martin Huarte on AMD GPU Software Stack
v
Confidential
AI Market Projection
• 13 trillion dollar overall market Size According to Mckinsey
• AI market size (USA) will expand at a Compound Annual Growth
Rate (CAGR) of 40.2% from 2021 to 2028.
• 83% of companies share that having access to AI is a top priority
in their business plans.
v
Confidential
4/20/2022 Better Faster Greener™ © 2022 Supermicro
5
AI augmentation” will create $2.9trn of “business value” and save 6.2bn
man-hours globally. A survey by McKinsey last year estimated that AI
analytics could add around $13trn, or 16%, to annual global GDP by 2030.
Retail and logistics stand to gain most (see chart 2). (Economist, 2022)
Confidential
4/20/2022 Better Faster Greener™ © 2022 Supermicro
6
Confidential
4/20/2022 Better Faster Greener™ © 2022 Supermicro
7
Confidential
Rack Scale AI Solutions
4/20/2022 Better Faster Greener™ © 2022 Supermicro
8
Confidential
9
Summary Universal GPU Server
• The Most Optimized and Flexible GPU Server Platform available today
o CPU MB Support
• AMD H12 Milan
• Intel X12 Ice Lake
o GPU Support
• NVIDIA Redstone with GPU to GPU NVLink
• AMD MI-250 with Infinity Fabric xGMI
• Traditional PCIe Form Factor GPU
• Modular Design for Flexibility
• Improved Thermal Capability
o Support up to 500W/700W GPU, 280W AMD CPU and 350W/400W Intel CPU
• 1U Expansion Module available for all 4U Servers
UBB/OAM
Intel PVC
Redstone
AMD MI-
250
PCIe
Supermicro Confidential
Confidential
4U/5U Rackmount
Dual X12, X13 and H12
Processors 32 DIMM Slots
Up to 10 PCIe Low Profile 5.0
Slots
Up to 10 PCIe with up to 2
AIOM/OCP 3.0 NIC Slots
Up to 10 Drives of 2.5”
NVMe/SAS/SATA
4x 3000W
Redundant Titanium (2+2) /
Platinum Level Power Supplies
Universal GPU Product Series
4U uGPU
Universal GPU Server
Performance
Modular, Standards Based
Modular design supports a variety of GPU
technologies and configurations
Supports industry leading high performance GPUs
from NVIDIA, AMD and Intel.
Standardize on one GPU Platform for all your
data center needs
Next Generation Supermicro Universal GPU Servers
Subject to change without notice
10 Better Faster Greener™ © 2022 Supermicro
5U uGPU
Universal GPU with 1 U Expansion Module
One GPU Platform
11
Universal Design and AMD Instinct MI250 OAM
Supermicro Confidential/Internal Only
• Significant HPC performance increase
over competition
• Also good for AI/ML workloads
• 128GB HBM2e ECC Memory per OAM
• GPU to GPU xGMI Infinity Fabric 2.5TB/s
CONFIDENTIAL
AMD Tools & Solutions for AI/ML and HPC
4/20/2022 Better Faster Greener™ © 2021 Supermicro
12
RTM
Reverse Time Migration
Datacenter Tools: Profilers & Debuggers, Comm & Math Libraries, Compiler
Code Reuse: ONNX Run-time, existing deep learning, HPC code
Cross Platform: Open source, supports AMD CPUs, CPU, non-AMD GPUs
3RD GEN AMD INFINITY
ARCHITECTURE
FIRST MULTI-CHIP GPU
• Highest performance
• Bigger GPU memory
• Higher Flops (FP64, FP32, FP16)
Confidential
Better Faster Greener™ © 2021 Supermicro
13
Specifications
CPU – Dual Socket
Dual AMD EPYC 7003 CPUs (Socket SP3)
up to 280W, 128 Cores/256 Threads
Memory – 32 DIMM Slots
32 DIMM, 8TB Reg. ECC DDR4 up to
3200MHz
Drives – 10 2.5” Drive-bay
Up to 10x HS NVMe U.2 connect to PCIe
Switch or 10x HS 2.5” SATA
Expansion – 8 PCIe Slots
8x PCIe 4.0 x16 LP (via PLX switch)
I/O ports
1x VGA, 1x COM Header, 2x USB 3.0, and
1x Dedicated IPMI
Power Supply
4x 3000W (2+2) Titanium Level efficiency
power supplies
4U AMD EPYC 7003 Dual CPUs and Four AMD MI250 GPUs
Universal GPU System AMD AS -4124GQ-TNMI
Subject to change without notice
Key Features
Universal GPU Server Standards Based Design
Modular by Design for Flexibility/Future Proofed
Improved Thermal Capability
Key Applications
Perfect Platform for HPC applications
Data Center Infrastructure
System Rear View
System Front View
Supermicro Confidential/Internal Only
Confidential
Better Faster Greener™ © 2021 Supermicro
14
Specifications
CPU – Dual Socket
Dual AMD EPYC 7003 CPUs (Socket SP3)
up to 280W, 128 Cores/256 Threads
Memory – 32 DIMM Slots
32 DIMM, 8TB Reg. ECC DDR4 up to
3200MHz
Drives – 10 2.5” Drive-bay
Up to 10x HS NVMe U.2 connect to PCIe
Switch or 10x HS 2.5” SATA
Expansion – 10 PCIe Slots
8x PCIe 4.0 x16 LP (via PLX switch)
2x PCIe 4.0 x16 LP or AIOM (via CPU w/
1U add-on)
I/O ports
1x VGA, 1x COM Header, 2x USB 3.0, and
1x Dedicated IPMI
Power Supply
4x 3000W (2+2) Titanium Level efficiency
power supplies
5U AMD EPYC 7003 Dual CPUs and Four AMD MI250 GPUs
Universal GPU System AMD AS -4124GQ-TNMI
Subject to change without notice
Key Features
Universal GPU Server Standards Based Design
Modular by Design for Flexibility/Future Proofed
Improved Thermal Capability
Key Applications
Perfect Platform for HPC applications
Data Center Infrastructure
System Rear View
System Front View
Supermicro Confidential/Internal Only
Driving Innovation and
Discovery with AMD Instinct™
accelerators on ROCm™ Stack
Martin Huarte, Ph.D.
Developer Relations Manager, martin.huarte@amd.com
16
[AMD Official Use Only]
Open APIs
Open
Libraries
Compilers
Developer
Tools
Kernel /
Runtime
HPC
Frameworks
ISV Apps
Open-
Source
Codes
Operating
Systems
Deployment
Tools
Mgmt Tools
ML
Frameworks
17
[AMD Official Use Only]
Drivers/Runtimes
Programming
models
Libraries
Compilers & Tools
Deployment Tools
Compiler
OpenMP API HIP API OpenCL™
RedHat, CentOS, SLES & Ubuntu Device Drivers and Run-Time
BLAS FFT
RAND
SPARSE
Debugger
Profiler
ROCm Validation Suite ROCm Data Center Tool
SOLVER
TENSILE
ALUTION THRUST MIOpen
MIVisionX
Tracer
RCCL
MIGraphX PRIM
hipify
ROCm SMI
18
[AMD Official Use Only]
AMD Infinity Hub
Containerized HPC Apps and ML Frameworks
Purpose-built accelerators for HPC and AI workloads
Full range of leading OEMs/ODMs supplying AMD
Accelerated systems to HPC and AI market segments
Open software platform for developers to build
HPC applications on AMD Accelerators
Single location for researchers and data scientists to
download containerized HPC apps and ML
frameworks
Compilers, Libraries, Dev
Tools, APIs, Kernels/Runtimes
Validated, Optimized Systems & Platforms
19
[AMD Official Use Only]
DRIVING MAINSTREAM ADOPTION & ECOSYSTEM ENABLEMENT
19
EXPANDED
OPTIMIZED
ENABLING
SUPPORT FOR AMD INSTINCT™
MI200 & AMD RADEON™ PRO
W6800 GPUS
COMPILER & LIBRARY
OPTIMIZATIONS FOR HPC &
AI/ML
NEW ROCm DOCUMENTATION
PORTAL & IMPROVED DEBUG
TOOLS
20
[AMD Official Use Only]
Re-architected ROCm Documentation
 Support Guides
 Installation & Deployment Guides
 API / SDK Documentation
Access to ROCm Learning Center
 GPU programming tutorials, videos and labs
https://docs.amd.com/
21
[AMD Official Use Only]
Molecular Dynamics Academic / Research Oil & Gas / Geoscience
NAMD
LAMMPS
GROMACS
Computer Aided
Engineering (CAE)
Weather Machine Learning
Reverse Time Migration (RTM) –
miniMOD sample
SPECFEM3D (Cartesian)
SPECFEM3D (Globe)
CP2K
Quantum Espresso
NWChem
VASP
MPAS
TempoQuest AceCAST
ICON
NEMO
Chroma
MILC
GRID
TensorFlow
PyTorch
ONNX-Runtime
MLPerf
AMBER
OpenMM
Relion
Quantum Chemistry Quantum Physics
OpenFOAM® (CFD)
PYFR (CFD)
Cascade CharLES (CFD)
Ansys Mechanical (FEA)
Target availability 1H22
22
[AMD Official Use Only]
AMD INFINITY HUB ROCm™ APP CATALOG
COMMERCIAL ISVs
[LINK]
[LINK]
•
•
•
•
•
*Ansys Mechanical 2022 R2, Cascade CharLES, TempoQuest AceCAST
23
[AMD Official Use Only]
• HPC Apps: CHROMA*, CP2K*, GRID*, GROMACS*,
HACC, LAMMPS, MILC, NAMD*, OpenMM*, Relion,
SPECFEM3D (Cartesian)*, SPECFEM3D (Globe)*
• HPC Apps: AMBER*, ICON, MPAS, NWCHEM,
OpenFOAM, PYFR, QuantumEspresso, WRF, NEMO
• AI/ML: PyTorch*, TensorFlow*
• Benchmarks: HPL, NBODY
• Benchmarks: MLPerf (SSD, Resnet50, Transformer),
HPCG
Additional MI200 Support Planned for 1H22
MI200 Support Planned for 2H21
* Available on InfinityHub with MI100 support today
Performance Results for Select Apps / Benchmarks
24
[AMD Official Use Only]
 AMD Instinct GPUs:
 AMD Instinct™ MI210 GPU page: https://www.amd.com/en/products/server-accelerators/instinct-mi210
 AMD Instinct™ MI Series Product Page: https://www.amd.com/en/graphics/instinct-server-accelerators
 AMD Instinct™ HPC Solutions Page: https://www.amd.com/en/graphics/servers-instinct-mi-powered-servers
 AMD Instinct™ Machine Learning Solutions Page: https://www.amd.com/en/graphics/servers-instinct-deep-learning
 AMD CDNA2 Architecture: https://www.amd.com/en/technologies/cdna2
 CDNA2 WP: https://www.amd.com/system/files/documents/amd-cdna2-white-paper.pdf
 AMD ROCm™ open software platform:
 AMD ROCm™ pages: https://www.amd.com/en/graphics/servers-solutions-rocm
 AMD Infinity Hub: https://www.amd.com/en/technologies/infinity-hub
 AMD Accelerator Cloud: https://www.amd.com/en/solutions/accelerated-computing
 ROCm Information Portal (DOCs & Learning Ctr.): https://docs.amd.com/
 HPC & AMD page: www.AMD.com/HPC
For AMD Instinct™ GPU and ROCm™ marketing assets, contact: Guy.Ludden@AMD.com or
Sydney.Freeman@AMD.com
Confidential
Thank You
25 Better Faster Greener™ © 2022 Supermicro
Please Contact us for Details:
Josh Grossman,
Principal Product Manager, Supermicro
joshg@supermicro.com
Martin Huarte, Ph.D.,
Developer Relations Manager, AMD
martin.huarte@amd.com
Confidential
DISCLAIMER
Super Micro Computer, Inc. may make changes to specifications and product descriptions at any time, without notice. The
information presented in this document is for informational purposes only and may contain technical inaccuracies, omissions
and typographical errors. Any performance tests and ratings are measured using systems that reflect the approximate
performance of Super Micro Computer, Inc. products as measured by those tests. Any differences in software or hardware
configuration may affect actual performance, and Super Micro Computer, Inc. does not control the design or implementation of
third party benchmarks or websites referenced in this document. The information contained herein is subject to change and may
be rendered inaccurate for many reasons, including but not limited to any changes in product and/or roadmap, component and
hardware revision changes, new model and/or product releases, software changes, firmware changes, or the like. Super Micro
Computer, Inc. assumes no obligation to update or otherwise correct or revise this information.
SUPER MICRO COMPUTER, INC. MAKES NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE
CONTENTS HEREOF AND ASSUMES NO RESPONSIBILITY FOR ANY INACCURACIES, ERRORS OR OMISSIONS THAT
MAY APPEAR IN THIS INFORMATION.
SUPER MICRO COMPUTER, INC. SPECIFICALLY DISCLAIMS ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR
FITNESS FOR ANY PARTICULAR PURPOSE. IN NO EVENT WILL SUPER MICRO COMPUTER, INC. BE LIABLE TO ANY
PERSON FOR ANY DIRECT, INDIRECT, SPECIAL OR OTHER CONSEQUENTIAL DAMAGES ARISING FROM THE USE OF
ANY INFORMATION CONTAINED HEREIN, EVEN IF SUPER MICRO COMPUTER, Inc. IS EXPRESSLY ADVISED OF THE
POSSIBILITY OF SUCH DAMAGES.
ATTRIBUTION
© 2022 Super Micro Computer, Inc. All rights reserved.
4/20/2022 Better Faster Greener™ © 2021 Supermicro
26
Confidential
www.supermicro.com
Thank You

More Related Content

PPTX
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable
PPTX
The Power of HPC with Next Generation Supermicro Systems
PPTX
Supermicro Servers with Micron DDR5 & SSDs: Accelerating Real World Workloads
PPTX
Modular by Design: Supermicro’s New Standards-Based Universal GPU Server
PDF
The Path to "Zen 2"
 
PPTX
New Accelerated Compute Infrastructure Solutions from Supermicro
PPTX
Hot Chips: AMD Next Gen 7nm Ryzen 4000 APU
 
PPTX
Past Present and Future of CXL
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable
The Power of HPC with Next Generation Supermicro Systems
Supermicro Servers with Micron DDR5 & SSDs: Accelerating Real World Workloads
Modular by Design: Supermicro’s New Standards-Based Universal GPU Server
The Path to "Zen 2"
 
New Accelerated Compute Infrastructure Solutions from Supermicro
Hot Chips: AMD Next Gen 7nm Ryzen 4000 APU
 
Past Present and Future of CXL

What's hot (20)

PDF
Virtualization with KVM (Kernel-based Virtual Machine)
PPTX
Modular by Design: Supermicro’s New Standards-Based Universal GPU Server
PDF
AMD: Where Gaming Begins
 
PPTX
Introduction to Serverless and Google Cloud Functions
PPTX
Accelerating Innovation from Edge to Cloud
PDF
AMD EPYC™ Microprocessor Architecture
 
PDF
The kvm virtualization way
PPTX
AMD Chiplet Architecture for High-Performance Server and Desktop Products
 
PDF
An Introduction to Kubernetes
PPTX
High Performance Object Storage in 30 Minutes with Supermicro and MinIO
PPTX
Microchip: CXL Use Cases and Enabling Ecosystem
PDF
Ansible Automation Platform.pdf
PDF
Tech Talk NVIDIA CUDA
PPTX
Q1 Memory Fabric Forum: Intel Enabling Compute Express Link (CXL)
PPTX
CXL Fabric Management Standards
PPTX
Broadcom PCIe & CXL Switches OCP Final.pptx
PDF
왜 쿠버네티스는 systemd로 cgroup을 관리하려고 할까요
PDF
한컴MDS_NVIDIA Jetson Platform
PPTX
Zen 2: The AMD 7nm Energy-Efficient High-Performance x86-64 Microprocessor Core
 
PPTX
From distributed caches to in-memory data grids
Virtualization with KVM (Kernel-based Virtual Machine)
Modular by Design: Supermicro’s New Standards-Based Universal GPU Server
AMD: Where Gaming Begins
 
Introduction to Serverless and Google Cloud Functions
Accelerating Innovation from Edge to Cloud
AMD EPYC™ Microprocessor Architecture
 
The kvm virtualization way
AMD Chiplet Architecture for High-Performance Server and Desktop Products
 
An Introduction to Kubernetes
High Performance Object Storage in 30 Minutes with Supermicro and MinIO
Microchip: CXL Use Cases and Enabling Ecosystem
Ansible Automation Platform.pdf
Tech Talk NVIDIA CUDA
Q1 Memory Fabric Forum: Intel Enabling Compute Express Link (CXL)
CXL Fabric Management Standards
Broadcom PCIe & CXL Switches OCP Final.pptx
왜 쿠버네티스는 systemd로 cgroup을 관리하려고 할까요
한컴MDS_NVIDIA Jetson Platform
Zen 2: The AMD 7nm Energy-Efficient High-Performance x86-64 Microprocessor Core
 
From distributed caches to in-memory data grids
Ad

Similar to Supermicro’s Universal GPU: Modular, Standards Based and Built for the Future (20)

PPTX
Innovative Solutions for Cloud Gaming, Media, Transcoding, & AI Inferencing
PPTX
Drive Data Center Efficiency with SuperBlade, Powered by AMD EPYC™ and Instinct™
PDF
PCCC23:日本AMD株式会社 テーマ1「AMD Instinct™ アクセラレーターの概要」
PDF
Marv Wexler - Transform Your with AI.pdf
PPTX
2U 2-Node Multi-GPU Platform Delivers Unrivaled Efficiency and Flexibility
PPTX
Building Efficient Edge Nodes for Content Delivery Networks
PDF
The Power of One: Supermicro’s High-Performance Single-Processor Blade Systems
PDF
Evolution of Supermicro GPU Server Solution
PDF
Beyond Moore's Law: The Challenge of Heterogeneous Compute & Memory Systems
PPTX
MWC Roundtable: Accelerating Innovation from the Intelligent Edge to Cloud
PDF
NextHorizonOnTheCaseWirgThePresnrentions
PDF
Ceph optimized Storage / Global HW solutions for SDS, David Alvarez
PPTX
X13 Products + Intel® Xeon® CPU Max Series–An Applications & Performance View
PPTX
X13 Products + Intel® Xeon® CPU Max Series–An Applications & Performance View
PDF
ROCm and Distributed Deep Learning on Spark and TensorFlow
PDF
Blue Line Supermicro Aplus
PDF
POWER9 AC922 Newell System - HPC & AI
PDF
Keynote (Dr. Lisa Su) - Developers: The Heart of AMD Innovation - by Dr. Lisa...
PDF
Final lisa opening_keynote_draft_-_v12.1tb
PPTX
AMD EPYC 7002 Launch World Records
 
Innovative Solutions for Cloud Gaming, Media, Transcoding, & AI Inferencing
Drive Data Center Efficiency with SuperBlade, Powered by AMD EPYC™ and Instinct™
PCCC23:日本AMD株式会社 テーマ1「AMD Instinct™ アクセラレーターの概要」
Marv Wexler - Transform Your with AI.pdf
2U 2-Node Multi-GPU Platform Delivers Unrivaled Efficiency and Flexibility
Building Efficient Edge Nodes for Content Delivery Networks
The Power of One: Supermicro’s High-Performance Single-Processor Blade Systems
Evolution of Supermicro GPU Server Solution
Beyond Moore's Law: The Challenge of Heterogeneous Compute & Memory Systems
MWC Roundtable: Accelerating Innovation from the Intelligent Edge to Cloud
NextHorizonOnTheCaseWirgThePresnrentions
Ceph optimized Storage / Global HW solutions for SDS, David Alvarez
X13 Products + Intel® Xeon® CPU Max Series–An Applications & Performance View
X13 Products + Intel® Xeon® CPU Max Series–An Applications & Performance View
ROCm and Distributed Deep Learning on Spark and TensorFlow
Blue Line Supermicro Aplus
POWER9 AC922 Newell System - HPC & AI
Keynote (Dr. Lisa Su) - Developers: The Heart of AMD Innovation - by Dr. Lisa...
Final lisa opening_keynote_draft_-_v12.1tb
AMD EPYC 7002 Launch World Records
 
Ad

More from Rebekah Rodriguez (15)

PPTX
Delivering Supermicro Software Defined Storage Solutions with OSNexus QuantaStor
PPTX
Supermicro and The Green Grid (TGG)
PDF
X13 Pre-Release Update featuring 4th Gen Intel® Xeon® Scalable Processors
PPTX
Zero Trust for Private 5G and Edge
PPTX
Benefits of Operating an On-Premises Infrastructure
PPTX
Emerging Cloud Storage Trends for Enterprises
PPTX
Tackling Retail Technology Management Challenges at the Edge
PPTX
Optimize Content Delivery with Multi-Access Edge Computing
PPTX
Delivering Breakthrough Performance Per Core with AMD EPYC
PPTX
Delivering Breakthrough Performance Per Core with AMD EPYC
PPTX
High-Density Top-Loading Storage for Cloud Scale Applications
PPTX
Consumption Based On-Demand Private Cloud in a Box
PPTX
Rack Cluster Deployment for SDSC Supercomputer
PDF
Supermicro X12 Performance Update
PPTX
Simplify Data Management and Go Green with Supermicro & Qumulo
Delivering Supermicro Software Defined Storage Solutions with OSNexus QuantaStor
Supermicro and The Green Grid (TGG)
X13 Pre-Release Update featuring 4th Gen Intel® Xeon® Scalable Processors
Zero Trust for Private 5G and Edge
Benefits of Operating an On-Premises Infrastructure
Emerging Cloud Storage Trends for Enterprises
Tackling Retail Technology Management Challenges at the Edge
Optimize Content Delivery with Multi-Access Edge Computing
Delivering Breakthrough Performance Per Core with AMD EPYC
Delivering Breakthrough Performance Per Core with AMD EPYC
High-Density Top-Loading Storage for Cloud Scale Applications
Consumption Based On-Demand Private Cloud in a Box
Rack Cluster Deployment for SDSC Supercomputer
Supermicro X12 Performance Update
Simplify Data Management and Go Green with Supermicro & Qumulo

Recently uploaded (20)

PDF
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPT
Teaching material agriculture food technology
PDF
solutions_manual_-_materials___processing_in_manufacturing__demargo_.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Modernizing your data center with Dell and AMD
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
GamePlan Trading System Review: Professional Trader's Honest Take
PDF
Electronic commerce courselecture one. Pdf
PDF
Advanced IT Governance
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
GDG Cloud Iasi [PUBLIC] Florian Blaga - Unveiling the Evolution of Cybersecur...
PDF
cuic standard and advanced reporting.pdf
PDF
KodekX | Application Modernization Development
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
Review of recent advances in non-invasive hemoglobin estimation
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Teaching material agriculture food technology
solutions_manual_-_materials___processing_in_manufacturing__demargo_.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
NewMind AI Monthly Chronicles - July 2025
Modernizing your data center with Dell and AMD
20250228 LYD VKU AI Blended-Learning.pptx
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
GamePlan Trading System Review: Professional Trader's Honest Take
Electronic commerce courselecture one. Pdf
Advanced IT Governance
Understanding_Digital_Forensics_Presentation.pptx
GDG Cloud Iasi [PUBLIC] Florian Blaga - Unveiling the Evolution of Cybersecur...
cuic standard and advanced reporting.pdf
KodekX | Application Modernization Development
The Rise and Fall of 3GPP – Time for a Sabbatical?

Supermicro’s Universal GPU: Modular, Standards Based and Built for the Future

  • 2. Confidential Supermicro’s Universal GPU: Modular, Standards Based and Built for the Future Josh Grossman, Principal Product Manager April, 2022 Better Faster Greener™ © 2022 Supermicro
  • 3. Confidential Agenda • Introduction to AI Market • Universal GPU Systems with MI250 • Martin Huarte on AMD GPU Software Stack v
  • 4. Confidential AI Market Projection • 13 trillion dollar overall market Size According to Mckinsey • AI market size (USA) will expand at a Compound Annual Growth Rate (CAGR) of 40.2% from 2021 to 2028. • 83% of companies share that having access to AI is a top priority in their business plans. v
  • 5. Confidential 4/20/2022 Better Faster Greener™ © 2022 Supermicro 5 AI augmentation” will create $2.9trn of “business value” and save 6.2bn man-hours globally. A survey by McKinsey last year estimated that AI analytics could add around $13trn, or 16%, to annual global GDP by 2030. Retail and logistics stand to gain most (see chart 2). (Economist, 2022)
  • 6. Confidential 4/20/2022 Better Faster Greener™ © 2022 Supermicro 6
  • 7. Confidential 4/20/2022 Better Faster Greener™ © 2022 Supermicro 7
  • 8. Confidential Rack Scale AI Solutions 4/20/2022 Better Faster Greener™ © 2022 Supermicro 8
  • 9. Confidential 9 Summary Universal GPU Server • The Most Optimized and Flexible GPU Server Platform available today o CPU MB Support • AMD H12 Milan • Intel X12 Ice Lake o GPU Support • NVIDIA Redstone with GPU to GPU NVLink • AMD MI-250 with Infinity Fabric xGMI • Traditional PCIe Form Factor GPU • Modular Design for Flexibility • Improved Thermal Capability o Support up to 500W/700W GPU, 280W AMD CPU and 350W/400W Intel CPU • 1U Expansion Module available for all 4U Servers UBB/OAM Intel PVC Redstone AMD MI- 250 PCIe Supermicro Confidential
  • 10. Confidential 4U/5U Rackmount Dual X12, X13 and H12 Processors 32 DIMM Slots Up to 10 PCIe Low Profile 5.0 Slots Up to 10 PCIe with up to 2 AIOM/OCP 3.0 NIC Slots Up to 10 Drives of 2.5” NVMe/SAS/SATA 4x 3000W Redundant Titanium (2+2) / Platinum Level Power Supplies Universal GPU Product Series 4U uGPU Universal GPU Server Performance Modular, Standards Based Modular design supports a variety of GPU technologies and configurations Supports industry leading high performance GPUs from NVIDIA, AMD and Intel. Standardize on one GPU Platform for all your data center needs Next Generation Supermicro Universal GPU Servers Subject to change without notice 10 Better Faster Greener™ © 2022 Supermicro 5U uGPU Universal GPU with 1 U Expansion Module One GPU Platform
  • 11. 11 Universal Design and AMD Instinct MI250 OAM Supermicro Confidential/Internal Only • Significant HPC performance increase over competition • Also good for AI/ML workloads • 128GB HBM2e ECC Memory per OAM • GPU to GPU xGMI Infinity Fabric 2.5TB/s
  • 12. CONFIDENTIAL AMD Tools & Solutions for AI/ML and HPC 4/20/2022 Better Faster Greener™ © 2021 Supermicro 12 RTM Reverse Time Migration Datacenter Tools: Profilers & Debuggers, Comm & Math Libraries, Compiler Code Reuse: ONNX Run-time, existing deep learning, HPC code Cross Platform: Open source, supports AMD CPUs, CPU, non-AMD GPUs 3RD GEN AMD INFINITY ARCHITECTURE FIRST MULTI-CHIP GPU • Highest performance • Bigger GPU memory • Higher Flops (FP64, FP32, FP16)
  • 13. Confidential Better Faster Greener™ © 2021 Supermicro 13 Specifications CPU – Dual Socket Dual AMD EPYC 7003 CPUs (Socket SP3) up to 280W, 128 Cores/256 Threads Memory – 32 DIMM Slots 32 DIMM, 8TB Reg. ECC DDR4 up to 3200MHz Drives – 10 2.5” Drive-bay Up to 10x HS NVMe U.2 connect to PCIe Switch or 10x HS 2.5” SATA Expansion – 8 PCIe Slots 8x PCIe 4.0 x16 LP (via PLX switch) I/O ports 1x VGA, 1x COM Header, 2x USB 3.0, and 1x Dedicated IPMI Power Supply 4x 3000W (2+2) Titanium Level efficiency power supplies 4U AMD EPYC 7003 Dual CPUs and Four AMD MI250 GPUs Universal GPU System AMD AS -4124GQ-TNMI Subject to change without notice Key Features Universal GPU Server Standards Based Design Modular by Design for Flexibility/Future Proofed Improved Thermal Capability Key Applications Perfect Platform for HPC applications Data Center Infrastructure System Rear View System Front View Supermicro Confidential/Internal Only
  • 14. Confidential Better Faster Greener™ © 2021 Supermicro 14 Specifications CPU – Dual Socket Dual AMD EPYC 7003 CPUs (Socket SP3) up to 280W, 128 Cores/256 Threads Memory – 32 DIMM Slots 32 DIMM, 8TB Reg. ECC DDR4 up to 3200MHz Drives – 10 2.5” Drive-bay Up to 10x HS NVMe U.2 connect to PCIe Switch or 10x HS 2.5” SATA Expansion – 10 PCIe Slots 8x PCIe 4.0 x16 LP (via PLX switch) 2x PCIe 4.0 x16 LP or AIOM (via CPU w/ 1U add-on) I/O ports 1x VGA, 1x COM Header, 2x USB 3.0, and 1x Dedicated IPMI Power Supply 4x 3000W (2+2) Titanium Level efficiency power supplies 5U AMD EPYC 7003 Dual CPUs and Four AMD MI250 GPUs Universal GPU System AMD AS -4124GQ-TNMI Subject to change without notice Key Features Universal GPU Server Standards Based Design Modular by Design for Flexibility/Future Proofed Improved Thermal Capability Key Applications Perfect Platform for HPC applications Data Center Infrastructure System Rear View System Front View Supermicro Confidential/Internal Only
  • 15. Driving Innovation and Discovery with AMD Instinct™ accelerators on ROCm™ Stack Martin Huarte, Ph.D. Developer Relations Manager, martin.huarte@amd.com
  • 16. 16 [AMD Official Use Only] Open APIs Open Libraries Compilers Developer Tools Kernel / Runtime HPC Frameworks ISV Apps Open- Source Codes Operating Systems Deployment Tools Mgmt Tools ML Frameworks
  • 17. 17 [AMD Official Use Only] Drivers/Runtimes Programming models Libraries Compilers & Tools Deployment Tools Compiler OpenMP API HIP API OpenCL™ RedHat, CentOS, SLES & Ubuntu Device Drivers and Run-Time BLAS FFT RAND SPARSE Debugger Profiler ROCm Validation Suite ROCm Data Center Tool SOLVER TENSILE ALUTION THRUST MIOpen MIVisionX Tracer RCCL MIGraphX PRIM hipify ROCm SMI
  • 18. 18 [AMD Official Use Only] AMD Infinity Hub Containerized HPC Apps and ML Frameworks Purpose-built accelerators for HPC and AI workloads Full range of leading OEMs/ODMs supplying AMD Accelerated systems to HPC and AI market segments Open software platform for developers to build HPC applications on AMD Accelerators Single location for researchers and data scientists to download containerized HPC apps and ML frameworks Compilers, Libraries, Dev Tools, APIs, Kernels/Runtimes Validated, Optimized Systems & Platforms
  • 19. 19 [AMD Official Use Only] DRIVING MAINSTREAM ADOPTION & ECOSYSTEM ENABLEMENT 19 EXPANDED OPTIMIZED ENABLING SUPPORT FOR AMD INSTINCT™ MI200 & AMD RADEON™ PRO W6800 GPUS COMPILER & LIBRARY OPTIMIZATIONS FOR HPC & AI/ML NEW ROCm DOCUMENTATION PORTAL & IMPROVED DEBUG TOOLS
  • 20. 20 [AMD Official Use Only] Re-architected ROCm Documentation  Support Guides  Installation & Deployment Guides  API / SDK Documentation Access to ROCm Learning Center  GPU programming tutorials, videos and labs https://docs.amd.com/
  • 21. 21 [AMD Official Use Only] Molecular Dynamics Academic / Research Oil & Gas / Geoscience NAMD LAMMPS GROMACS Computer Aided Engineering (CAE) Weather Machine Learning Reverse Time Migration (RTM) – miniMOD sample SPECFEM3D (Cartesian) SPECFEM3D (Globe) CP2K Quantum Espresso NWChem VASP MPAS TempoQuest AceCAST ICON NEMO Chroma MILC GRID TensorFlow PyTorch ONNX-Runtime MLPerf AMBER OpenMM Relion Quantum Chemistry Quantum Physics OpenFOAM® (CFD) PYFR (CFD) Cascade CharLES (CFD) Ansys Mechanical (FEA) Target availability 1H22
  • 22. 22 [AMD Official Use Only] AMD INFINITY HUB ROCm™ APP CATALOG COMMERCIAL ISVs [LINK] [LINK] • • • • • *Ansys Mechanical 2022 R2, Cascade CharLES, TempoQuest AceCAST
  • 23. 23 [AMD Official Use Only] • HPC Apps: CHROMA*, CP2K*, GRID*, GROMACS*, HACC, LAMMPS, MILC, NAMD*, OpenMM*, Relion, SPECFEM3D (Cartesian)*, SPECFEM3D (Globe)* • HPC Apps: AMBER*, ICON, MPAS, NWCHEM, OpenFOAM, PYFR, QuantumEspresso, WRF, NEMO • AI/ML: PyTorch*, TensorFlow* • Benchmarks: HPL, NBODY • Benchmarks: MLPerf (SSD, Resnet50, Transformer), HPCG Additional MI200 Support Planned for 1H22 MI200 Support Planned for 2H21 * Available on InfinityHub with MI100 support today Performance Results for Select Apps / Benchmarks
  • 24. 24 [AMD Official Use Only]  AMD Instinct GPUs:  AMD Instinct™ MI210 GPU page: https://www.amd.com/en/products/server-accelerators/instinct-mi210  AMD Instinct™ MI Series Product Page: https://www.amd.com/en/graphics/instinct-server-accelerators  AMD Instinct™ HPC Solutions Page: https://www.amd.com/en/graphics/servers-instinct-mi-powered-servers  AMD Instinct™ Machine Learning Solutions Page: https://www.amd.com/en/graphics/servers-instinct-deep-learning  AMD CDNA2 Architecture: https://www.amd.com/en/technologies/cdna2  CDNA2 WP: https://www.amd.com/system/files/documents/amd-cdna2-white-paper.pdf  AMD ROCm™ open software platform:  AMD ROCm™ pages: https://www.amd.com/en/graphics/servers-solutions-rocm  AMD Infinity Hub: https://www.amd.com/en/technologies/infinity-hub  AMD Accelerator Cloud: https://www.amd.com/en/solutions/accelerated-computing  ROCm Information Portal (DOCs & Learning Ctr.): https://docs.amd.com/  HPC & AMD page: www.AMD.com/HPC For AMD Instinct™ GPU and ROCm™ marketing assets, contact: Guy.Ludden@AMD.com or Sydney.Freeman@AMD.com
  • 25. Confidential Thank You 25 Better Faster Greener™ © 2022 Supermicro Please Contact us for Details: Josh Grossman, Principal Product Manager, Supermicro joshg@supermicro.com Martin Huarte, Ph.D., Developer Relations Manager, AMD martin.huarte@amd.com
  • 26. Confidential DISCLAIMER Super Micro Computer, Inc. may make changes to specifications and product descriptions at any time, without notice. The information presented in this document is for informational purposes only and may contain technical inaccuracies, omissions and typographical errors. Any performance tests and ratings are measured using systems that reflect the approximate performance of Super Micro Computer, Inc. products as measured by those tests. Any differences in software or hardware configuration may affect actual performance, and Super Micro Computer, Inc. does not control the design or implementation of third party benchmarks or websites referenced in this document. The information contained herein is subject to change and may be rendered inaccurate for many reasons, including but not limited to any changes in product and/or roadmap, component and hardware revision changes, new model and/or product releases, software changes, firmware changes, or the like. Super Micro Computer, Inc. assumes no obligation to update or otherwise correct or revise this information. SUPER MICRO COMPUTER, INC. MAKES NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE CONTENTS HEREOF AND ASSUMES NO RESPONSIBILITY FOR ANY INACCURACIES, ERRORS OR OMISSIONS THAT MAY APPEAR IN THIS INFORMATION. SUPER MICRO COMPUTER, INC. SPECIFICALLY DISCLAIMS ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE. IN NO EVENT WILL SUPER MICRO COMPUTER, INC. BE LIABLE TO ANY PERSON FOR ANY DIRECT, INDIRECT, SPECIAL OR OTHER CONSEQUENTIAL DAMAGES ARISING FROM THE USE OF ANY INFORMATION CONTAINED HEREIN, EVEN IF SUPER MICRO COMPUTER, Inc. IS EXPRESSLY ADVISED OF THE POSSIBILITY OF SUCH DAMAGES. ATTRIBUTION © 2022 Super Micro Computer, Inc. All rights reserved. 4/20/2022 Better Faster Greener™ © 2021 Supermicro 26

Editor's Notes

  • #18: Heterogeneous compute interface for portability
  • #22: MLPerf 0.7 (Resnet 50, Transformer, SSD)
  • #25: Canned questions: Is HIP a drop-in replacement for CUDA? No. HIP provides porting tools which do most of the work to convert CUDA code into portable C++ code that uses the HIP APIs. Most developers will port their code from CUDA to HIP and then maintain the HIP version. HIP code provides the same performance as native CUDA code, plus the benefits of running on AMD platforms. What APIs and features does HIP support? HIP provides the following: Devices (hipSetDevice(), hipGetDeviceProperties()) Memory management (hipMalloc(), hipMemcpy(), hipFree()) Streams (hipStreamCreate(),hipStreamSynchronize(), hipStreamWaitEvent()) Events (hipEventRecord(), hipEventElapsedTime()) Kernel launching (hipLaunchKernel is a standard C/C++ function that replaces <<< >>>) HIP Module API to control when adn how code is loaded. CUDA-style kernel coordinate functions (threadIdx, blockIdx, blockDim, gridDim) Cross-lane instructions including shfl, ballot, any, all - Most device-side math built-ins. Error reporting (hipGetLastError(), hipGetErrorString()) The HIP API documentation describes each API and its limitations, if any, compared with the equivalent CUDA API. https://rocmdocs.amd.com/en/latest/Programming_Guides/HIP-FAQ.html#what-apis-and-features-does-hip-support