SlideShare a Scribd company logo
3
Most read
10
Most read
11
Most read
Partnership to enable copper for the next
generation artificial intelligence computing
Partnership to enable copper for the next
generation artificial intelligence computing
[Seunghyun Eddy Hwang, Principal SI Lead, NVIDIA]
[Wai Kiong Poon, Global Product Manager, MOLEX]
Enabling Copper for Artificial Intelligence Computing
Next Generation Architecture
Voice of Customer
 High Speed Board-To-Board (112G PAM4 and beyond)
 Low Profile (5mm mated height as initial target)
 Surface Mount Termination (Solder Ball Attachment)
 Minimum PCB Real-Estate
 Mechanical Robust
 Simplicity in Design
 Ease of manufacturing; Cost Effective
High Speed Mezzanine Connector
MM
MM Pro
MMe
• Speed up to 112Gbps
• Speed up to 112Gbps (improved performance)
• Speed up to 224Gbps
Evolution of High-Speed Mezzanine
High Speed Mezzanine Connector
High Speed Mezzanine Connector
What have we learnt from the current Mezzanine Design?
 What have we learnt from the current Mezzanine Design?
— What are the Current Issues?
— What is the Signal Integrity Limitations?
 Moving from 112G to 224G:
— Understanding issues with current design
— Understanding limitations with current assembly process
 These understandings will enable us to design the next generation of High-
Speed Board to Board Connector
SI Performance influenced by Terminal & Assembly Process
 A good impedance control is critical for 224G application, but the impedance
optimization is limited by the following:
— Connector Design
— Assembly Process
 Due to the current stitched terminal design,
the flexibility and sensitivity of the terminal
width tuning becomes very challenging
Dimension Constraints due to Assembly Process.
High Speed Mezzanine Connector
Limitations of Current Design
 Minor variation in the dimensions will result in resonance
 This type of variation is very difficult to control during assembly process
High Speed Mezzanine Connector
TDR Connector only
Sensitivity of SI Performance due to Terminal Deflection
 Aside from dimension variation, the impedance is also sensitive to
deflection condition which affects the distance between terminal and its
nearby plastic
High Speed Mezzanine Connector
Limitations of Current Design
High Speed Mezzanine Connector
MM
MM Pro
MMe
• Speed up to 112Gbps
• Speed up to 112Gbps (improved performance)
• Speed up to 224Gbps
Evolution of High-Speed Mezzanine
High Speed Mezzanine Connector
Victim : Diff 4
Aggressor : others
#Rference impedance 86ohm
Comparing MM, MM Pro Vs. MMe (Signal Integrity)
High Speed Mezzanine Connector
Comparing MM, MM Pro Vs. MMe (Signal Integrity)
High Speed Mezzanine Connector
• It’s no secret that GPU accelerators now power many of the world’s fastest supercomputers and AI
systems
• NVLink is the world’s first proprietary system interconnect technology from Nvidia that allows multiple
GPUs to communicate directly via a high-speed interconnection
• NVLink connects the machines’ processors – CPUs and GPUs – so they can exchange data much faster
than CPU
What is NVLink?
NVIDIA DGX
• Nvidia Tesla V100 is the
world’s most advanced
data center GPU
• Supports AI, deep
learning, HPC, and
autonomous driving
• Tesla V100 offers the
• DGX H100, which uses 4th
generation of NVLINK, is the
world’s most advanced GPU
for large generative AI and
other transformer-based
workloads
• H100 contains 8 GPU modules
that communicate thru
NVLink needs high-speed
connector for baseboard
attachment
Improved H100 Performance Bring SI
Challenge
• Significant performance boost from A100
• Fourth-generation NVIDIA
NVLink provides a 3x bandwidth
increase on all-reduce operations and
multi-GPU IO operating at 7x the
bandwidth of PCIe Gen 5.
• Yet, no significant form factor change for
improved performance, which introduces
significant SI challenges
A100
H100
Mezzanine Connector Selection
Criteria
• SI performance
• As shown in crosstalk
comparison plot in left
figure, other vendor
crosstalk is much worse
than Molex
• Formfactor
• Molex pinout can route all
NVLink high-speed signals
in less number of layers
than other vendor
• Mechanical stability
Molex
Other vendor
Molex Connector Optimization
Optimized
Baseline
Optimized
Baseline
Optimized
Baseline
Resonance
free
SI Performance Comparison: H100 vs
A100
A100 H100
Thank you!

More Related Content

PPTX
TE Connectivity: Card Edge Interconnects - Understanding Device & Riser Card ...
PDF
Shared Memory Centric Computing with CXL & OMI
PPTX
MemVerge: Past Present and Future of CXL
PPTX
Astera Labs: Intelligent Connectivity for Cloud and AI Infrastructure
PPTX
Liqid: Composable CXL Preview
PPTX
The State of CXL-related Activities within OCP
PPTX
SMART Modular: Memory Solutions with CXL
PPTX
CXL Consortium Update: Advancing Coherent Connectivity
TE Connectivity: Card Edge Interconnects - Understanding Device & Riser Card ...
Shared Memory Centric Computing with CXL & OMI
MemVerge: Past Present and Future of CXL
Astera Labs: Intelligent Connectivity for Cloud and AI Infrastructure
Liqid: Composable CXL Preview
The State of CXL-related Activities within OCP
SMART Modular: Memory Solutions with CXL
CXL Consortium Update: Advancing Coherent Connectivity

What's hot (20)

PPTX
Microchip: CXL Use Cases and Enabling Ecosystem
PPTX
Q1 Memory Fabric Forum: Big Memory Computing for AI
PPTX
Enfabrica - Bridging the Network and Memory Worlds
PDF
Q1 Memory Fabric Forum: SMART CXL Product Lineup
PDF
Q1 Memory Fabric Forum: Memory Fabric in a Composable System
PDF
NVMe overview
PPTX
Student guide power systems for aix - virtualization i implementing virtual...
PPTX
Micron: Memory Expansion with CXL Modules: Benefits, Use Cases and Enriching ...
PPTX
CXL Fabric Management Standards
PPTX
Q1 Memory Fabric Forum: Memory expansion with CXL-Ready Systems and Devices
PPTX
Q1 Memory Fabric Forum: Advantages of Optical CXL​ for Disaggregated Compute ...
PDF
How to Perform HCL Notes 14 Upgrades Smoothly
PDF
XenDesktop 7.6とXenApp 7.6の移行および注意点について徹底解説
PPTX
Installing Postgres on Linux
 
PDF
Percona server for MySQL 제품 소개
PDF
AWS の OpenShift サービス (ROSA) を使った OpenShift Virtualizationの始め方.pdf
PPTX
Micron CXL product and architecture update
PDF
Moving to PCI Express based SSD with NVM Express
PDF
Nick Fisk - low latency Ceph
PDF
ALM과 DevOps 그리고 Azure DevOps
Microchip: CXL Use Cases and Enabling Ecosystem
Q1 Memory Fabric Forum: Big Memory Computing for AI
Enfabrica - Bridging the Network and Memory Worlds
Q1 Memory Fabric Forum: SMART CXL Product Lineup
Q1 Memory Fabric Forum: Memory Fabric in a Composable System
NVMe overview
Student guide power systems for aix - virtualization i implementing virtual...
Micron: Memory Expansion with CXL Modules: Benefits, Use Cases and Enriching ...
CXL Fabric Management Standards
Q1 Memory Fabric Forum: Memory expansion with CXL-Ready Systems and Devices
Q1 Memory Fabric Forum: Advantages of Optical CXL​ for Disaggregated Compute ...
How to Perform HCL Notes 14 Upgrades Smoothly
XenDesktop 7.6とXenApp 7.6の移行および注意点について徹底解説
Installing Postgres on Linux
 
Percona server for MySQL 제품 소개
AWS の OpenShift サービス (ROSA) を使った OpenShift Virtualizationの始め方.pdf
Micron CXL product and architecture update
Moving to PCI Express based SSD with NVM Express
Nick Fisk - low latency Ceph
ALM과 DevOps 그리고 Azure DevOps
Ad

Similar to Molex and Nvidia - Partnership to enable copper for the next generation artificial intelligence computing (20)

PPTX
Computação acelerada – a era das ap us roberto brandão, ciência
PPTX
Experiences in Application Specific Supercomputer Design - Reasons, Challenge...
PPTX
Amd accelerated computing -ufrj
PDF
Co-Design Architecture for Exascale
PDF
PDF
PLNOG16: Coping with Growing Demands – Developing the Network to New Bandwidt...
PPTX
Seminario utovrm
PDF
Mellanox Announcements at SC15
PPTX
Streaming multiprocessors and HPC
PDF
2.01_Nvidia_NVswitch_HotChips2018_DGX2NVS_Final.pdf
PPTX
Introducing the CrossLink Programmable ASSP
PDF
Interconnect Your Future With Mellanox
PPTX
Introduction-to-Distributed-Systems GPU-BilqesF 2.pptx
PDF
Interconnect Your Future: Paving the Road to Exascale
PDF
VLSI- An Automotive Application Perspective
PDF
High Performance Computing - Challenges on the Road to Exascale Computing
PDF
HPC Infrastructure To Solve The CFD Grand Challenge
PDF
Webinar: Aplicações gráficas com STM32H7
PDF
Silicon Photonics for HPC Interconnects
PPTX
TE Connectivity: Card Edge Interconnects
Computação acelerada – a era das ap us roberto brandão, ciência
Experiences in Application Specific Supercomputer Design - Reasons, Challenge...
Amd accelerated computing -ufrj
Co-Design Architecture for Exascale
PLNOG16: Coping with Growing Demands – Developing the Network to New Bandwidt...
Seminario utovrm
Mellanox Announcements at SC15
Streaming multiprocessors and HPC
2.01_Nvidia_NVswitch_HotChips2018_DGX2NVS_Final.pdf
Introducing the CrossLink Programmable ASSP
Interconnect Your Future With Mellanox
Introduction-to-Distributed-Systems GPU-BilqesF 2.pptx
Interconnect Your Future: Paving the Road to Exascale
VLSI- An Automotive Application Perspective
High Performance Computing - Challenges on the Road to Exascale Computing
HPC Infrastructure To Solve The CFD Grand Challenge
Webinar: Aplicações gráficas com STM32H7
Silicon Photonics for HPC Interconnects
TE Connectivity: Card Edge Interconnects
Ad

More from Memory Fabric Forum (20)

PPTX
H3 Platform CXL Solution_Memory Fabric Forum.pptx
PDF
Q1 Memory Fabric Forum: ZeroPoint. Remove the waste. Release the power.
PPTX
Q1 Memory Fabric Forum: Building Fast and Secure Chips with CXL IP
PPTX
Q1 Memory Fabric Forum: Using CXL with AI Applications - Steve Scargall.pptx
PPTX
Q1 Memory Fabric Forum: About MindShare Training
PPTX
Q1 Memory Fabric Forum: CXL-Related Activities within OCP
PDF
Q1 Memory Fabric Forum: CXL Controller by Montage Technology
PDF
Q1 Memory Fabric Forum: Teledyne LeCroy | Austin Labs
PDF
Q1 Memory Fabric Forum: Breaking Through the Memory Wall
PDF
Q1 Memory Fabric Forum: CXL Form Factor Primer
PDF
Q1 Memory Fabric Forum: Memory Processor Interface 2023, Focus on CXL
PDF
Q1 Memory Fabric Forum: Micron CXL-Compatible Memory Modules
PPTX
Q1 Memory Fabric Forum: Compute Express Link (CXL) 3.1 Update
PPTX
Q1 Memory Fabric Forum: Intel Enabling Compute Express Link (CXL)
PPTX
Q1 Memory Fabric Forum: XConn CXL Switches for AI
PDF
Q1 Memory Fabric Forum: VMware Memory Vision
PPTX
MemVerge: Memory Expansion Without Breaking the Budget
PPTX
Micron - CXL Enabling New Pliability in the Modern Data Center.pptx
PPTX
Photowave Presentation Slides - 11.8.23.pptx
PPTX
Synopsys: Achieve First Pass Silicon Success with Synopsys CXL IP Solutions
H3 Platform CXL Solution_Memory Fabric Forum.pptx
Q1 Memory Fabric Forum: ZeroPoint. Remove the waste. Release the power.
Q1 Memory Fabric Forum: Building Fast and Secure Chips with CXL IP
Q1 Memory Fabric Forum: Using CXL with AI Applications - Steve Scargall.pptx
Q1 Memory Fabric Forum: About MindShare Training
Q1 Memory Fabric Forum: CXL-Related Activities within OCP
Q1 Memory Fabric Forum: CXL Controller by Montage Technology
Q1 Memory Fabric Forum: Teledyne LeCroy | Austin Labs
Q1 Memory Fabric Forum: Breaking Through the Memory Wall
Q1 Memory Fabric Forum: CXL Form Factor Primer
Q1 Memory Fabric Forum: Memory Processor Interface 2023, Focus on CXL
Q1 Memory Fabric Forum: Micron CXL-Compatible Memory Modules
Q1 Memory Fabric Forum: Compute Express Link (CXL) 3.1 Update
Q1 Memory Fabric Forum: Intel Enabling Compute Express Link (CXL)
Q1 Memory Fabric Forum: XConn CXL Switches for AI
Q1 Memory Fabric Forum: VMware Memory Vision
MemVerge: Memory Expansion Without Breaking the Budget
Micron - CXL Enabling New Pliability in the Modern Data Center.pptx
Photowave Presentation Slides - 11.8.23.pptx
Synopsys: Achieve First Pass Silicon Success with Synopsys CXL IP Solutions

Recently uploaded (20)

PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPTX
Big Data Technologies - Introduction.pptx
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
Cloud computing and distributed systems.
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
Spectroscopy.pptx food analysis technology
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Approach and Philosophy of On baking technology
PDF
Empathic Computing: Creating Shared Understanding
PDF
Machine learning based COVID-19 study performance prediction
PPT
Teaching material agriculture food technology
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Big Data Technologies - Introduction.pptx
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Reach Out and Touch Someone: Haptics and Empathic Computing
Cloud computing and distributed systems.
Mobile App Security Testing_ A Comprehensive Guide.pdf
20250228 LYD VKU AI Blended-Learning.pptx
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Spectroscopy.pptx food analysis technology
Programs and apps: productivity, graphics, security and other tools
Approach and Philosophy of On baking technology
Empathic Computing: Creating Shared Understanding
Machine learning based COVID-19 study performance prediction
Teaching material agriculture food technology
MYSQL Presentation for SQL database connectivity
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx

Molex and Nvidia - Partnership to enable copper for the next generation artificial intelligence computing

  • 1. Partnership to enable copper for the next generation artificial intelligence computing
  • 2. Partnership to enable copper for the next generation artificial intelligence computing [Seunghyun Eddy Hwang, Principal SI Lead, NVIDIA] [Wai Kiong Poon, Global Product Manager, MOLEX]
  • 3. Enabling Copper for Artificial Intelligence Computing Next Generation Architecture
  • 4. Voice of Customer  High Speed Board-To-Board (112G PAM4 and beyond)  Low Profile (5mm mated height as initial target)  Surface Mount Termination (Solder Ball Attachment)  Minimum PCB Real-Estate  Mechanical Robust  Simplicity in Design  Ease of manufacturing; Cost Effective High Speed Mezzanine Connector
  • 5. MM MM Pro MMe • Speed up to 112Gbps • Speed up to 112Gbps (improved performance) • Speed up to 224Gbps Evolution of High-Speed Mezzanine High Speed Mezzanine Connector
  • 6. High Speed Mezzanine Connector What have we learnt from the current Mezzanine Design?  What have we learnt from the current Mezzanine Design? — What are the Current Issues? — What is the Signal Integrity Limitations?  Moving from 112G to 224G: — Understanding issues with current design — Understanding limitations with current assembly process  These understandings will enable us to design the next generation of High- Speed Board to Board Connector
  • 7. SI Performance influenced by Terminal & Assembly Process  A good impedance control is critical for 224G application, but the impedance optimization is limited by the following: — Connector Design — Assembly Process  Due to the current stitched terminal design, the flexibility and sensitivity of the terminal width tuning becomes very challenging Dimension Constraints due to Assembly Process. High Speed Mezzanine Connector
  • 8. Limitations of Current Design  Minor variation in the dimensions will result in resonance  This type of variation is very difficult to control during assembly process High Speed Mezzanine Connector
  • 9. TDR Connector only Sensitivity of SI Performance due to Terminal Deflection  Aside from dimension variation, the impedance is also sensitive to deflection condition which affects the distance between terminal and its nearby plastic High Speed Mezzanine Connector
  • 10. Limitations of Current Design High Speed Mezzanine Connector
  • 11. MM MM Pro MMe • Speed up to 112Gbps • Speed up to 112Gbps (improved performance) • Speed up to 224Gbps Evolution of High-Speed Mezzanine High Speed Mezzanine Connector
  • 12. Victim : Diff 4 Aggressor : others #Rference impedance 86ohm Comparing MM, MM Pro Vs. MMe (Signal Integrity) High Speed Mezzanine Connector
  • 13. Comparing MM, MM Pro Vs. MMe (Signal Integrity) High Speed Mezzanine Connector
  • 14. • It’s no secret that GPU accelerators now power many of the world’s fastest supercomputers and AI systems • NVLink is the world’s first proprietary system interconnect technology from Nvidia that allows multiple GPUs to communicate directly via a high-speed interconnection • NVLink connects the machines’ processors – CPUs and GPUs – so they can exchange data much faster than CPU What is NVLink?
  • 15. NVIDIA DGX • Nvidia Tesla V100 is the world’s most advanced data center GPU • Supports AI, deep learning, HPC, and autonomous driving • Tesla V100 offers the • DGX H100, which uses 4th generation of NVLINK, is the world’s most advanced GPU for large generative AI and other transformer-based workloads • H100 contains 8 GPU modules that communicate thru NVLink needs high-speed connector for baseboard attachment
  • 16. Improved H100 Performance Bring SI Challenge • Significant performance boost from A100 • Fourth-generation NVIDIA NVLink provides a 3x bandwidth increase on all-reduce operations and multi-GPU IO operating at 7x the bandwidth of PCIe Gen 5. • Yet, no significant form factor change for improved performance, which introduces significant SI challenges A100 H100
  • 17. Mezzanine Connector Selection Criteria • SI performance • As shown in crosstalk comparison plot in left figure, other vendor crosstalk is much worse than Molex • Formfactor • Molex pinout can route all NVLink high-speed signals in less number of layers than other vendor • Mechanical stability Molex Other vendor
  • 19. SI Performance Comparison: H100 vs A100 A100 H100