SlideShare a Scribd company logo
Consume. Collaborate. Contribute.Consume. Collaborate. Contribute.
OCP Accelerator Module and The
Infrastructure
ODSA Project Workshop
March 28, 2019
Consume. Collaborate. Contribute.
Outline
• Motivation
• Approach
• Examples
• Requesting Participation and Feedback
Consume. Collaborate. Contribute.
Motivation
Consume. Collaborate. Contribute.
AI’s rapid evolution is producing an explosion of
new types of hardware accelerators for
Machine Learning (ML) and Deep Learning (DL)
GPU FPGA ASIC NPU TPU NNP IPU xPU…
Consume. Collaborate. Contribute.
Varied Module and System Form Factors
Consume. Collaborate. Contribute.
Different Implementations
Targeting Similar Requirements!
HPC
Consume. Collaborate. Contribute.
Power
CPU Head Node
Host
Interface
High-speed Expansion Links
Baseboard
CPU Head Node
Host
Interface
Cooling
Baseboard
PCIe Switch PCIe SwitchChassis Chassis
Logical Components for AI Hardware System
Consume. Collaborate. Contribute.
Facebook Big Basin System
Baseboard on sliding tray
Midplane boardBaseboardIO board
Consume. Collaborate. Contribute.
Big Basin Rack View
Head node Tioga Pass
• 2S server Tioga Pass as head node
• Open Rack v2, 12.6kw
• 4 Big Basin per Rack
Consume. Collaborate. Contribute.
Facebook Inference/Video Accelerator Common System
Consume. Collaborate. Contribute.
Facebook Training System
Consume. Collaborate. Contribute.
Varied Module and System Form Factors
Consume. Collaborate. Contribute.
Common Requirements for Accelerator System
• Flexibility
• Robustness & Serviceability
• Configuration, Programming, & Management
• Power & Cooling
• Inter-module Communication to Scale Up
• Input / Output Bandwidth to Scale Out
Consume. Collaborate. Contribute.
“If you want to go Fast, go Alone;
If you want go Far, go Together”
We have done Fast for Short-term result;
It is time to go Far at OCP for
Long-term gain!
Consume. Collaborate. Contribute.
We Started from OAM.
Consume. Collaborate. Contribute.
OCP Accelerator Module(OAM) Spec
• 102mm x 165mm Module Size
• With two high-speed Mirror Mezz connectors
• 12V and 48V input DC Power
• Up to 350w (12V) and up to 700w (48V) TDP
• Up to 450W (air-cooled) and 700W (liquid-cooled)
• Support single or multiple ASIC(s) per Module
• Up to eight x16 Links (Host + inter-module Links)
• Support one or two x16 High speed link(s) to Host
• Up to seven x16 high speed interconnect links
• Up to 8* Modules per Baseboard
• System management and debug interfaces
Consume. Collaborate. Contribute.
NEW
PROJECT
OCP
ACCELERATOR
MODULE
Consume. Collaborate. Contribute.
OAM Power
• Support both 12V and 48V as input
• 12V to support up to 350w TDP
• 48V to support up to 700w TDP
Power Rail Voltage Tolerance # of pins Current Capability Status
P12V 11V min to 13.2V max 27 27A (when at 11V) Normal Power
P12V Mandatory 11V min to 13.2V max 5 5A (when at 11V) Normal Power
P48V 44V min to 60V max 16 16A (when at 44V) Normal Power
P3.3V 3.3V±10% (max) 2 2A Normal Power
Consume. Collaborate. Contribute.
OAM Pin Map
SerDes 1 X16
SerDes 2 X16
SerDes 3 X16
Host X16
SerDes R
X20
SerDes 4 X16
SerDes 5 X16
SerDes 6 X16
Power
Connector 0 Connector 1
Consume. Collaborate. Contribute.
Interconnect end-to-end Channel Loss
• The module interconnection channel total insertion loss @28Gbps should
not be over -8dB
• System baseboard IL budget = Die to Die IL from each OAM supplier –
16dB
8-dB package loss 8-dB package lossOAM OAM
~2-3inch Package Package ~2-3inch
Baseboard
Consume. Collaborate. Contribute.
System Management/Debugging
• Sensor reporting
• Error monitoring/Reporting
• Firmware Update
• Power Capping
• FRU Information
• IO Calibration
• JTAG/I2C/UART interfaces for debugging
Consume. Collaborate. Contribute.
Overview OAM: Mechanical/Thermal
Top stiffener
Example
Bottom stiffenerOAM PCB
Example die
BOTTOM VIEWTOP VIEW
Consume. Collaborate. Contribute.
Mech Requirements – OAM Bottom Stiffener
• 5±0.15mm stiffener as required by connector
• 3mm alignment pins that extend 10mm
below OAM PCB surface
• Die spring (rectangular profile coil spring)
to provide unmate force
• EMI gaskets for grounding to baseboard
SIDE VIEW, FULL ASSEMBLY
(BOTTOM STIFFENER IN BLUE)
BOTTOM VIEW
FRONT VIEW
Top stiffener OAM PCB
Bottom Stiffener
System Baseboard
Consume. Collaborate. Contribute.
Mech Requirements – System Baseboard
• Component KOZ 103 x 166mm: 0mm height
• Cross-hatched locations: Grounding Pads
• EMI grounding pads located north and
south of the connectors
• 4x Mounting Holes for M3.5 screws
• 2x SMT nuts used as alignment features
TOP VIEW
Consume. Collaborate. Contribute.
Mech Recommendations – Alignment Features
• Notch provides orientation and keying (OPTIONAL, BUT RECOMMENDED)
TOP VIEW
SMT nuts on baseboard
SIDE VIEW
Molex Mirror Mezz Connector Gatherability: 0.76mm
Consume. Collaborate. Contribute.
Thermal Requirements – Operation Environment
• Ambient Temp: 5°C to 35°C
• Approach Temp: 5°C to 48°C
• Altitude: sea level to 6000ft
• Humidity: 20% to 90%
• Cold boot temp limit: TBD
• Storage temp: -20°C to 85°C
• No ambient temp compensation/de-rating for altitude
Consume. Collaborate. Contribute.
Now we have a industry standard OAM spec,
what’s the next?
Consume. Collaborate. Contribute.
We need an
Open
Accelerator Infrastructure
Consume. Collaborate. Contribute.
• Power and Cooling
• Mechanical
• Electrical
• Security & Management
SERVER
Hierarchical Base Specification for OAI
Well-defined boundaries
Fostering Innovation
• OAM
• UBB (Interconnect Topology)
• Switch Board
• SCM
• Tray
• Chassis
Designs and Products may be compliant to any or all specifications
Consume. Collaborate. Contribute.
The Universal Baseboard (UBB)
Consume. Collaborate. Contribute.
Different Neural Networks and Frameworks
for Model or Data Parallelism
Benefit from different
Interconnect Topologies
Consume. Collaborate. Contribute.
•Consider a Grid of Planar OAM sites
•Standard Volumetric
•Protocol Agnostic Interconnects
•Wires are Wires!
Host
Interface
Expansion
Universal
Baseboard
Universal Baseboard (UBB)
Consume. Collaborate. Contribute.
With different interconnect topologies
Six inter-module Links may
create a 3D Mesh or Torus
With seven inter-OAM Links
and one Host Link
A Grid of interconnected OAMs,
Max Bisection BW
One Hop Away
Ready for Expansion
With six inter-OAM Links
and one Host Link
Consume. Collaborate. Contribute.
OAM Topology Examples
Fully Connected w/ 7 links
Consume. Collaborate. Contribute.
OAM Topology Examples
Almost Fully Connected w/ 6 links
Consume. Collaborate. Contribute.
OAM Topology Examples
Rings w/ 4 links
Port 4/5/6/R for AISC which has 4 links on Conn1 Only
Consume. Collaborate. Contribute.
OAM Topology Examples
Hybrid Cube Mesh w/ 4 links
Port 4/5/6/R for AISC which has 4 links on Conn1 Only
Consume. Collaborate. Contribute.
OAM Topology Examples
Hybrid Cube Mesh w/ 6 links
Consume. Collaborate. Contribute.
Summary for OAI/OAM
• Rev 0.85 of the OAM spec is available in OCP Wiki
• Join the Project and further develop interoperable Modules
for an Open Accelerator Infrastructure(UBB, PSB, SCM, Tray,
Chassis…)
• We invite you to join the OAI subgroup for further
collaboration:
Register for the Mailing List:
https://ocp-all.groups.io/g/OCP-OAI
Wiki under OCP Server Project:
https://www.opencompute.org/wiki/Server/OAI
Consume. Collaborate. Contribute.
Accelerator Form Factors
• Different Form Factors
PCIe CEM
OAM
M.2/Dual M.2
OCP NIC
Others
• Different Accelerator Targets
Training
Inference
Video
Others
Consume. Collaborate. Contribute.

More Related Content

PDF
Pcie basic
PDF
Creating Your Own PCI Express System Using FPGAs: Embedded World 2010
PPTX
PCIe Gen 3.0 Presentation @ 4th FPGA Camp
PDF
Shared Memory Centric Computing with CXL & OMI
ODP
PCIe DL_layer_3.0.1 (1)
PDF
Q1 Memory Fabric Forum: Memory Processor Interface 2023, Focus on CXL
PPTX
PPT
ARM - Advance RISC Machine
Pcie basic
Creating Your Own PCI Express System Using FPGAs: Embedded World 2010
PCIe Gen 3.0 Presentation @ 4th FPGA Camp
Shared Memory Centric Computing with CXL & OMI
PCIe DL_layer_3.0.1 (1)
Q1 Memory Fabric Forum: Memory Processor Interface 2023, Focus on CXL
ARM - Advance RISC Machine

What's hot (20)

PPTX
DPDK KNI interface
PDF
DPDK & Layer 4 Packet Processing
PDF
Intel core i3, i5, i7 , core2 duo and atom processors
PDF
The Google File System (GFS)
PPTX
Slideshare - PCIe
PPTX
Understanding DPDK
PDF
AI Chip Trends and Forecast
PPTX
All Presentations during CXL Forum at Flash Memory Summit 22
PPTX
AMD Chiplet Architecture for High-Performance Server and Desktop Products
 
PPTX
Automate_LSF_ppt_final
PPTX
CXL Consortium Update: Advancing Coherent Connectivity
PPTX
Dpdk applications
PPTX
Q1 Memory Fabric Forum: Big Memory Computing for AI
PPTX
Understanding DPDK algorithmics
PPTX
Broadcom PCIe & CXL Switches OCP Final.pptx
DOCX
Intel i3 processor
PDF
Q1 Memory Fabric Forum: Breaking Through the Memory Wall
PPTX
PDF
Understand and optimize Linux I/O
PDF
SP Routing Innovation with Segment Routing, VXLAN and EVPN - Ismail Ali
DPDK KNI interface
DPDK & Layer 4 Packet Processing
Intel core i3, i5, i7 , core2 duo and atom processors
The Google File System (GFS)
Slideshare - PCIe
Understanding DPDK
AI Chip Trends and Forecast
All Presentations during CXL Forum at Flash Memory Summit 22
AMD Chiplet Architecture for High-Performance Server and Desktop Products
 
Automate_LSF_ppt_final
CXL Consortium Update: Advancing Coherent Connectivity
Dpdk applications
Q1 Memory Fabric Forum: Big Memory Computing for AI
Understanding DPDK algorithmics
Broadcom PCIe & CXL Switches OCP Final.pptx
Intel i3 processor
Q1 Memory Fabric Forum: Breaking Through the Memory Wall
Understand and optimize Linux I/O
SP Routing Innovation with Segment Routing, VXLAN and EVPN - Ismail Ali
Ad

Similar to ODSA - OCP Accelerator Module and the Infrastructure (20)

PDF
Ics21 workshop decoupling compute from memory, storage & io with omi - ...
PDF
Decoupling Compute from Memory, Storage and IO with OMI
PDF
ODSA - Technical Introduction
PPT
OHM CAD SYSTEM Capabilities
DOC
Resume_Bhasker
DOC
Asar resume
PDF
Choose your dev platform
PDF
Open_IoT_Summit-Europe-2016-Building_a_Drone_from_scratch
PPT
Hardware Projects
PDF
SHUTTLE NCBT419 Bay trail REV A - Lenovo LNV 4030 LNV-004.pdf
PDF
FOSOH-V (TM) preliminary schematics
PDF
cisco-n9k-c9364c-datasheet.pdf
PDF
FRDM-KL46Z_Hands-On_Presentation_v02
PDF
Presentation for IoT workshop at Sinhagad University (Feb 4, 2016) - 2/2
PDF
Modicon M340_BMXP342020.pdf
PDF
Lenovo Yoga 500 14ACL LT415-AMD 14235-1 MB
PDF
Rockwell Automation In Stock Nov 2015
PPTX
Open19 9/2016
PPTX
Presentation_BTBox_2012
PDF
esquematico sony vaio vpg-h el completo.
Ics21 workshop decoupling compute from memory, storage & io with omi - ...
Decoupling Compute from Memory, Storage and IO with OMI
ODSA - Technical Introduction
OHM CAD SYSTEM Capabilities
Resume_Bhasker
Asar resume
Choose your dev platform
Open_IoT_Summit-Europe-2016-Building_a_Drone_from_scratch
Hardware Projects
SHUTTLE NCBT419 Bay trail REV A - Lenovo LNV 4030 LNV-004.pdf
FOSOH-V (TM) preliminary schematics
cisco-n9k-c9364c-datasheet.pdf
FRDM-KL46Z_Hands-On_Presentation_v02
Presentation for IoT workshop at Sinhagad University (Feb 4, 2016) - 2/2
Modicon M340_BMXP342020.pdf
Lenovo Yoga 500 14ACL LT415-AMD 14235-1 MB
Rockwell Automation In Stock Nov 2015
Open19 9/2016
Presentation_BTBox_2012
esquematico sony vaio vpg-h el completo.
Ad

More from ODSA Workgroup (20)

PDF
ODSA - Business Workstream
PDF
ODSA - PoC Planning & Call for Help
PDF
ODSA - DirectConnect Options for the PoC
PDF
ODSA - PoC Physical Design Elements
PDF
ODSA - NXP PoC Presentation
PDF
ODSA - PoC Requirements and Use Cases
PDF
ODSA - Speedster22i FPGA for POC
PDF
ODSA Design Tools for Chiplet-Based Design
PDF
ODSA - Power and Heat
PDF
ODSA - PIPE Adapters
PDF
ODSA - PHY Layer
PDF
OODSA - BoW: Basic, Fast, Turbo Die-to Die Open Interface Solutions
PDF
ODSA - Samsung Welcomes OCP's ODSA
PDF
ODSA - OCP Welcome
PDF
ODSA Sub-Project Launch
PDF
ODSA Workshop: Development Effort Summary
PDF
Design and Testing Challenges for Chiplet Based Design: Assembly and Test View
PDF
ODSA Proof of Concept SmartNIC Speeds & Feeds
PDF
ODSA Use Case - SmartNIC
PDF
Chiplets in Data Centers
ODSA - Business Workstream
ODSA - PoC Planning & Call for Help
ODSA - DirectConnect Options for the PoC
ODSA - PoC Physical Design Elements
ODSA - NXP PoC Presentation
ODSA - PoC Requirements and Use Cases
ODSA - Speedster22i FPGA for POC
ODSA Design Tools for Chiplet-Based Design
ODSA - Power and Heat
ODSA - PIPE Adapters
ODSA - PHY Layer
OODSA - BoW: Basic, Fast, Turbo Die-to Die Open Interface Solutions
ODSA - Samsung Welcomes OCP's ODSA
ODSA - OCP Welcome
ODSA Sub-Project Launch
ODSA Workshop: Development Effort Summary
Design and Testing Challenges for Chiplet Based Design: Assembly and Test View
ODSA Proof of Concept SmartNIC Speeds & Feeds
ODSA Use Case - SmartNIC
Chiplets in Data Centers

Recently uploaded (20)

PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Encapsulation theory and applications.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPT
Teaching material agriculture food technology
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
A Presentation on Artificial Intelligence
PDF
A comparative analysis of optical character recognition models for extracting...
PPTX
Big Data Technologies - Introduction.pptx
PPTX
Cloud computing and distributed systems.
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Empathic Computing: Creating Shared Understanding
PPTX
sap open course for s4hana steps from ECC to s4
DOCX
The AUB Centre for AI in Media Proposal.docx
Review of recent advances in non-invasive hemoglobin estimation
Encapsulation theory and applications.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Advanced methodologies resolving dimensionality complications for autism neur...
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Agricultural_Statistics_at_a_Glance_2022_0.pdf
NewMind AI Weekly Chronicles - August'25-Week II
20250228 LYD VKU AI Blended-Learning.pptx
Diabetes mellitus diagnosis method based random forest with bat algorithm
Teaching material agriculture food technology
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Unlocking AI with Model Context Protocol (MCP)
A Presentation on Artificial Intelligence
A comparative analysis of optical character recognition models for extracting...
Big Data Technologies - Introduction.pptx
Cloud computing and distributed systems.
Chapter 3 Spatial Domain Image Processing.pdf
Empathic Computing: Creating Shared Understanding
sap open course for s4hana steps from ECC to s4
The AUB Centre for AI in Media Proposal.docx

ODSA - OCP Accelerator Module and the Infrastructure