SlideShare a Scribd company logo
Siamak Tavallaei
CXL Advisor to the Board, CXL Consortium
Steering Committee, OCP
Nov 14, 2023
TheStateofCXL-relatedActivitieswithinOCP
SupercomputingConference(SC’23)
An Update:
CXL™ 3.1 Specification
Released! (Nov 14, 2023)
Presented by Siamak Tavallaei
CXL Advisor to the Board, CXL Consortium
EMBARGOED UNTIL NOVEMBER 14 AT 8:00 am EST
11
CXL 3.1
Featureenhancements
ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium.
• The CXL specification continues to evolve to meet new
usage models
• New features introduced in the CXL 3.1 specification:
• CXL Fabric Improvements/Extensions
• Scale-outof CXL fabrics using PBR (Port Based Routing)
• Trusted-Execution-Environment Security Protocol (TSP)
• Allows for Virtualization-basedTrustedExecutionEnvironments (TEEs)
to host ConfidentialComputing Workloads
• Memory Expander Improvements
• Up to 34-bit of metadata and RAS capability enhancements
CXL 3.1 Feature Enhancements
12
ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium.
13
CXL Fabric
Improvements/
Extensions
ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium.
CXL 3.1: Fabric Enhancement Features
Port-Based Routing (PBR) compared to Hierarchy-Based Routing (HBR)
Support fabric topologiesother thantreetopologiesthatHBR switches offer
• Address-based,non-prescriptiveroutingfor large memory fabrics
• Supports tree,mesh, ring,star,butterfly,and multi-dimensionaltopologies
ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium.
Host Host
CXL
CXLHBRSwitch
CXL HBR
Switch
CXL HBR
Switch
CXL HBR
Switch
Type-3
CXLMem
Type-3
CXLMem
Type-3
CXLMem
Type-3
CXLMem
Type-3
CXLMem
Type-3
CXLMem
CXL
PBR
Switc
h
Host Host Host Host
Host Host Host Host
CXL PBR
Switch
CXL PBR
Switch
CXL PBR
Switch
CXL PBR
Switch
Type-1/2/3
CXLDevice
Type-1/2/3
CXLDevice
Type-1/2/3
CXLDevice
Type-1/2/3
CXLDevice
Type-1/2/3
CXLDevice
Type-1/2/3
CXLDevice
Type-1/2/3
CXLDevice
Type-1/2/3
CXLDevice
ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium.
19
Trusted Execution Environment (TEE)
&
TEE Security Protocol (TSP)
ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium.
RECAP: CXL 2.0 Security Benefits
CXL 2.0 provides Integrity and Data Encryption (IDE) of traffic across
all entities (Root Complex, Switch, Device) at the Link Layer
CXL 2.0 Switch
CPU/SoC Root Complex CXL Device
Home Agent
MC
Host
Memory
Coherent Bridge
IO Bridge IOMMU CXL.memory
CXL.io
Area of
Protection
Coherent Cache
(Optional)
DTLB
MC
Device
Memory
CXL.cache
ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium.
CXL 3.1 Trusted Security Protocol (TSP)
Allows for Virtualization-based, Trusted Execution Environments (TEEs)
to host Confidential Computing Workloads (CC WL)
Benefits:
• Freedom tomigratesensitiveWLs to TSP-enabledClouds
• Collaborationwithmultiplepartiesfor sharingdata
• Conform to Compliance & Datasovereigntyprograms
• StrengthenApplicationsecurity& SoftwareIP protection
KeyCapabilities:
• Separationbetween TVM*& CSP’s infrastructure(VMM)
• Configurationof CXL device
• Encryptionof sensitivedatain both Host/Devicememory
• Cryptographicallyverify correctconfigurationof trusted
computingenvironment
*TVM = Trusted VM
ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium.
CXL Device
CXL Link
MC
TVM*
VM
VM
VM
VM
CXL.io
CXL.mem
CXL IDE
(Defined in CXL v2.0)
TEE Capable Host
MC
Device
Memory
Device
Memory
Host
Mem
Host
Mem
CXL.io
CXL.mem
Elements of TSP / TSP Overview
22
ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium.
TSP Components for
Confidential Computing
• TrustedExecutionState& Access Control
• How access to memoryis controlled
• Configuration
• Ability to determinethe supported security
featureson the device, enable required
features,and lock the configuration
• Attestation& Authentication
• Trusting who you are talking to
• Memory Encryption
• Encrypting data-at-rest
• Transport Security
• Encrypting the link to protect data-in-flight
and detect/preventphysical attacks
Confidential Computing (initiator)
CXL TrustedExecutionEnvironment SecurityProtocol (TSP)
Attestation
Authentication
Trusted Execution
State & Access
Control
Memory
Encryption
(Data-at-rest)
Transport
Security
(Data-in-flight)
Configuration
CXL memoryexpander (target)
HDM-H (CXL 3.1), HDM-DB (CXL 3.1 ECN)
23
Memory
Expander
Improvements
ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium.
CXL at OCP
CXL Specification Feature Summary
Features CXL 1.0 / 1.1 CXL 2.0 CXL 3.0 CXL 3.1
Releasedate 2019 2020 August 2022 November2023
Max link rate 32GT/s 32GT/s 64GT/s 64GT/s
Flit 68 byte (up to 32 GT/s)    
Type 1, Type 2 and Type 3 Devices    
MemoryPooling w/ MLDs   
Global PersistentFlush   
CXL IDE   
Switching (Single-level)   
Switching (Multi-level)  
Multiple Type 1/Type 2 devicesper rootport  
Direct memoryaccess for peer-to-peer  
256-byte Flit (up to 64 GT/s PAM4)  
256-byte Flit (Enhanced coherency)  
256-byte Flit (Memorysharing)  
256-byte Flit (Fabric capabilities)  
Fabric Manager API definition for PBR Switch 
Host-to-Hostcommunication with Global Integrated Memory(GIM) concept 
Trusted-Execution-Environment(TEE) SecurityProtocol 
Memoryexpander enhancements (up to 34-bit ofmeta data, RAS capability enhancements) 
ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium.
 Supported
Not Supported
Siamak Tavallaei
Chief Systems Architect
CXL Advisor to the Board, CXL Consortium
Excerpts from:
The State of CXL-related Activities
within OCP
CXL™ Forum at OCP Summit
CXL as the standard protocol for data-movement
through coherent memory and for the associated
management and system composability
OCP as a community to realize integrated systems
around CXL
Proposal for an OCP reference system for CXL-
enabled compute disaggregation
CXL Technical Work Groups
CXL Consortium
https://www.computeexpresslink.org
CXL Board of Directors (BoD)
https://www.computeexpresslink.org/meettheboard
Marketing Work Group (MWG)
Technical Task Force (TTF)
Active CXL WGs run under Technical Task Force (TTF)
PWG (Protocol WG)
SSWG (System and Software WG)
PHY (Physical and Link Work Group)
MSWG (Memory System WG)
Compliance WG
OCP Server Project
Active OCP Server Project subprojects and workstreams
Drive contributions (Base Spec, Design Spec, Products, …)
https://www.opencompute.org/wiki/Server
CMS (Composable Memory System)
https://www.opencompute.org/projects/composable-memory-system
DC-MHS (Datacenter-ready Modular Hardware System)
https://www.opencompute.org/projects/dc-mhs
Extended Connectivity Workstream (for PCIe and CXL)
https://www.opencompute.org/wiki/Server/PCIe_Extended_Connectivity_Requirements_Workstream
ODSA (Open Domain-Specific Architecture)
https://www.opencompute.org/wiki/Server/ODSA
OAI (Open Accelerator Infrastructure)
https://www.opencompute.org/wiki/Server/OAI
HPC (High-performance Computing)
https://www.opencompute.org/wiki/HPC
OCP NIC
https://www.opencompute.org/wiki/Server/NIC
CXL Resources
36
Webinars:
• Webinar: A look into the CXLdevice ecosystemand the evolutionof CXL use cases
• Webinar: Introducing CXL 3.0: Enabling composable systems with expanded fabric capabilities
• Webinar: CXL 1.1 vs. CXL 2.0 – What’s the difference?
• Webinar Archive: https://www.computeexpresslink.org/webinars
Blogs:
• Upcoming Webinar: A Look into the CXL™ DeviceEcosystemand the Evolution of CXL Use Cases
• CXL 3.0 Webinar Q&A Recap
• CXL™ Consortium MemberSpotlight: UnifabriX
• Blog Archive: https://www.computeexpresslink.org/blog
White Papers:
• CXL 3.0 specification
• An Overviewof Reliability, Availability, and Serviceability(RAS) in ComputeExpress Link™ 2.0
ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium.
CXL at OCP
CXL at OCP
CXL at OCP
A converged memory environment
is the main key
for ease of software programing!
and efficient data-movement!
11/21/2023
ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium.
• Use cases keep driving the need for higher IO and memory bandwidth:
e.g., high performance accelerators, system memory, SmartNIC etc.
• CPU capability requiring more memory capacity and bandwidth per core
• Efficient peer-to-peer resource-sharing & messaging across multiple domains
• Need to overcome memory bottlenecks due to CPU pin and thermal constraints
Industry trends
CXL at OCP
CXL at OCP
CXL at OCP
Provide
New Capabilities
CXL at OCP
CXL at OCP
CXL at OCP
The CXL 3.0 Spec
allows for a large
interconnected Fabric
ComputeExpressLink™ andCXL™Consortiumaretrademarksof the ComputeExpressLink Consortium. 59
CXL3.0Fabrics
Composable Systems with Spine/Leaf Architecture
CXL 3.0Fabric Architecture
• InterconnectedSpine Switch System
• Leaf Switch NIC Enclosure
• Leaf Switch CPU Enclosure
• Leaf Switch AcceleratorEnclosure
• Leaf Switch Memory Enclosure
CXL Switch
Accelerator Memory CPUs CPUs
CXL Switch
GFAM GFAM GFAM
CXL Switch CXL Switch
NIC NIC
CXL Switch
CXL Switch Spine
Switches
LeafSwitches
EndDevices
Fabric
Manager
Example
traffic
flow
ComputeExpressLink™ andCXL™Consortiumaretrademarksof the ComputeExpressLink Consortium. 60
CXL3.1Fabrics
Enables more efficient “East/West” interconnect
CXL
3.0/3.1
Composable Fabric growth for disaggregation/pooling/accelerator
Composable Architecture
n e w p a ra d i g m fo r
d i s a g g re ga te d c o m p u t i n g !
H P C , A I / M L , I n - m e m o r y D ata b a s e s
CXL at OCP
CXL at OCP
CXL at OCP
CXL at OCP
CXL at OCP
CXL at OCP
CXL at OCP
Challenges
Complex
Interconnect
?
Successful Disaggregation Approach
First do no harm
 The OS running on a Server: The Platform and the CXL Fabric Manager provide
the same experience as a static server system
 Remedy every new fault mode
 Ride on PCIe, UEFI, and traditional RAS and Security
 Reduce the problem to that which has been solved before!
Put things where they belong
• Partition the system efficiently: ease of use, serviceability, maintenance
While pushing the envelop, if it hurts, don’t do it!
• Retreat from the extremes and avoid too many variables for the first generation
• Fail Fast, learn, and grow the solution through PoCs
Successful Disaggregation Approach
First do no harm
 The OS running on a Server: The Platform and the CXL Fabric Manager provide
the same experience as a static server system
 Remedy every new fault mode
 Ride on PCIe, UEFI, and traditional RAS and Security
 Reduce the problem to that which has been solved before!
Put things where they belong
• Partition the system efficiently: ease of use, serviceability, maintenance
While pushing the envelop, if it hurts, don’t do it!
• Retreat from the extremes and avoid too many variables for the first generation
• Fail Fast, learn, and grow the solution through PoCs
Successful Disaggregation Approach
First do no harm
 The OS running on a Server: The Platform and the CXL Fabric Manager provide
the same experience as a static server system
 Remedy every new fault mode
 Ride on PCIe, UEFI, and traditional RAS and Security
 Reduce the problem to that which has been solved before!
Put things where they belong
• Partition the system efficiently: ease of use, serviceability, maintenance
While pushing the envelop, if it hurts, don’t do it!
• Retreat from the extremes and avoid too many variables for the first generation
• Fail Fast, learn, and grow the solution through PoCs
Based on the Modular Building Block Architecture (MBA),
Build a reference hardware system
to allow the software architecture
to be ready for
the full set of features!
CXL at OCP
CXL at OCP
CXL at OCP
CXL at OCP
CXL at OCP
CXL at OCP
CXL at OCP
Enablers (Software and Firmware Ingredients)
CXL Fabric Manager
• Secure composability, allocation, on-lining/off-lining
Pre-boot Environment
• Discovery, enumeration, setup, …
CXL Bus/Class Driver
• Configuration, Resource Allocation
CXL Memory Device Driver
• Interactions with Bus/Class Driver, Fabric Manager, VMM, …
• RAS, Security, Fault-isolation, On-lining, Off-lining, …
• Error Isolation, Telemetry, Performance Monitoring
OS-specific Software
• VMM, Hypervisor
• VM Allocation, Orchestration, Fault-isolation & Recovery
O C P
i s t h e p l a c e w h e r e w e
R e a l i z e
Te c h n o l o g i e s
i n t o
I n t e g r a t e d S y s t e m s
Activities within OCP Server Project
in support of CXL-enabled Systems
CMS (Composable Memory System)
Software Infrastructure for managing tiered, composable, disaggregated systems
https://www.opencompute.org/projects/composable-memory-system
DC-MHS (Datacenter-ready Modular Hardware System)
M-SIF (modular shared infrastructure)
Partition the system efficiently: ease of use, serviceability, maintenance
https://www.opencompute.org/projects/dc-mhs
https://www.opencompute.org/wiki/Server/DC-MHS
Extended Connectivity Workstream (for PCIe and CXL)
https://www.opencompute.org/wiki/Server/PCIe_Extended_Connectivity_Requirements_Workstream
Interconnect for the Disaggregated Computing
Local Disaggregation within a Chassis
With the option to Extend connectivity to Expansion Chassis
Considerations for Copper and Photonic Interconnect
CXL at OCP
CXL at OCP
B a s ed on
O C P
O p en Ac c el er a tor
I nf r as tructure
(OA I )
CXL at OCP
CXL at OCP
Summary:
Successful Disaggregation Approach (at OCP)
First do no harm (software compatibility, security, and management: CMS)
 The OS running on a Server: The Platform and the CXL Fabric Manager provide the
same experience as a static server system
 Remedy every new fault mode
 Ride on PCIe, UEFI, and traditional RAS and Security
 Reduce the problem to that which has been solved before!
Put things where they belong (modular hardware system: DC-MHS/M-SIF)
 Partition the system efficiently: ease of use, serviceability, maintenance
While pushing the envelop, if it hurts, don’t do it! (robust Extendeded Connectivity)
• Retreat from the extremes; avoid too many variables for the first generation
• Fail Fast, learn, and grow the solution through PoCs
CXL at OCP
Siamak Tavallaei has recently served as the CXL Consortium President, Chief
Systems Architect at Google Cloud, and the Incubation Committee (IC)
Representative for the Server Project. He is currently the CXL Advisor to the
Board at CXL Consortium and actively participates in OCP Steering
Committee. His current focus is the system optimization for large-scale,
mega-datacenters for general-purpose and tightly-connected, accelerated
machines built on co-designed hardware, software, security, and
management. He continues to drive the architecture and productization of
CXL-enabled solutions for AI/ML, HPC, and large memory-footprint
Databases. In 2016, he joined OCP as a co-lead of Server Project where he
drove open-sourced modular design concepts for integrated
hardware/software solutions (OAI, DC-SCM, CMS, DC-MHS, and DC-Stack).
His experiences as Chief Systems Architect at Google, Principal Architect at
Microsoft Azure, Distinguished Technologist at HP, and Principal Member of
Technical Staff at Compaq along with his contributions to industry
collaborations such as EISA, PCI, InfiniBand, and CXL give Siamak a broad
understanding of requirements and solutions for the Enterprise, Hyperscale,
and Edge datacenters and industry-wide initiatives.
Bio

More Related Content

PPTX
All Presentations during CXL Forum at Flash Memory Summit 22
PPTX
Broadcom PCIe & CXL Switches OCP Final.pptx
PPTX
Microchip: CXL Use Cases and Enabling Ecosystem
PPTX
MemVerge: The Software Stack for CXL Environments
PPTX
Arm: Enabling CXL devices within the Data Center with Arm Solutions
PPTX
MemVerge: Past Present and Future of CXL
PPTX
Introduction to CXL Fabrics
PPTX
CXL Fabric Management Standards
All Presentations during CXL Forum at Flash Memory Summit 22
Broadcom PCIe & CXL Switches OCP Final.pptx
Microchip: CXL Use Cases and Enabling Ecosystem
MemVerge: The Software Stack for CXL Environments
Arm: Enabling CXL devices within the Data Center with Arm Solutions
MemVerge: Past Present and Future of CXL
Introduction to CXL Fabrics
CXL Fabric Management Standards

What's hot (20)

PPTX
Q1 Memory Fabric Forum: Building Fast and Secure Chips with CXL IP
PPTX
Past Present and Future of CXL
PDF
System Software Guide to CXL - Linux Kernel Meetup 2024.pdf
PPTX
CXL chapter1 and chapter 2 presentation.pptx
PDF
Hardware Acceleration for Machine Learning
PPTX
Q1 Memory Fabric Forum: Memory expansion with CXL-Ready Systems and Devices
PPTX
Micron: Memory Expansion with CXL Modules: Benefits, Use Cases and Enriching ...
PPTX
Q1 Memory Fabric Forum: Compute Express Link (CXL) 3.1 Update
PPT
Pcie drivers basics
PDF
PCI Express Verification using Reference Modeling
PDF
Q1 Memory Fabric Forum: Memory Processor Interface 2023, Focus on CXL
PPTX
3D V-Cache
 
PDF
IP PCIe
PDF
Deep learning: Hardware Landscape
PPT
AXI_presentation.ppt
PPTX
Enfabrica - Bridging the Network and Memory Worlds
PDF
POWER10 innovations for HPC
PPTX
CXL Memory Expansion, Pooling, Sharing, FAM Enablement, and Switching
PPT
Timing and Design Closure in Physical Design Flows
PPTX
AMD Chiplet Architecture for High-Performance Server and Desktop Products
 
Q1 Memory Fabric Forum: Building Fast and Secure Chips with CXL IP
Past Present and Future of CXL
System Software Guide to CXL - Linux Kernel Meetup 2024.pdf
CXL chapter1 and chapter 2 presentation.pptx
Hardware Acceleration for Machine Learning
Q1 Memory Fabric Forum: Memory expansion with CXL-Ready Systems and Devices
Micron: Memory Expansion with CXL Modules: Benefits, Use Cases and Enriching ...
Q1 Memory Fabric Forum: Compute Express Link (CXL) 3.1 Update
Pcie drivers basics
PCI Express Verification using Reference Modeling
Q1 Memory Fabric Forum: Memory Processor Interface 2023, Focus on CXL
3D V-Cache
 
IP PCIe
Deep learning: Hardware Landscape
AXI_presentation.ppt
Enfabrica - Bridging the Network and Memory Worlds
POWER10 innovations for HPC
CXL Memory Expansion, Pooling, Sharing, FAM Enablement, and Switching
Timing and Design Closure in Physical Design Flows
AMD Chiplet Architecture for High-Performance Server and Desktop Products
 
Ad

Similar to CXL at OCP (20)

PPTX
CXL Consortium Update: Advancing Coherent Connectivity
PPTX
Compute Express Link: Advancing Coherent Connectivity
PPTX
Q1 Memory Fabric Forum: XConn CXL Switches for AI
PPTX
Micron CXL product and architecture update
PPTX
Synopsys: Achieve First Pass Silicon Success with Synopsys CXL IP Solutions
PDF
Q1 Memory Fabric Forum: Breaking Through the Memory Wall
PPTX
CXL Consortium Update
PPTX
The State of CXL-related Activities within OCP
PPTX
Evaluating UCIe based multi-die SoC to meet timing and power
PDF
LinkedIn OpenFabric Project - Interop 2017
PDF
Robert page-abstract
PDF
Exploiting rateless codes in cloud storage systems
PPTX
Trends and challenges in IP based SOC design
PDF
An octa core processor with shared memory and message-passing
PDF
Scale-out AI Training on Massive Core System from HPC to Fabric-based SOC
PPTX
HiPEAC 2022_Marco Tassemeier presentation
PPTX
HiPEAC Computing Systems Week 2022_Mario Porrmann presentation
PDF
Barcelona Supercomputing Center, Generador de Riqueza
PPTX
Synopsys Fusion Compiler-Comprehensive RTL-to-GDSII Implementation System
PDF
MACHBASE_NEO
CXL Consortium Update: Advancing Coherent Connectivity
Compute Express Link: Advancing Coherent Connectivity
Q1 Memory Fabric Forum: XConn CXL Switches for AI
Micron CXL product and architecture update
Synopsys: Achieve First Pass Silicon Success with Synopsys CXL IP Solutions
Q1 Memory Fabric Forum: Breaking Through the Memory Wall
CXL Consortium Update
The State of CXL-related Activities within OCP
Evaluating UCIe based multi-die SoC to meet timing and power
LinkedIn OpenFabric Project - Interop 2017
Robert page-abstract
Exploiting rateless codes in cloud storage systems
Trends and challenges in IP based SOC design
An octa core processor with shared memory and message-passing
Scale-out AI Training on Massive Core System from HPC to Fabric-based SOC
HiPEAC 2022_Marco Tassemeier presentation
HiPEAC Computing Systems Week 2022_Mario Porrmann presentation
Barcelona Supercomputing Center, Generador de Riqueza
Synopsys Fusion Compiler-Comprehensive RTL-to-GDSII Implementation System
MACHBASE_NEO
Ad

More from Memory Fabric Forum (20)

PPTX
H3 Platform CXL Solution_Memory Fabric Forum.pptx
PDF
Q1 Memory Fabric Forum: ZeroPoint. Remove the waste. Release the power.
PPTX
Q1 Memory Fabric Forum: Using CXL with AI Applications - Steve Scargall.pptx
PPTX
Q1 Memory Fabric Forum: About MindShare Training
PPTX
Q1 Memory Fabric Forum: CXL-Related Activities within OCP
PDF
Q1 Memory Fabric Forum: CXL Controller by Montage Technology
PDF
Q1 Memory Fabric Forum: Teledyne LeCroy | Austin Labs
PDF
Q1 Memory Fabric Forum: SMART CXL Product Lineup
PDF
Q1 Memory Fabric Forum: CXL Form Factor Primer
PDF
Q1 Memory Fabric Forum: Memory Fabric in a Composable System
PPTX
Q1 Memory Fabric Forum: Big Memory Computing for AI
PDF
Q1 Memory Fabric Forum: Micron CXL-Compatible Memory Modules
PPTX
Q1 Memory Fabric Forum: Advantages of Optical CXL​ for Disaggregated Compute ...
PPTX
Q1 Memory Fabric Forum: Intel Enabling Compute Express Link (CXL)
PDF
Q1 Memory Fabric Forum: VMware Memory Vision
PPTX
MemVerge: Memory Expansion Without Breaking the Budget
PPTX
Micron - CXL Enabling New Pliability in the Modern Data Center.pptx
PPTX
Photowave Presentation Slides - 11.8.23.pptx
PPTX
TE Connectivity: Card Edge Interconnects
PPTX
Samsung: CMM-H Tiered Memory Solution with Built-in DRAM
H3 Platform CXL Solution_Memory Fabric Forum.pptx
Q1 Memory Fabric Forum: ZeroPoint. Remove the waste. Release the power.
Q1 Memory Fabric Forum: Using CXL with AI Applications - Steve Scargall.pptx
Q1 Memory Fabric Forum: About MindShare Training
Q1 Memory Fabric Forum: CXL-Related Activities within OCP
Q1 Memory Fabric Forum: CXL Controller by Montage Technology
Q1 Memory Fabric Forum: Teledyne LeCroy | Austin Labs
Q1 Memory Fabric Forum: SMART CXL Product Lineup
Q1 Memory Fabric Forum: CXL Form Factor Primer
Q1 Memory Fabric Forum: Memory Fabric in a Composable System
Q1 Memory Fabric Forum: Big Memory Computing for AI
Q1 Memory Fabric Forum: Micron CXL-Compatible Memory Modules
Q1 Memory Fabric Forum: Advantages of Optical CXL​ for Disaggregated Compute ...
Q1 Memory Fabric Forum: Intel Enabling Compute Express Link (CXL)
Q1 Memory Fabric Forum: VMware Memory Vision
MemVerge: Memory Expansion Without Breaking the Budget
Micron - CXL Enabling New Pliability in the Modern Data Center.pptx
Photowave Presentation Slides - 11.8.23.pptx
TE Connectivity: Card Edge Interconnects
Samsung: CMM-H Tiered Memory Solution with Built-in DRAM

Recently uploaded (20)

PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
Cloud computing and distributed systems.
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Approach and Philosophy of On baking technology
PDF
KodekX | Application Modernization Development
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
GamePlan Trading System Review: Professional Trader's Honest Take
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Advanced IT Governance
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
PDF
Machine learning based COVID-19 study performance prediction
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
MYSQL Presentation for SQL database connectivity
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Cloud computing and distributed systems.
Network Security Unit 5.pdf for BCA BBA.
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Approach and Philosophy of On baking technology
KodekX | Application Modernization Development
Review of recent advances in non-invasive hemoglobin estimation
GamePlan Trading System Review: Professional Trader's Honest Take
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Per capita expenditure prediction using model stacking based on satellite ima...
Understanding_Digital_Forensics_Presentation.pptx
Advanced IT Governance
Mobile App Security Testing_ A Comprehensive Guide.pdf
Diabetes mellitus diagnosis method based random forest with bat algorithm
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
Machine learning based COVID-19 study performance prediction
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
MYSQL Presentation for SQL database connectivity

CXL at OCP

  • 1. Siamak Tavallaei CXL Advisor to the Board, CXL Consortium Steering Committee, OCP Nov 14, 2023 TheStateofCXL-relatedActivitieswithinOCP SupercomputingConference(SC’23)
  • 2. An Update: CXL™ 3.1 Specification Released! (Nov 14, 2023) Presented by Siamak Tavallaei CXL Advisor to the Board, CXL Consortium EMBARGOED UNTIL NOVEMBER 14 AT 8:00 am EST
  • 4. • The CXL specification continues to evolve to meet new usage models • New features introduced in the CXL 3.1 specification: • CXL Fabric Improvements/Extensions • Scale-outof CXL fabrics using PBR (Port Based Routing) • Trusted-Execution-Environment Security Protocol (TSP) • Allows for Virtualization-basedTrustedExecutionEnvironments (TEEs) to host ConfidentialComputing Workloads • Memory Expander Improvements • Up to 34-bit of metadata and RAS capability enhancements CXL 3.1 Feature Enhancements 12 ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium.
  • 6. CXL 3.1: Fabric Enhancement Features Port-Based Routing (PBR) compared to Hierarchy-Based Routing (HBR) Support fabric topologiesother thantreetopologiesthatHBR switches offer • Address-based,non-prescriptiveroutingfor large memory fabrics • Supports tree,mesh, ring,star,butterfly,and multi-dimensionaltopologies ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium. Host Host CXL CXLHBRSwitch CXL HBR Switch CXL HBR Switch CXL HBR Switch Type-3 CXLMem Type-3 CXLMem Type-3 CXLMem Type-3 CXLMem Type-3 CXLMem Type-3 CXLMem CXL PBR Switc h Host Host Host Host Host Host Host Host CXL PBR Switch CXL PBR Switch CXL PBR Switch CXL PBR Switch Type-1/2/3 CXLDevice Type-1/2/3 CXLDevice Type-1/2/3 CXLDevice Type-1/2/3 CXLDevice Type-1/2/3 CXLDevice Type-1/2/3 CXLDevice Type-1/2/3 CXLDevice Type-1/2/3 CXLDevice
  • 8. 19 Trusted Execution Environment (TEE) & TEE Security Protocol (TSP) ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium.
  • 9. RECAP: CXL 2.0 Security Benefits CXL 2.0 provides Integrity and Data Encryption (IDE) of traffic across all entities (Root Complex, Switch, Device) at the Link Layer CXL 2.0 Switch CPU/SoC Root Complex CXL Device Home Agent MC Host Memory Coherent Bridge IO Bridge IOMMU CXL.memory CXL.io Area of Protection Coherent Cache (Optional) DTLB MC Device Memory CXL.cache ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium.
  • 10. CXL 3.1 Trusted Security Protocol (TSP) Allows for Virtualization-based, Trusted Execution Environments (TEEs) to host Confidential Computing Workloads (CC WL) Benefits: • Freedom tomigratesensitiveWLs to TSP-enabledClouds • Collaborationwithmultiplepartiesfor sharingdata • Conform to Compliance & Datasovereigntyprograms • StrengthenApplicationsecurity& SoftwareIP protection KeyCapabilities: • Separationbetween TVM*& CSP’s infrastructure(VMM) • Configurationof CXL device • Encryptionof sensitivedatain both Host/Devicememory • Cryptographicallyverify correctconfigurationof trusted computingenvironment *TVM = Trusted VM ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium. CXL Device CXL Link MC TVM* VM VM VM VM CXL.io CXL.mem CXL IDE (Defined in CXL v2.0) TEE Capable Host MC Device Memory Device Memory Host Mem Host Mem CXL.io CXL.mem
  • 11. Elements of TSP / TSP Overview 22 ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium. TSP Components for Confidential Computing • TrustedExecutionState& Access Control • How access to memoryis controlled • Configuration • Ability to determinethe supported security featureson the device, enable required features,and lock the configuration • Attestation& Authentication • Trusting who you are talking to • Memory Encryption • Encrypting data-at-rest • Transport Security • Encrypting the link to protect data-in-flight and detect/preventphysical attacks Confidential Computing (initiator) CXL TrustedExecutionEnvironment SecurityProtocol (TSP) Attestation Authentication Trusted Execution State & Access Control Memory Encryption (Data-at-rest) Transport Security (Data-in-flight) Configuration CXL memoryexpander (target) HDM-H (CXL 3.1), HDM-DB (CXL 3.1 ECN)
  • 14. CXL Specification Feature Summary Features CXL 1.0 / 1.1 CXL 2.0 CXL 3.0 CXL 3.1 Releasedate 2019 2020 August 2022 November2023 Max link rate 32GT/s 32GT/s 64GT/s 64GT/s Flit 68 byte (up to 32 GT/s)     Type 1, Type 2 and Type 3 Devices     MemoryPooling w/ MLDs    Global PersistentFlush    CXL IDE    Switching (Single-level)    Switching (Multi-level)   Multiple Type 1/Type 2 devicesper rootport   Direct memoryaccess for peer-to-peer   256-byte Flit (up to 64 GT/s PAM4)   256-byte Flit (Enhanced coherency)   256-byte Flit (Memorysharing)   256-byte Flit (Fabric capabilities)   Fabric Manager API definition for PBR Switch  Host-to-Hostcommunication with Global Integrated Memory(GIM) concept  Trusted-Execution-Environment(TEE) SecurityProtocol  Memoryexpander enhancements (up to 34-bit ofmeta data, RAS capability enhancements)  ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium.  Supported Not Supported
  • 15. Siamak Tavallaei Chief Systems Architect CXL Advisor to the Board, CXL Consortium Excerpts from: The State of CXL-related Activities within OCP CXL™ Forum at OCP Summit
  • 16. CXL as the standard protocol for data-movement through coherent memory and for the associated management and system composability OCP as a community to realize integrated systems around CXL Proposal for an OCP reference system for CXL- enabled compute disaggregation
  • 17. CXL Technical Work Groups CXL Consortium https://www.computeexpresslink.org CXL Board of Directors (BoD) https://www.computeexpresslink.org/meettheboard Marketing Work Group (MWG) Technical Task Force (TTF) Active CXL WGs run under Technical Task Force (TTF) PWG (Protocol WG) SSWG (System and Software WG) PHY (Physical and Link Work Group) MSWG (Memory System WG) Compliance WG
  • 18. OCP Server Project Active OCP Server Project subprojects and workstreams Drive contributions (Base Spec, Design Spec, Products, …) https://www.opencompute.org/wiki/Server CMS (Composable Memory System) https://www.opencompute.org/projects/composable-memory-system DC-MHS (Datacenter-ready Modular Hardware System) https://www.opencompute.org/projects/dc-mhs Extended Connectivity Workstream (for PCIe and CXL) https://www.opencompute.org/wiki/Server/PCIe_Extended_Connectivity_Requirements_Workstream ODSA (Open Domain-Specific Architecture) https://www.opencompute.org/wiki/Server/ODSA OAI (Open Accelerator Infrastructure) https://www.opencompute.org/wiki/Server/OAI HPC (High-performance Computing) https://www.opencompute.org/wiki/HPC OCP NIC https://www.opencompute.org/wiki/Server/NIC
  • 19. CXL Resources 36 Webinars: • Webinar: A look into the CXLdevice ecosystemand the evolutionof CXL use cases • Webinar: Introducing CXL 3.0: Enabling composable systems with expanded fabric capabilities • Webinar: CXL 1.1 vs. CXL 2.0 – What’s the difference? • Webinar Archive: https://www.computeexpresslink.org/webinars Blogs: • Upcoming Webinar: A Look into the CXL™ DeviceEcosystemand the Evolution of CXL Use Cases • CXL 3.0 Webinar Q&A Recap • CXL™ Consortium MemberSpotlight: UnifabriX • Blog Archive: https://www.computeexpresslink.org/blog White Papers: • CXL 3.0 specification • An Overviewof Reliability, Availability, and Serviceability(RAS) in ComputeExpress Link™ 2.0 ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium.
  • 23. A converged memory environment is the main key for ease of software programing! and efficient data-movement!
  • 24. 11/21/2023 ComputeExpressLink™ andCXL™aretrademarksof the ComputeExpressLinkConsortium. • Use cases keep driving the need for higher IO and memory bandwidth: e.g., high performance accelerators, system memory, SmartNIC etc. • CPU capability requiring more memory capacity and bandwidth per core • Efficient peer-to-peer resource-sharing & messaging across multiple domains • Need to overcome memory bottlenecks due to CPU pin and thermal constraints Industry trends
  • 32. The CXL 3.0 Spec allows for a large interconnected Fabric
  • 33. ComputeExpressLink™ andCXL™Consortiumaretrademarksof the ComputeExpressLink Consortium. 59 CXL3.0Fabrics Composable Systems with Spine/Leaf Architecture CXL 3.0Fabric Architecture • InterconnectedSpine Switch System • Leaf Switch NIC Enclosure • Leaf Switch CPU Enclosure • Leaf Switch AcceleratorEnclosure • Leaf Switch Memory Enclosure CXL Switch Accelerator Memory CPUs CPUs CXL Switch GFAM GFAM GFAM CXL Switch CXL Switch NIC NIC CXL Switch CXL Switch Spine Switches LeafSwitches EndDevices Fabric Manager Example traffic flow
  • 34. ComputeExpressLink™ andCXL™Consortiumaretrademarksof the ComputeExpressLink Consortium. 60 CXL3.1Fabrics Enables more efficient “East/West” interconnect CXL 3.0/3.1 Composable Fabric growth for disaggregation/pooling/accelerator
  • 35. Composable Architecture n e w p a ra d i g m fo r d i s a g g re ga te d c o m p u t i n g ! H P C , A I / M L , I n - m e m o r y D ata b a s e s
  • 45. Successful Disaggregation Approach First do no harm  The OS running on a Server: The Platform and the CXL Fabric Manager provide the same experience as a static server system  Remedy every new fault mode  Ride on PCIe, UEFI, and traditional RAS and Security  Reduce the problem to that which has been solved before! Put things where they belong • Partition the system efficiently: ease of use, serviceability, maintenance While pushing the envelop, if it hurts, don’t do it! • Retreat from the extremes and avoid too many variables for the first generation • Fail Fast, learn, and grow the solution through PoCs
  • 46. Successful Disaggregation Approach First do no harm  The OS running on a Server: The Platform and the CXL Fabric Manager provide the same experience as a static server system  Remedy every new fault mode  Ride on PCIe, UEFI, and traditional RAS and Security  Reduce the problem to that which has been solved before! Put things where they belong • Partition the system efficiently: ease of use, serviceability, maintenance While pushing the envelop, if it hurts, don’t do it! • Retreat from the extremes and avoid too many variables for the first generation • Fail Fast, learn, and grow the solution through PoCs
  • 47. Successful Disaggregation Approach First do no harm  The OS running on a Server: The Platform and the CXL Fabric Manager provide the same experience as a static server system  Remedy every new fault mode  Ride on PCIe, UEFI, and traditional RAS and Security  Reduce the problem to that which has been solved before! Put things where they belong • Partition the system efficiently: ease of use, serviceability, maintenance While pushing the envelop, if it hurts, don’t do it! • Retreat from the extremes and avoid too many variables for the first generation • Fail Fast, learn, and grow the solution through PoCs
  • 48. Based on the Modular Building Block Architecture (MBA), Build a reference hardware system to allow the software architecture to be ready for the full set of features!
  • 56. Enablers (Software and Firmware Ingredients) CXL Fabric Manager • Secure composability, allocation, on-lining/off-lining Pre-boot Environment • Discovery, enumeration, setup, … CXL Bus/Class Driver • Configuration, Resource Allocation CXL Memory Device Driver • Interactions with Bus/Class Driver, Fabric Manager, VMM, … • RAS, Security, Fault-isolation, On-lining, Off-lining, … • Error Isolation, Telemetry, Performance Monitoring OS-specific Software • VMM, Hypervisor • VM Allocation, Orchestration, Fault-isolation & Recovery
  • 57. O C P i s t h e p l a c e w h e r e w e R e a l i z e Te c h n o l o g i e s i n t o I n t e g r a t e d S y s t e m s
  • 58. Activities within OCP Server Project in support of CXL-enabled Systems CMS (Composable Memory System) Software Infrastructure for managing tiered, composable, disaggregated systems https://www.opencompute.org/projects/composable-memory-system DC-MHS (Datacenter-ready Modular Hardware System) M-SIF (modular shared infrastructure) Partition the system efficiently: ease of use, serviceability, maintenance https://www.opencompute.org/projects/dc-mhs https://www.opencompute.org/wiki/Server/DC-MHS Extended Connectivity Workstream (for PCIe and CXL) https://www.opencompute.org/wiki/Server/PCIe_Extended_Connectivity_Requirements_Workstream Interconnect for the Disaggregated Computing Local Disaggregation within a Chassis With the option to Extend connectivity to Expansion Chassis Considerations for Copper and Photonic Interconnect
  • 61. B a s ed on O C P O p en Ac c el er a tor I nf r as tructure (OA I )
  • 64. Summary: Successful Disaggregation Approach (at OCP) First do no harm (software compatibility, security, and management: CMS)  The OS running on a Server: The Platform and the CXL Fabric Manager provide the same experience as a static server system  Remedy every new fault mode  Ride on PCIe, UEFI, and traditional RAS and Security  Reduce the problem to that which has been solved before! Put things where they belong (modular hardware system: DC-MHS/M-SIF)  Partition the system efficiently: ease of use, serviceability, maintenance While pushing the envelop, if it hurts, don’t do it! (robust Extendeded Connectivity) • Retreat from the extremes; avoid too many variables for the first generation • Fail Fast, learn, and grow the solution through PoCs
  • 66. Siamak Tavallaei has recently served as the CXL Consortium President, Chief Systems Architect at Google Cloud, and the Incubation Committee (IC) Representative for the Server Project. He is currently the CXL Advisor to the Board at CXL Consortium and actively participates in OCP Steering Committee. His current focus is the system optimization for large-scale, mega-datacenters for general-purpose and tightly-connected, accelerated machines built on co-designed hardware, software, security, and management. He continues to drive the architecture and productization of CXL-enabled solutions for AI/ML, HPC, and large memory-footprint Databases. In 2016, he joined OCP as a co-lead of Server Project where he drove open-sourced modular design concepts for integrated hardware/software solutions (OAI, DC-SCM, CMS, DC-MHS, and DC-Stack). His experiences as Chief Systems Architect at Google, Principal Architect at Microsoft Azure, Distinguished Technologist at HP, and Principal Member of Technical Staff at Compaq along with his contributions to industry collaborations such as EISA, PCI, InfiniBand, and CXL give Siamak a broad understanding of requirements and solutions for the Enterprise, Hyperscale, and Edge datacenters and industry-wide initiatives. Bio