Cerebras Systems © 2020
Supercomputer-Scale AI with Cerebras Systems
A Hock
RIKEN R-CCS 2nd International Symposium
18 February 2020
AI has massive potential,
but is compute-limited today.
AI has massive potential
From advertising to autonomy,
Commercial applications to cancer research,
Manufacturing to modeling and simulation for basic science.
AI has massive potential to change the way we work and live.
...for Society 5.0 and beyond.
...but AI is compute-limited today
Researchers continue to make great progress with deeper models and more data.
But model training often takes days, weeks, or more.
This is expensive, constrains research, and slows innovation and time to market.
We need 1000x compute. And the challenge is growing.
AI has massive potential,
but is compute-limited today.
We need a new compute solution
to accelerate deep learning.
Enter Cerebras Systems
The right solution for AI compute
Many cores optimized for sparse linear algebra
Memory tightly coupled to compute
High bandwidth communication
Programmable with today’s ML frameworks
The Cerebras Wafer-Scale Engine
The world’s largest chip and most
powerful AI engine.
Designed from the ground up to deliver
orders-of-magnitude performance gains for
deep learning.
- 215 x 215 mm, 1.2 trillion transistor chip
- 400,000 cores
- 18 GB on-chip SRAM
- 100 Pb/s interconnect
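As a rough sanity check on the figures above, the short sketch below derives a few per-unit numbers from them. It assumes a square 215 x 215 mm die, decimal gigabytes, and an even split of SRAM across cores; none of these assumptions is stated on the slide, and the variable names are illustrative only.

```python
# Figures quoted on the slide.
DIE_SIDE_MM = 215
TRANSISTORS = 1.2e12
CORES = 400_000
SRAM_BYTES = 18e9  # assumption: 1 GB = 10**9 bytes

# Derived quantities (illustrative arithmetic, not vendor specifications).
die_area_mm2 = DIE_SIDE_MM * DIE_SIDE_MM
sram_per_core_bytes = SRAM_BYTES / CORES  # assumes SRAM is split evenly
transistors_per_mm2 = TRANSISTORS / die_area_mm2

print(f"die area:            {die_area_mm2:,} mm^2")
print(f"SRAM per core:       {sram_per_core_bytes / 1e3:.0f} kB")
print(f"transistors per mm2: {transistors_per_mm2 / 1e6:.1f} million")
```

Under these assumptions the die area comes to 46,225 mm^2 and each core has about 45 kB of local memory.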
Flexible cores optimized for tensor operations
Fully programmable compute core
Full array of general instructions with ML extensions
Flexible general ops for control processing
- e.g. arithmetic, logical, load/store, branch
Optimized tensor ops for data processing
- fmac [z] = [z], [w], a
  (operands, in order: 3D tensor, 3D tensor, 2D tensor, scalar)
Sparse compute engine for neural networks
- Dataflow-triggered computation
- Filters out zero data → skips unnecessary processing
- Higher performance and efficiency for sparse NN
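The zero-skipping idea can be illustrated in plain Python. The sketch below is not the WSE instruction set or any Cerebras API; `sparse_fmac` and its arguments are hypothetical names. It accumulates `a * w` into `z` for each incoming activation `a`, and does no work at all when `a` is zero, which is the sense in which sparse inputs save computation.

```python
def sparse_fmac(z, w, activations):
    """Accumulate a * w_row into z for every nonzero activation a.

    z           : list of floats, the accumulator (updated in place)
    w           : list of rows, one row of weights per activation
    activations : list of floats; zeros trigger no computation
    Returns the accumulator and the number of activations skipped.
    """
    skipped = 0
    for a, w_row in zip(activations, w):
        if a == 0.0:
            # Zero data is filtered out, so the multiply-accumulates
            # for this row are never issued.
            skipped += 1
            continue
        for j, w_ij in enumerate(w_row):
            z[j] += a * w_ij
    return z, skipped


z, skipped = sparse_fmac(
    z=[0.0, 0.0],
    w=[[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]],
    activations=[2.0, 0.0, 1.0],
)
print(z, skipped)
```

With one of the three activations equal to zero, a third of the multiply-accumulate work is skipped while the result is unchanged.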
AI-optimized memory architecture
Traditional memory designs not optimal
- Central shared memory is slow & far away
- Requires large batches to drive utilization
The right answer is high performance, on-chip memory
- All memory is fully distributed along with compute datapaths
- Datapath has full performance from memory
- Full utilization down to batch 1
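One way to see why distant memory pushes designs toward large batches is to count arithmetic per byte of weights fetched. The sketch below is a simplified model, not a measurement: it assumes a dense layer, counts one multiply and one add per weight per sample, ignores activation traffic, and assumes 2-byte weights by default. The function name is hypothetical.

```python
def weight_reuse_intensity(batch_size, bytes_per_weight=2):
    """FLOPs performed per byte of weights fetched for one dense layer.

    Each weight is used for one multiply and one add per sample, so a
    batch of B samples reuses every fetched weight B times.
    """
    return 2 * batch_size / bytes_per_weight


for batch in (1, 8, 64, 512):
    print(f"batch {batch:>3}: {weight_reuse_intensity(batch):.0f} FLOPs per weight byte")
```

At batch 1 each fetched weight byte supports only one FLOP in this model, so the datapath stays busy only if memory can supply weights at roughly the rate they are consumed. That is the case the slide makes for memory placed next to the compute datapath.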
The Cerebras Wafer-Scale Engine
Massive compute array with a configurable,
high-bandwidth 2D mesh → compile and
compute all layers simultaneously.
- Model parallelism on a single chip
- Cluster-scale performance,
programmable as a single node
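To make "all layers simultaneously" concrete, the sketch below divides a fixed budget of cores among the layers of a model in proportion to each layer's compute cost, so that every layer is resident on the chip at once. This is purely an illustration under assumed inputs: the slide does not describe how the Cerebras compiler places layers, and `allocate_cores`, the layer names, and the FLOP counts are all hypothetical.

```python
def allocate_cores(layer_flops, total_cores):
    """Split total_cores across layers in proportion to their FLOPs.

    layer_flops : dict mapping layer name -> integer FLOP count
    total_cores : integer number of cores available
    Returns a dict mapping layer name -> number of cores.
    """
    total_flops = sum(layer_flops.values())
    # Integer (floor) share for every layer.
    shares = {
        name: total_cores * flops // total_flops
        for name, flops in layer_flops.items()
    }
    # Flooring can leave a few cores unassigned; give them to the
    # most expensive layers, one each.
    leftover = total_cores - sum(shares.values())
    by_cost = sorted(layer_flops, key=layer_flops.get, reverse=True)
    for name in by_cost[:leftover]:
        shares[name] += 1
    return shares


# Hypothetical three-layer model on a 400,000-core array.
plan = allocate_cores({"conv1": 1, "conv2": 2, "fc": 1}, total_cores=400_000)
print(plan)
```

A proportional split is only one possible policy; it is used here because it keeps per-layer throughput roughly balanced when layers run as a pipeline.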
The challenge: how do we put this in your
hands and make it easy to use?
The Cerebras Software Platform
Our software stack makes the Wafer-Scale Engine easy to use:
→ Programmable with today’s ML frameworks
→ Library of high performance DL ops
→ Customizable and extensible for other applications with flexible lower level APIs
The Cerebras CS-1
The world’s most powerful AI computer
A full solution in a single system:
- Powered by the WSE
- Programmable via TensorFlow (TF) and other frameworks
- Installs and deploys easily into a standard rack
15 RU standard rack-compliant server
1.2 Tbps I/O via 12x100GbE
20 kW power, air-cooled
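The aggregate I/O figure follows directly from the link count. The short check below reproduces it and converts to bytes per second; it uses raw line rate only and ignores protocol overhead, which is an assumption of this sketch.

```python
LINKS = 12
GBITS_PER_LINK = 100  # 100 Gigabit Ethernet, raw line rate

total_gbits = LINKS * GBITS_PER_LINK   # aggregate gigabits per second
total_tbits = total_gbits / 1000       # terabits per second
total_gbytes = total_gbits / 8         # gigabytes per second

print(f"{total_tbits} Tb/s aggregate, about {total_gbytes:.0f} GB/s")
```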
The Cerebras CS-1
Packing the performance of a cluster
into a 15 RU chassis wasn’t easy.
It required systems-level thinking and new
invention and engineering in, e.g.:
- Packaging
- Power
- Cooling
- I/O
Let’s take a peek under the hood.
CS-1 System View
Concluding remarks
Proud to introduce RIKEN R-CCS to the Cerebras CS-1, the world’s most
powerful deep learning solution.
Built from the ground up to accelerate deep learning by orders of
magnitude and empower AI and HPC researchers to do more, faster.
The WSE, CS-1, and software are up and running real customer workloads
today, all the way from TensorFlow.
- Already accelerating AI at supercomputer scale for science and health at Argonne National Laboratory (ANL)!
- And more soon...
Call to action: bring us your big HPC & AI problems and your system and
partnership interests. We can’t wait to work together.
Thank you!
