OpenPOWER and AI
Workshop
Ganesan Narayanasamy
IBM
Welcome to the AI and OpenPOWER Bootcamp
OpenPOWER & AI Workshop at BSC, Barcelona
By OpenPOWER Academia
18th and 19th June 2018
Day 1 is meant as an introduction for everyone interested in using AI.
Day 2 is meant to go deeper with those who have especially challenging projects.
Agenda
Day 1 - June 18th 2018
9:00 am to 9:30 am: Welcome and OpenPOWER ADG features
9:30 am to 10:15 am: Introduction to Power 9 and PowerAI
10:15 am to 10:30 am: Break
10:30 am to 11:15 am: Large Model Support and Distributed Deep Learning
11:15 am to 12:00 noon: Use Case Demonstration with PowerAI
12:00 noon to 1:00 pm: Lunch
1:00 pm to 1:45 pm: Mellanox Feature Updates
1:45 pm to 2:45 pm: CFD Simulation on Power
2:45 pm to 3:00 pm: Break
3:00 pm to 3:45 pm: Introduction to Snap Machine Learning
3:45 pm to 4:45 pm: Snap Machine Learning Demos, Q&A
4:45 pm to 5:00 pm: Wrap up and Q&A
Agenda
Day 2 - June 19th 2018
9:00 am to 9:30 am: Quick review of Day 1
9:30 am to 12:00 pm: Deep Learning Exercise II using Nimbix / other infrastructure; industry-specific use cases (LMS)
12:00 pm to 1:00 pm: Lunch
1:00 pm to 4:30 pm: Deep Learning Exercise II using Nimbix / other infrastructure; industry-specific use cases using P9 features (LMS and DDL)
Founding Members
in 2013
Ecosystem
Chip / SOC
I/O / Storage / Acceleration
Boards / Systems
Software
System / Integration
Implementation / HPC / Research
This is What A Revolution Looks Like © 2018 OpenPOWER Foundation
328+ Members
33 Countries
70+ ISVs
Active Membership From All Layers of the Stack
- 100k+ Linux applications running on Power
- 2,300 ISVs have written code on Linux
Partners Bring Systems to Market
- 150+ OpenPOWER Ready certified products
- 20+ systems manufacturers
- 40+ POWER-based systems shipping or in development
- 100+ collaborative innovations under way
AI OpenPOWER Academia Discussion Group
OpenPOWER in Action
What is CORAL?
The program through which Summit & Sierra are procured.
- Several DOE labs have strong supercomputing programs and facilities.
- To bring the next generation of leading supercomputers to these labs, DOE created CORAL (the Collaboration of Oak Ridge, Argonne, and Livermore) to jointly procure these systems, and in so doing, align strategy and resources across the DOE enterprise.
- Collaboration grouping of DOE labs was done based on common acquisition timings. Collaboration is a win-win for all parties.
“Summit” System, “Sierra” System
OpenPOWER Technologies: IBM POWER CPUs, NVIDIA Tesla GPUs, Mellanox EDR 100Gb/s InfiniBand
Paving The Road to Exascale Performance
Academic Membership
- Currently 100+ academic members in OPF
A*STAR; ASU; ASTRI; Moscow State University; Carnegie Mellon Univ.; CDAC;
Colorado School of Mines; CINECA; CFMS; Coimbatore Institute of Technology;
Dalian University of Technology; GSIC; Hartree Centre; ICM; IIIT Bangalore;
IIT Bombay; Indian Institute of Technology Roorkee; ICCS; INAF; FZ Jülich;
LSU; BSC; Nanyang Technological University; National University of Singapore;
NIT Mangalore; NIT Warangal; Northeastern University in China; ORNL; OSU; RICE;
Rome HPC Center; LLNL; SANDIA; SASTRA University; Seoul National University;
Shanghai Jiao Tong University; SICSR; TEES; Tohoku University; Tsinghua University;
University of Arkansas; SDSC; Unicamp; University of Central Florida;
University of Florida; University of Hawaii; University of Hyderabad;
University of Illinois; University of Michigan; University of Oregon;
University of Patras; University of Southern California; TACC; Waseda University;
IISc; Loyola; IIT Roorkee
Goals of the Academia Discussion Group
- Provide training and exchange of experience and know-how
- Provide platform for networking among academic members
- Work on engagement of HPC community
- Enable co-design/development activities
Conclusions
- A growing number of academic organizations have become members of the OpenPOWER Foundation.
- The Academia Discussion Group provides a platform for training, networking, engagement and enablement of co-design.
- Those who have not yet joined: you are welcome to join.
https://members.openpowerfoundation.org/wg/AcademiaDG/mail/index
- The OpenPOWER AI Virtual University focuses on bringing together industry, government and academic expertise to connect and help shape the future of AI.
https://www.youtube.com/channel/UCYLtbUp0AH0ZAv5mNut1Kcg
POWER9 Advantages (AC922)
1. CPU
- POWER9 NX gzip has potential for compressed workloads, reducing memory footprint and I/O bottlenecks in the pre-processing stage; it is not available today, but hopefully will be soon.
- The CPU has direct access to GPU memory without the need for migration; not explored today in the TF or Caffe parts of PowerAI.
- VSX3 can accelerate media processing and pre-processing for computer vision:
http://www.eecg.utoronto.ca/~moshovos/ACA06/readings/altivec.pdf
2. System Memory
- 8x DDR4 memory channels provide more memory bandwidth and help reduce memory contention in AI workloads.
- Managed memory is cache-coherent between CPU and GPU; not explored today in the TF or Caffe parts of PowerAI.
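The memory-footprint argument for compressed workloads can be illustrated in software. The sketch below uses Python's standard gzip module, not the POWER9 hardware accelerator; the function name compression_summary and the sample data are hypothetical and chosen only for illustration.

```python
import gzip


def compression_summary(raw: bytes, level: int = 6) -> dict:
    """Compress raw bytes with software gzip and report the size reduction."""
    compressed = gzip.compress(raw, compresslevel=level)
    restored = gzip.decompress(compressed)
    return {
        "raw_bytes": len(raw),
        "compressed_bytes": len(compressed),
        "ratio": len(raw) / len(compressed),
        "roundtrip_ok": restored == raw,
    }


# Hypothetical, highly repetitive CSV-like payload standing in for a dataset shard.
sample = b"label,pixel0,pixel1,pixel2\n" + b"7,0,0,255\n" * 10_000
summary = compression_summary(sample)
```

Real training data usually compresses far less than this repetitive sample, so the ratio here is not representative of any actual workload.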
3. GPU
- NVLink 2.0 to the CPU allows faster data movement from the CPU to the GPU when datasets are large, in the range of terabytes.
- GPUDirect RDMA to unified memory; probably not explored today in the TF or Caffe parts of PowerAI.
- Technologies such as LMS are the best fit for large models like deep residual networks (ResNet-152):
https://arxiv.org/pdf/1803.06333
4. InfiniBand
- MPI / DDL / Horovod have the potential to exploit this unique multi-host socket direct adapter and provide the lowest possible latency between many learners when training, which should lead to lower training times. Possible improvements in training efficiency over the existing research paper:
https://arxiv.org/pdf/1708.02188
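As a toy illustration of what learners exchange during data-parallel training, the sketch below averages per-learner gradients in pure Python. The function name allreduce_mean and the sample gradients are hypothetical; the code does not use MPI, DDL or Horovod, and it says nothing about their actual APIs or about InfiniBand latency.

```python
from typing import List


def allreduce_mean(per_learner_grads: List[List[float]]) -> List[float]:
    """Return the element-wise mean of the gradient vectors of all learners.

    In a real cluster every learner would receive this result over the
    network; here the exchange is simulated inside a single process.
    """
    if not per_learner_grads:
        raise ValueError("at least one learner is required")
    length = len(per_learner_grads[0])
    if any(len(grads) != length for grads in per_learner_grads):
        raise ValueError("all learners must hold gradients of the same length")
    learners = len(per_learner_grads)
    return [
        sum(grads[i] for grads in per_learner_grads) / learners
        for i in range(length)
    ]


# Two hypothetical learners, each holding gradients for two parameters.
averaged = allreduce_mean([[1.0, 2.0], [3.0, 4.0]])
```

Because this exchange happens at every training step, the latency between learners directly affects the total training time.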
5. I/O
- PCIe Gen4 offers NVMe adapters more bandwidth (13.5 GB/s vs 6.8 GB/s in PCIe Gen3) for caching datasets on compute nodes, closer to the GPUs; this helps considerably when pre-fetching data into system memory.
- OpenCAPI provides more bandwidth for other types of accelerators, such as FPGAs, giving the option of fast inference; possibly other kinds of DRAM in the future.
6. Others
- Water-cooled systems, available with 4 GPUs and 6 GPUs, make AI solutions much more efficient at scale, considering the 300 W per GPU power consumption.
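The figures quoted above lend themselves to a quick back-of-the-envelope calculation. The sketch below takes the 13.5 GB/s, 6.8 GB/s and 300 W values from the slide; the function names and any dataset size passed in are hypothetical, and the results are idealized peak numbers rather than measurements.

```python
GEN4_GB_PER_S = 13.5   # PCIe Gen4 figure quoted on the slide
GEN3_GB_PER_S = 6.8    # PCIe Gen3 figure quoted on the slide
WATTS_PER_GPU = 300    # per-GPU power consumption quoted on the slide


def transfer_seconds(dataset_gb: float, bandwidth_gb_per_s: float) -> float:
    """Idealized time to stream a dataset at a constant bandwidth."""
    if bandwidth_gb_per_s <= 0:
        raise ValueError("bandwidth must be positive")
    return dataset_gb / bandwidth_gb_per_s


def gpu_power_watts(gpu_count: int, watts_per_gpu: int = WATTS_PER_GPU) -> int:
    """Total GPU power draw for a node, ignoring CPUs, memory and cooling."""
    return gpu_count * watts_per_gpu


# Ratio of the two quoted bandwidths (roughly a factor of two).
gen4_speedup = GEN4_GB_PER_S / GEN3_GB_PER_S
```

For example, transfer_seconds(1000.0, GEN4_GB_PER_S) gives the idealized time to stage a hypothetical 1 TB dataset, and gpu_power_watts(4) and gpu_power_watts(6) give the GPU power budget of the two water-cooled configurations.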
THANK YOU!

