SlideShare a Scribd company logo
1
IBM Systems Technical Events | ibm.com/training/events
© Copyright IBM Corporation 2017. Technical University/Symposia materials may not
be reproduced in whole or in part without the prior written permission of IBM.
Paulo Sergio Lemes Queiroz
Systems Consultant
IBM
PowerAI Deep Dive
2
IBM Systems Technical Events | ibm.com/training/events
© Copyright IBM Corporation 2017. Technical University/Symposia materials may not
be reproduced in whole or in part without the prior written permission of IBM.
• What is PowerAI
• PowerAI Components
 Hardware requirements
 CPU VS GPU
 Volta / Tensors
• Using PowerAI Components
• Extending PowerAI
Session objectives
3
IBM Systems Technical Events | ibm.com/training/events
© Copyright IBM Corporation 2017. Technical University/Symposia materials may not
be reproduced in whole or in part without the prior written permission of IBM.
What is PowerAI
• Set of support Libraries to
develop machine learning and
Deep learning applications
4
IBM Systems Technical Events | ibm.com/training/events
© Copyright IBM Corporation 2017. Technical University/Symposia materials may not
be reproduced in whole or in part without the prior written permission of IBM.
Hardware Requirements
Minimal requirements for accelerated Machine Learning is:
- Any Power Server with a Nvidia GPU
IBM Power System S824L (with NVIDIA technology)
IBM Power System S822LC for High Performance Computing
IBM Power System S822LC for Big Data
IBM Power System S821LC
Key start point is have the GPU for acceleration
Nvlink enable even further acceleration
Nvlink usage is transparent
5
IBM Systems Technical Events | ibm.com/training/events
© Copyright IBM Corporation 2017. Technical University/Symposia materials may not
be reproduced in whole or in part without the prior written permission of IBM.
CPU vs GPU
CPU is a generic core, wich has access to the system main memmory in order to do all kind of tasks
GPU is composed by thousands of specialized cores that handles mathematical operations
GPUs are specialized on Tensor / Matrix / Vector / Float point parallel operations
Sequential operations are slower on GPUS
6
IBM Systems Technical Events | ibm.com/training/events
© Copyright IBM Corporation 2017. Technical University/Symposia materials may not
be reproduced in whole or in part without the prior written permission of IBM.
GPU limitations
GPU doesn't has access to the system main memmory, all data must be copied to the GPU and copied
back when processing is done ( batch like operations )
Non SIMD ( Same Instruction Multiple Datapoints ) operations are not THAT fast
7
IBM Systems Technical Events | ibm.com/training/events
© Copyright IBM Corporation 2017. Technical University/Symposia materials may not
be reproduced in whole or in part without the prior written permission of IBM.
Volta / Tensors
Volta GPU will have dedicated Tensor function units, which will allow processing of Neural Networks to
run even faster
However this will require even bigger bandwidth ( and lower lattency ) between the main memory and
the GPU
8
IBM Systems Technical Events | ibm.com/training/events
© Copyright IBM Corporation 2017. Technical University/Symposia materials may not
be reproduced in whole or in part without the prior written permission of IBM.
Using PowerAI Components
PowerAI, by default, require Ubunto 16.04 and it's install files can be located at:
https://public.dhe.ibm.com/software/server/POWER/Linux/mldl/ubuntu/README.html
The following components come along at the free package:
caffe-bvlc - Berkeley Vision and Learning Center (BVLC) upstream Caffe, v1.0.0
caffe-ibm - IBM Optimized version of BVLC Caffe, v1.0.0
caffe-nv - NVIDIA fork of Caffe, v0.15.14
chainer - Chainer, v1.23.0
digits - DIGITS, v5.0.0
tensorflow - Google TensorFlow, v1.1.0
ddl-tensorflow - Distributed Deep Learning custom operator for TensorFlow
theano - Theano, v0.9.0
torch - Torch, v7
It's important to highlight that PowerAI isn't limited by these pre-compiled libraries or versions
As on any Ubuntu machine, the user can compile any software
9
IBM Systems Technical Events | ibm.com/training/events
© Copyright IBM Corporation 2017. Technical University/Symposia materials may not
be reproduced in whole or in part without the prior written permission of IBM.
Using PowerAI Components
All software components of PowerAI are installed at /opt/DL
root@pq-s824l-kvm:/opt/DL# ls -l
total 0
drwxr-xr-x 4 root root 29 Jun 23 16:08 bazel
lrwxrwxrwx 1 root root 23 Jun 23 16:39 caffe -> /etc/alternatives/caffe
drwxr-xr-x 13 root root 151 Jun 23 16:38 caffe-bvlc
drwxr-xr-x 13 root root 151 Jun 23 16:38 caffe-ibm
drwxr-xr-x 13 root root 151 Jun 23 16:38 caffe-nv
drwxr-xr-x 7 root root 68 Jun 23 16:38 chainer
drwxr-xr-x 4 root root 28 Aug 8 14:21 ddl
drwxr-xr-x 6 root root 55 Aug 8 14:21 ddl-tensorflow
drwxr-xr-x 8 root root 210 Aug 6 16:47 digits
drwxr-xr-x 7 root root 67 Jun 23 16:38 nccl
drwxr-xr-x 6 root root 54 Jun 23 16:38 openblas
drwxr-xr-x 3 root root 44 Aug 8 14:19 repo
drwxr-xr-x 5 root root 54 Jun 23 16:10 tensorflow
drwxr-xr-x 6 root root 54 Jun 23 16:38 theano
drwxr-xr-x 9 root root 94 Jun 23 16:38 torch
root@pq-s824l-kvm:/opt/DL# pwd
/opt/DL
root@pq-s824l-kvm:/opt/DL#
10
IBM Systems Technical Events | ibm.com/training/events
© Copyright IBM Corporation 2017. Technical University/Symposia materials may not
be reproduced in whole or in part without the prior written permission of IBM.
Using PowerAI Components
By default these packages are not in the system PATH, however to enable those to exist at the PATH a
few helper scripts can be used:
. /opt/DL/tensorflow/bin/tensorflow-activate
. /opt/DL/theano/bin/theano-activate
export PATH="${PATH}:/opt/DL/bazel/bin"
11
IBM Systems Technical Events | ibm.com/training/events
© Copyright IBM Corporation 2017. Technical University/Symposia materials may not
be reproduced in whole or in part without the prior written permission of IBM.
Tuning recommendation
Change the CPU govenor to performance, in order to avoid performance fluctuations
for i in ./devices/system/cpu/cpufreq/policy* ; do echo performance > $i/scaling_governor ; done
cpupower -c all frequency-set -g performance
Enable Persistent memory mode for the GPU:
nvidia-smi -pm ENABLED
22
IBM Systems Technical Events | ibm.com/training/events
© Copyright IBM Corporation 2017. Technical University/Symposia materials may not
be reproduced in whole or in part without the prior written permission of IBM.
Your feedback about this session is very important to us.
Submit a survey at:
ibmtechu.com
23
IBM Systems Technical Events | ibm.com/training/events
© Copyright IBM Corporation 2017. Technical University/Symposia materials may not
be reproduced in whole or in part without the prior written permission of IBM.
Continue the conversation
view event
highlights
talk to tech
experts
connect with
attendees
read training
articles
IBM Systems
Technical Events
LinkedIn
community
Join today
bit.ly/IBMTechUconnect
24
IBM Systems Technical Events | ibm.com/training/events
© Copyright IBM Corporation 2017. Technical University/Symposia materials may not
be reproduced in whole or in part without the prior written permission of IBM.
ibm.com/training
provides a comprehensive
portfolio of skills and career
accelerators that are designed
to meet all your training needs.
If you can’t find the training that is right for you
with our Global Training Providers, we can help.
Contact IBM Training at dpmc@us.ibm.com
Continue growing your IBM skills

More Related Content

PPTX
Introduction to PowerAI - The Enterprise AI Platform
PPT
Build FAST Deep Learning Apps with Docker on OpenPOWER and GPUs
PPTX
Optimizing Hortonworks Apache Spark machine learning workloads for contempora...
PDF
Transparent Hardware Acceleration for Deep Learning
PPTX
Innovation with ai at scale on the edge vt sept 2019 v0
PPTX
2018 bsc power9 and power ai
PDF
BSC LMS DDL
PDF
SNAP MACHINE LEARNING
Introduction to PowerAI - The Enterprise AI Platform
Build FAST Deep Learning Apps with Docker on OpenPOWER and GPUs
Optimizing Hortonworks Apache Spark machine learning workloads for contempora...
Transparent Hardware Acceleration for Deep Learning
Innovation with ai at scale on the edge vt sept 2019 v0
2018 bsc power9 and power ai
BSC LMS DDL
SNAP MACHINE LEARNING

What's hot (20)

PDF
TECHNICAL OVERVIEW NVIDIA DEEP LEARNING PLATFORM Giant Leaps in Performance ...
PDF
Intel's Machine Learning Strategy
PPTX
Deep Learning
PPTX
WML OpenPOWER presentation
PDF
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
PDF
CFD on Power
PPTX
A Primer on FPGAs - Field Programmable Gate Arrays
PDF
OpenPOWER/POWER9 AI webinar
PDF
OpenPOWER Foundation Overview
PDF
Distributed deep learning optimizations for Finance
PDF
OpenPOWER/POWER9 Webinar from MIT and IBM
PDF
Distributed deep learning optimizations - AI WithTheBest
PPTX
AWS & Intel Webinar Series - Accelerating AI Research
PDF
Tesla Accelerated Computing Platform
PDF
Amd ces tech day 2018 lisa su
PDF
AIDC NY: BODO AI Presentation - 09.19.2019
PDF
DDN: Protecting Your Data, Protecting Your Hardware
PDF
FPGA Hardware Accelerator for Machine Learning
PPTX
AI OpenPOWER Academia Discussion Group
PDF
MIT's experience on OpenPOWER/POWER 9 platform
TECHNICAL OVERVIEW NVIDIA DEEP LEARNING PLATFORM Giant Leaps in Performance ...
Intel's Machine Learning Strategy
Deep Learning
WML OpenPOWER presentation
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
CFD on Power
A Primer on FPGAs - Field Programmable Gate Arrays
OpenPOWER/POWER9 AI webinar
OpenPOWER Foundation Overview
Distributed deep learning optimizations for Finance
OpenPOWER/POWER9 Webinar from MIT and IBM
Distributed deep learning optimizations - AI WithTheBest
AWS & Intel Webinar Series - Accelerating AI Research
Tesla Accelerated Computing Platform
Amd ces tech day 2018 lisa su
AIDC NY: BODO AI Presentation - 09.19.2019
DDN: Protecting Your Data, Protecting Your Hardware
FPGA Hardware Accelerator for Machine Learning
AI OpenPOWER Academia Discussion Group
MIT's experience on OpenPOWER/POWER 9 platform
Ad

Similar to PowerAI Deep Dive ( key points ) (20)

PDF
Enabling POWER 8 advanced features on Linux
PDF
AIX Performance Tuning Session at STU2017
PPT
Visão geral do hardware do servidor System z e Linux on z - Concurso Mainframe
PPTX
PDF
"Relax and Recover", an Open Source mksysb for Linux on Power
PPTX
Octobus technical university def
PDF
S016576 managing-data-footprint-reduction-brazil-v1708f
PDF
S016394 pendulum-swings-melbourne-v1708d
PDF
S014068 pendulum-swings-orlando-v1705c
PDF
TCP/IP Stack Configuration with Configuration Assistant for IBM z/OS CS
PPTX
EnterpriseDB - IT innovation at the speed of business.
PPTX
GPGPU in Commercial Software: Lessons From Three Cycles of the Adobe Creative...
PDF
S104878 nvme-revolution-jburg-v1809b
PDF
[Café techno] - Ibm power7 - Les dernières annonces
PDF
Some experiences for porting application to Intel Xeon Phi
PPTX
FPGA MeetUp
PDF
S104874 toe-pool-jburg-v1809e
PDF
Best Practices and Performance Studies for High-Performance Computing Clusters
PDF
Red Hat for IBM System z IBM Enterprise2014 Las Vegas
PDF
OpenPOWER Seminar at IIT Madras
Enabling POWER 8 advanced features on Linux
AIX Performance Tuning Session at STU2017
Visão geral do hardware do servidor System z e Linux on z - Concurso Mainframe
"Relax and Recover", an Open Source mksysb for Linux on Power
Octobus technical university def
S016576 managing-data-footprint-reduction-brazil-v1708f
S016394 pendulum-swings-melbourne-v1708d
S014068 pendulum-swings-orlando-v1705c
TCP/IP Stack Configuration with Configuration Assistant for IBM z/OS CS
EnterpriseDB - IT innovation at the speed of business.
GPGPU in Commercial Software: Lessons From Three Cycles of the Adobe Creative...
S104878 nvme-revolution-jburg-v1809b
[Café techno] - Ibm power7 - Les dernières annonces
Some experiences for porting application to Intel Xeon Phi
FPGA MeetUp
S104874 toe-pool-jburg-v1809e
Best Practices and Performance Studies for High-Performance Computing Clusters
Red Hat for IBM System z IBM Enterprise2014 Las Vegas
OpenPOWER Seminar at IIT Madras
Ad

Recently uploaded (20)

PDF
Approach and Philosophy of On baking technology
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPT
Teaching material agriculture food technology
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Advanced Soft Computing BINUS July 2025.pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Modernizing your data center with Dell and AMD
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Approach and Philosophy of On baking technology
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Teaching material agriculture food technology
Network Security Unit 5.pdf for BCA BBA.
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
Advanced Soft Computing BINUS July 2025.pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
The AUB Centre for AI in Media Proposal.docx
The Rise and Fall of 3GPP – Time for a Sabbatical?
Chapter 3 Spatial Domain Image Processing.pdf
Mobile App Security Testing_ A Comprehensive Guide.pdf
Empathic Computing: Creating Shared Understanding
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Modernizing your data center with Dell and AMD
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...

PowerAI Deep Dive ( key points )

  • 1. 1 IBM Systems Technical Events | ibm.com/training/events © Copyright IBM Corporation 2017. Technical University/Symposia materials may not be reproduced in whole or in part without the prior written permission of IBM. Paulo Sergio Lemes Queiroz Systems Consultant IBM PowerAI Deep Dive
  • 2. 2 IBM Systems Technical Events | ibm.com/training/events © Copyright IBM Corporation 2017. Technical University/Symposia materials may not be reproduced in whole or in part without the prior written permission of IBM. • What is PowerAI • PowerAI Components  Hardware requirements  CPU VS GPU  Volta / Tensors • Using PowerAI Components • Extending PowerAI Session objectives
  • 3. 3 IBM Systems Technical Events | ibm.com/training/events © Copyright IBM Corporation 2017. Technical University/Symposia materials may not be reproduced in whole or in part without the prior written permission of IBM. What is PowerAI • Set of support Libraries to develop machine learning and Deep learning applications
  • 4. 4 IBM Systems Technical Events | ibm.com/training/events © Copyright IBM Corporation 2017. Technical University/Symposia materials may not be reproduced in whole or in part without the prior written permission of IBM. Hardware Requirements Minimal requirements for accelerated Machine Learning is: - Any Power Server with a Nvidia GPU IBM Power System S824L (with NVIDIA technology) IBM Power System S822LC for High Performance Computing IBM Power System S822LC for Big Data IBM Power System S821LC Key start point is have the GPU for acceleration Nvlink enable even further acceleration Nvlink usage is transparent
  • 5. 5 IBM Systems Technical Events | ibm.com/training/events © Copyright IBM Corporation 2017. Technical University/Symposia materials may not be reproduced in whole or in part without the prior written permission of IBM. CPU vs GPU CPU is a generic core, wich has access to the system main memmory in order to do all kind of tasks GPU is composed by thousands of specialized cores that handles mathematical operations GPUs are specialized on Tensor / Matrix / Vector / Float point parallel operations Sequential operations are slower on GPUS
  • 6. 6 IBM Systems Technical Events | ibm.com/training/events © Copyright IBM Corporation 2017. Technical University/Symposia materials may not be reproduced in whole or in part without the prior written permission of IBM. GPU limitations GPU doesn't has access to the system main memmory, all data must be copied to the GPU and copied back when processing is done ( batch like operations ) Non SIMD ( Same Instruction Multiple Datapoints ) operations are not THAT fast
  • 7. 7 IBM Systems Technical Events | ibm.com/training/events © Copyright IBM Corporation 2017. Technical University/Symposia materials may not be reproduced in whole or in part without the prior written permission of IBM. Volta / Tensors Volta GPU will have dedicated Tensor function units, which will allow processing of Neural Networks to run even faster However this will require even bigger bandwidth ( and lower lattency ) between the main memory and the GPU
  • 8. 8 IBM Systems Technical Events | ibm.com/training/events © Copyright IBM Corporation 2017. Technical University/Symposia materials may not be reproduced in whole or in part without the prior written permission of IBM. Using PowerAI Components PowerAI, by default, require Ubunto 16.04 and it's install files can be located at: https://public.dhe.ibm.com/software/server/POWER/Linux/mldl/ubuntu/README.html The following components come along at the free package: caffe-bvlc - Berkeley Vision and Learning Center (BVLC) upstream Caffe, v1.0.0 caffe-ibm - IBM Optimized version of BVLC Caffe, v1.0.0 caffe-nv - NVIDIA fork of Caffe, v0.15.14 chainer - Chainer, v1.23.0 digits - DIGITS, v5.0.0 tensorflow - Google TensorFlow, v1.1.0 ddl-tensorflow - Distributed Deep Learning custom operator for TensorFlow theano - Theano, v0.9.0 torch - Torch, v7 It's important to highlight that PowerAI isn't limited by these pre-compiled libraries or versions As on any Ubuntu machine, the user can compile any software
  • 9. 9 IBM Systems Technical Events | ibm.com/training/events © Copyright IBM Corporation 2017. Technical University/Symposia materials may not be reproduced in whole or in part without the prior written permission of IBM. Using PowerAI Components All software components of PowerAI are installed at /opt/DL root@pq-s824l-kvm:/opt/DL# ls -l total 0 drwxr-xr-x 4 root root 29 Jun 23 16:08 bazel lrwxrwxrwx 1 root root 23 Jun 23 16:39 caffe -> /etc/alternatives/caffe drwxr-xr-x 13 root root 151 Jun 23 16:38 caffe-bvlc drwxr-xr-x 13 root root 151 Jun 23 16:38 caffe-ibm drwxr-xr-x 13 root root 151 Jun 23 16:38 caffe-nv drwxr-xr-x 7 root root 68 Jun 23 16:38 chainer drwxr-xr-x 4 root root 28 Aug 8 14:21 ddl drwxr-xr-x 6 root root 55 Aug 8 14:21 ddl-tensorflow drwxr-xr-x 8 root root 210 Aug 6 16:47 digits drwxr-xr-x 7 root root 67 Jun 23 16:38 nccl drwxr-xr-x 6 root root 54 Jun 23 16:38 openblas drwxr-xr-x 3 root root 44 Aug 8 14:19 repo drwxr-xr-x 5 root root 54 Jun 23 16:10 tensorflow drwxr-xr-x 6 root root 54 Jun 23 16:38 theano drwxr-xr-x 9 root root 94 Jun 23 16:38 torch root@pq-s824l-kvm:/opt/DL# pwd /opt/DL root@pq-s824l-kvm:/opt/DL#
  • 10. 10 IBM Systems Technical Events | ibm.com/training/events © Copyright IBM Corporation 2017. Technical University/Symposia materials may not be reproduced in whole or in part without the prior written permission of IBM. Using PowerAI Components By default these packages are not in the system PATH, however to enable those to exist at the PATH a few helper scripts can be used: . /opt/DL/tensorflow/bin/tensorflow-activate . /opt/DL/theano/bin/theano-activate export PATH="${PATH}:/opt/DL/bazel/bin"
  • 11. 11 IBM Systems Technical Events | ibm.com/training/events © Copyright IBM Corporation 2017. Technical University/Symposia materials may not be reproduced in whole or in part without the prior written permission of IBM. Tuning recommendation Change the CPU govenor to performance, in order to avoid performance fluctuations for i in ./devices/system/cpu/cpufreq/policy* ; do echo performance > $i/scaling_governor ; done cpupower -c all frequency-set -g performance Enable Persistent memory mode for the GPU: nvidia-smi -pm ENABLED
  • 12. 22 IBM Systems Technical Events | ibm.com/training/events © Copyright IBM Corporation 2017. Technical University/Symposia materials may not be reproduced in whole or in part without the prior written permission of IBM. Your feedback about this session is very important to us. Submit a survey at: ibmtechu.com
  • 13. 23 IBM Systems Technical Events | ibm.com/training/events © Copyright IBM Corporation 2017. Technical University/Symposia materials may not be reproduced in whole or in part without the prior written permission of IBM. Continue the conversation view event highlights talk to tech experts connect with attendees read training articles IBM Systems Technical Events LinkedIn community Join today bit.ly/IBMTechUconnect
  • 14. 24 IBM Systems Technical Events | ibm.com/training/events © Copyright IBM Corporation 2017. Technical University/Symposia materials may not be reproduced in whole or in part without the prior written permission of IBM. ibm.com/training provides a comprehensive portfolio of skills and career accelerators that are designed to meet all your training needs. If you can’t find the training that is right for you with our Global Training Providers, we can help. Contact IBM Training at dpmc@us.ibm.com Continue growing your IBM skills