HPC to OpenStack: Our journey
Tuesday 16th February 2016
Outline
● Key Questions
● Provisioning Systems
● Configuration Management
● Futures
● Q&A
How do we...?
● Install bare metal
● Configure networking and the base OS
● Install software
● Manage the configuration
● Make it redundant
Provisioning Systems
● Which one to choose
– Foreman
– Cobbler
– Oscar
– Rocks
– xCAT
– PCM
– Bright
– Others...
What is xCAT
● eXtreme Cluster/Cloud Administration Toolkit
● Management of clusters
● Bare-metal provisioning
● Scriptable
● Large-scale management (lights-out, remote console, distributed shell)
● Configures key services based on tables
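The "configures key services based on tables" idea can be pictured with a small sketch: one table of node rows, and several service configs rendered from it. This is not xCAT code or its real schema; the column names, node names and addresses below are assumptions made up for illustration.

```python
# Illustrative only: a tiny node "table". Column names and values are
# assumptions, not xCAT's actual table layout.
NODE_TABLE = [
    {"node": "node01", "mac": "52:54:00:aa:bb:01", "ip": "10.0.0.11"},
    {"node": "node02", "mac": "52:54:00:aa:bb:02", "ip": "10.0.0.12"},
]


def render_dhcp_hosts(table):
    """Render one ISC-dhcpd-style host stanza per table row."""
    template = "host {node} {{ hardware ethernet {mac}; fixed-address {ip}; }}"
    return "\n".join(template.format(**row) for row in table)


def render_etc_hosts(table):
    """Render /etc/hosts-style lines from the same rows."""
    return "\n".join("{ip} {node}".format(**row) for row in table)


if __name__ == "__main__":
    print(render_dhcp_hosts(NODE_TABLE))
    print(render_etc_hosts(NODE_TABLE))
```

The point of the design is that the table is the single source of truth: adding a row changes every generated service config at once.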
Why we're using xCAT
● xCAT provides bare-metal management and provisioning of hardware
● Automatic discovery
● EPL licensing
– Adaptable
– No licensing cost
● Close relationship with developers
● Hardware management
– Lights-out management
– Remote console/Serial over LAN
– Remote inventory, vitals, power
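As a rough illustration of the hardware-management items above, the sketch below maps each task to the xCAT command usually used for it (rpower, rcons, rinv, rvitals) and builds the argument lists without running anything. The node range "compute" and the node name "node01" are hypothetical, and the helper function is not part of xCAT.

```python
def hardware_commands(noderange, console_node):
    """Return xCAT command lines for the common hardware-management tasks.

    noderange and console_node are placeholders supplied by the caller;
    nothing is executed here.
    """
    return {
        "power status": ["rpower", noderange, "stat"],
        "remote console": ["rcons", console_node],
        "inventory": ["rinv", noderange, "all"],
        "vitals": ["rvitals", noderange, "all"],
    }


if __name__ == "__main__":
    # "compute" and "node01" are made-up names for illustration.
    for task, argv in hardware_commands("compute", "node01").items():
        print("%-15s %s" % (task, " ".join(argv)))
```

On a management node these lists could be passed to `subprocess.run`; keeping them as data first makes them easy to log or review.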
Pros and Cons
● Pros
– Flexible
– Open source
– Hardware support
– OS support
● Cons
– No GUI
– No configuration management
– Very steep learning curve
– Only maintained by IBM/Lenovo
How is this relevant to OpenStack
● The installation process is very similar
● The number of nodes is very similar
● We still need to provision nodes from bare metal
● xCAT has an Ironic driver :)
Why now?
● KVM performance
● SR-IOV support
● VXLAN offloading
Why OpenStack for HPC
● Users want flexibility
● Users have a choice
● Users don't depend on administrators as much
● Not all applications are available on the system
● Sequential jobs
● Embarrassingly parallel jobs
Performance
● MVAPICH2-virt is approx. 1-4% slower than native (source: Mellanox booth, SC15)
● With SR-IOV, 8% slower than native IB (source: SDSC), compared to 28% two years ago on KVM
http://mvapich.cse.ohio-state.edu/static/media/talks/slide/dk_mellanox_theatre_sc15_1.pdf
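To put the percentages above in concrete terms, here is a small worked calculation. It assumes "N% slower" means the run time grows by N% relative to native, and it uses a hypothetical 100-minute native job; neither assumption comes from the cited sources.

```python
def slowed_runtime(native_minutes, percent_slower):
    """Run time of a job that is percent_slower % slower than native."""
    return native_minutes * (100 + percent_slower) / 100


# Hypothetical native job length, chosen only to make the numbers readable.
NATIVE_MINUTES = 100.0

# Percentages quoted on the slide.
FIGURES = [
    ("MVAPICH2-virt (low end)", 1),
    ("MVAPICH2-virt (high end)", 4),
    ("SR-IOV vs native IB", 8),
    ("KVM two years earlier", 28),
]

if __name__ == "__main__":
    for label, percent in FIGURES:
        runtime = slowed_runtime(NATIVE_MINUTES, percent)
        print("%-26s %6.1f min" % (label, runtime))
```

Under those assumptions, the gap between native and virtualised runs shrinks from 28 extra minutes to 8 or fewer on the same 100-minute job.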
Our Dilemma
● OpenStack is so new
● Not many people know it
● Where do we start?
Our Choices
● Initially (Havana to Icehouse)
– CentOS
– RDO
– Packstack
● Middle (Icehouse to Juno)
– CentOS
– RDO
– Crudini (openstack-config)
● Then (Juno to Liberty)
– CentOS
– RDO
– SaltStack
Why?
● CentOS
– Because a lot of our customers used it
– Compatible with GPFS
● RDO
– Natural choice after choosing CentOS
– Easy way of getting all packages installed via the package manager
● SaltStack
– Introduced to us by a customer
How – Strike 1
● Packstack
● Great for a PoC environment
● Test any specific features
● Simple to get everything running
How – Strike 2
● Crudini (openstack-config)
● A lot of work required to get things up and running
● Everything was scripted
● Doesn't work with multiple values for any option
● Very difficult to maintain and version control
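The multiple-values problem is easier to see with a repeated key in an INI file. The sketch below uses Python's standard `configparser` as a stand-in for a key=value editing tool; it is not crudini and does not claim to reproduce its behaviour. The section and option names are made up for illustration.

```python
import configparser

# Hypothetical config fragment in which one option is given twice.
SAMPLE = """\
[example]
example_driver = first
example_driver = second
"""


def read_last_wins(text):
    """Parse leniently: a repeated option silently keeps only the last value."""
    parser = configparser.ConfigParser(strict=False)
    parser.read_string(text)
    return parser["example"]["example_driver"]


def strict_parse_fails(text):
    """Parse strictly (the default): a repeated option is rejected outright."""
    try:
        configparser.ConfigParser().read_string(text)
    except configparser.DuplicateOptionError:
        return True
    return False


def read_all_values(text, option):
    """Collect every value of a repeated option by scanning lines by hand."""
    values = []
    for line in text.splitlines():
        key, sep, value = line.partition("=")
        if sep and key.strip() == option:
            values.append(value.strip())
    return values


if __name__ == "__main__":
    print(read_last_wins(SAMPLE))
    print(strict_parse_fails(SAMPLE))
    print(read_all_values(SAMPLE, "example_driver"))
```

A tool that models a section as a simple key-to-value mapping has nowhere to keep the second value, which is one reason templated config files (as in the Salt approach) are easier to live with.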
How – Strike 3
● SaltStack
● Created many salt formulas specific to our requirements
● Moved all our configs
– OpenStack
– GPFS
– Icinga (Nagios)
– Ganglia
– 3rd-party applications
– HPC-specific formulas
– InfiniBand
Preparing the System
● Decide on the version of OpenStack
● Mirror the upstream repos locally using reposync
– EPEL
– RDO
– CentOS
● Sample yum.conf and repo files are at the link below
https://gitlab.arif-ali.co.uk/arif/openstack-lab
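The repo-sync step could be scripted roughly as follows. This sketch only builds the command lines; it does not run them. It assumes the yum-utils version of reposync (with `--config`, `--repoid` and `--download_path`), and the repo IDs, config path and destination directory are hypothetical, not taken from the linked repository.

```python
# Hypothetical repo IDs; the real IDs depend on the repo files in use.
REPO_IDS = ["epel", "openstack-liberty", "base"]


def reposync_command(repo_id, dest, config="/etc/yum.conf"):
    """Build a yum-utils reposync command line for one repository."""
    return [
        "reposync",
        "--config=" + config,
        "--repoid=" + repo_id,
        "--download_path=" + dest,
    ]


def all_commands(repo_ids, dest):
    """One reposync command per repository, in the order given."""
    return [reposync_command(repo_id, dest) for repo_id in repo_ids]


if __name__ == "__main__":
    # "/install/repos" is a made-up destination directory.
    for argv in all_commands(REPO_IDS, "/install/repos"):
        print(" ".join(argv))
```

After syncing, each mirrored directory still needs repository metadata (for example via `createrepo`) before the nodes can install from it.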
Preparing xCAT
● Install xCAT RPMs
● Configure xCAT tables
– nodes and their configurations
– networks
– switch mappings
– etc.
● Install nodes with the base OS
● Use salt to configure all nodes depending on their xCAT groups
https://gitlab.arif-ali.co.uk/arif/openstack-lab
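One way to picture "configure nodes depending on xCAT groups" is to turn a node-to-groups listing into a Salt top-file-style mapping that targets each group's members by list. This is a sketch under assumptions: the node names, group names and state names are invented, and the linked repository may do this quite differently.

```python
import json

# Hypothetical node -> comma-separated groups listing, in the spirit of
# an xCAT node list. All names are made up.
NODELIST = {
    "ctrl01": "all,controller",
    "comp01": "all,compute",
    "comp02": "all,compute",
}

# Hypothetical mapping from group name to the Salt states it should get.
GROUP_STATES = {
    "controller": ["openstack.controller"],
    "compute": ["openstack.compute"],
}


def build_top(nodelist, group_states):
    """Build a top-file-like dict: list-matched node targets -> states."""
    members = {}
    for node, groups in sorted(nodelist.items()):
        for group in groups.split(","):
            members.setdefault(group, []).append(node)

    targets = {}
    for group, states in group_states.items():
        nodes = members.get(group, [])
        if nodes:
            # "match: list" tells Salt the target is a comma-separated list.
            targets[",".join(nodes)] = [{"match": "list"}] + list(states)
    return {"base": targets}


if __name__ == "__main__":
    # JSON is printed here only to avoid a third-party YAML dependency.
    print(json.dumps(build_top(NODELIST, GROUP_STATES), indent=2))
```

Generating the targeting from the group listing keeps a single source of truth for which node plays which role.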
Future Development
● Update configs to use salt formulas
● Contribute to the salt-formula-* projects
– Add experience from the field
– Create spec files for the salt-formulas
● Keystone v3 API integration
– Federation support
● Use openstack-ansible?
Finally
● Use heat to deploy virtual HPC environments
● Create private networks
● Install custom scheduler and software
● Create as many as you want
https://gitlab.arif-ali.co.uk/arif/openstack-lab/tree/master/heat_templates
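A minimal sketch of what such a Heat template might contain is shown below, built as a Python dict and printed as JSON (which is also valid YAML). It is not one of the templates from the linked repository. The resource types are standard Heat ones; the cluster name, image, flavor and CIDR are hypothetical, and the scheduler and software installation step is left out.

```python
import json


def hpc_cluster_template(name, compute_count, image, flavor, cidr):
    """Build a HOT-style template: one private network plus N servers."""
    resources = {
        "private_net": {
            "type": "OS::Neutron::Net",
            "properties": {"name": name + "-net"},
        },
        "private_subnet": {
            "type": "OS::Neutron::Subnet",
            "properties": {
                "network": {"get_resource": "private_net"},
                "cidr": cidr,
            },
        },
    }
    for index in range(compute_count):
        resources["compute%02d" % index] = {
            "type": "OS::Nova::Server",
            # Wait for the subnet so the server can get an address.
            "depends_on": "private_subnet",
            "properties": {
                "name": "%s-compute%02d" % (name, index),
                "image": image,
                "flavor": flavor,
                "networks": [{"network": {"get_resource": "private_net"}}],
            },
        }
    return {
        "heat_template_version": "2015-10-15",
        "description": "Virtual HPC environment sketch: " + name,
        "resources": resources,
    }


if __name__ == "__main__":
    # All argument values below are placeholders.
    template = hpc_cluster_template(
        "demo", 2, "centos7", "m1.small", "192.168.10.0/24"
    )
    print(json.dumps(template, indent=2))
```

Because each environment is just a stack with its own private network, "create as many as you want" amounts to launching the same template again under a different stack name.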
Links
● https://gitlab.arif-ali.co.uk/arif/openstack-lab
● http://blog.arif-ali.co.uk/blog/categories/openstack
● https://gitlab.arif-ali.co.uk/arif/openstack-lab/wikis/home
Q & A

