SlideShare a Scribd company logo
1
Running in Production
Thang Man
Lead Cloud Engineer
FimPlus Digital
@Barcamp Saigon 2016
Mission-critical
Business-critical
2
“Production” in our life and the IT world
Why OpenStack?
3
Our Demand Workloads
for Infrastructure
4
Solution Architecture Overview
* Juno → Liberty
* RHEL → Ubuntu
* Swift → CEPH
* Not used
- Trove
5
HA for OpenStack Controllers
● MySQL Galera Cluster
Active/Active
● HAProxy for Galera
Active/Backup
● Stateless services
HAProxy Active/Active
● RabbitMQ:
Built-in ha-queues
● Virtual IP
Pacemaker/Corosync
6
Networking: A Basic Block
● Aggregated throughput
● Fault tolerant
● Scalable
● Optimized for East-West
traffic patterns
● White-box networking
7
A Big Picture
8
Neutron vs. Nova-network
Legacy networking (Nova-network) OpenStack networking (Neutron)
Simple, single agent Complex, multiple agents
More matured, established Newer, maturing
Flat or VLAN Flat, VLAN, Overlays, L2-L3, SDN
No plug-in support Plug-in support for 3rd parties
Scales well Scaling requires 3rd party plug-ins
No multi-tier topologies Multi-tier topologies
9
CEPH Storage: Network Model
● Cluster should have more 2X-4X capacity than Public network 10
CEPH Integration
11
Rules of Thumb Sizing
● Ceph OSD
○ Disks
■ 8-10 SAS HDDs per 1x 10Gbps
■ ~12 SATA HDDs per 1x 10Gbps
■ 1x SSD for write journal per 4-6 OSD drives
■ JBOD (no RAID configuration)
○ RAM
■ 1GB of RAM per 1 TB of OSD storage space
○ CPU
■ 0.5 CPU core/1 Ghz of a core per OSD disk (1-2 CPU cores for SSD drives)
● Ceph Monitor (min 3 per cluster)
○ 1 GB of RAM per daemon
○ 1 ceph-mon node per 15-20 OSD nodes
12
Configuration
● Ceph Storage Cluster
○ Multi-pools:
SSD, SAS/SATA
○ Replicas of objects:
min 2
○ Data protection:
Replicated vs. EC
○ Memory allocator:
TCMalloc vs. JEMalloc
13
● Ceph Block Device (RBD)
○ rbd_cache
● Ceph Object Gateway (RGW)
○ rgw_cache_lru_size
○ rgw_thread_pool_size
○ rgw_override_bucket_index_max_shards
○ LB: LVS (direct routing) vs. HAProxy
○ Web server: Civetweb vs. Nginx vs.
Apache
Compute
● Specialized hardwares
○ GPU
○ SSD locality
○ Encryption keys in USB devices
● Scheduling on specific hosts (e.g. KVM, XenServer), hardwares
○ Host Aggregates
● Separating critical workloads from others
○ Availability Zones
14
Compute (cont.)
● High performance computing for VMs (*)
○ No overcommit (RAM, CPU)
○ CPU Pinning
○ NUMA Topology awareness
● High availability for VMs
○ Live/Cold Migration
○ Evacuation
○ resume_guests_state_on_host_boot=true
● Scalability for VMs
○ Resize
(*) http://www.slideshare.net/vietstack/vietopenstack-meetup-7th-high-performance-vm
15
Multi-site
16
Hybrid Cloud
17
Hybrid with
18
https://cloudplatform.googleblog.com/2016/04/OpenStack-users-backup-your-Cinder-volumes-to-Google-Cloud-Storage.html
Some Lessons Learned
19
Improvement Needing
● Instance
○ Resize online
○ Auto-scaling
○ HA Compute
● Volume
○ Discard support
○ Ceph-based volume migration
● Network
○ QoS for egress traffic
● Do upgrade
○ Ceph (Jewel)
○ OpenStack
20
21
Skilled people we need
● Linux Internals
● TCP/IP Networking
● Automation
● System Programming
● Performance Oriented
Image Reference
(1) https://images.google.com
(2) https://www.openstack.org
(3) http://ceph.com
(4) https://cumulusnetworks.com
(5) https://www.redhat.com
(6) https://www.mirantis.com
(7) https://www.arista.com
(8) http://www.sebastien-han.fr
22
Questions and
Thanks!
23
@mvthang
manvanthang@gmail.com

More Related Content

PDF
2019.06.27 Intro to Ceph
PDF
Performance optimization for all flash based on aarch64 v2.0
PDF
20171101 taco scargo luminous is out, what's in it for you
PDF
Ceph Block Devices: A Deep Dive
PDF
Ceph Tech Talk: Ceph at DigitalOcean
PDF
Boosting I/O Performance with KVM io_uring
PDF
Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013
PDF
Using Ceph in OStack.de - Ceph Day Frankfurt
2019.06.27 Intro to Ceph
Performance optimization for all flash based on aarch64 v2.0
20171101 taco scargo luminous is out, what's in it for you
Ceph Block Devices: A Deep Dive
Ceph Tech Talk: Ceph at DigitalOcean
Boosting I/O Performance with KVM io_uring
Integrating gluster fs,_qemu_and_ovirt-vijay_bellur-linuxcon_eu_2013
Using Ceph in OStack.de - Ceph Day Frankfurt

What's hot (20)

PDF
Ceph Month 2021: RADOS Update
PDF
Ceph Research at UCSC
PDF
XSKY - ceph luminous update
PDF
Cncf meetup-rook
PDF
Cncf meetup-rook
PDF
Gluster as Block Store in Containers
PDF
Making distributed storage easy: usability in Ceph Luminous and beyond
ODP
Accessing gluster ufo_-_eco_willson
PDF
CEPH DAY BERLIN - WHAT'S NEW IN CEPH
PDF
BlueStore, A New Storage Backend for Ceph, One Year In
PDF
Gluster for sysadmins
PDF
Ceph BlueStore - новый тип хранилища в Ceph / Максим Воронцов, (Redsys)
ODP
oVirt DR Site to-site using ansible
PDF
State of the_gluster_-_lceu
PDF
What's new in Luminous and Beyond
PDF
Practical CephFS with nfs today using OpenStack Manila - Ceph Day Berlin - 12...
ODP
Gluster d thread_synchronization_using_urcu_lca2016
PDF
OSBConf 2015 | Scale out backups with bareos and gluster by niels de vos
ODP
Disaster Recovery in oVirt
PDF
Deploying pNFS over Distributed File Storage w/ Jiffin Tony Thottan and Niels...
Ceph Month 2021: RADOS Update
Ceph Research at UCSC
XSKY - ceph luminous update
Cncf meetup-rook
Cncf meetup-rook
Gluster as Block Store in Containers
Making distributed storage easy: usability in Ceph Luminous and beyond
Accessing gluster ufo_-_eco_willson
CEPH DAY BERLIN - WHAT'S NEW IN CEPH
BlueStore, A New Storage Backend for Ceph, One Year In
Gluster for sysadmins
Ceph BlueStore - новый тип хранилища в Ceph / Максим Воронцов, (Redsys)
oVirt DR Site to-site using ansible
State of the_gluster_-_lceu
What's new in Luminous and Beyond
Practical CephFS with nfs today using OpenStack Manila - Ceph Day Berlin - 12...
Gluster d thread_synchronization_using_urcu_lca2016
OSBConf 2015 | Scale out backups with bareos and gluster by niels de vos
Disaster Recovery in oVirt
Deploying pNFS over Distributed File Storage w/ Jiffin Tony Thottan and Niels...
Ad

Similar to Running OpenStack in Production - Barcamp Saigon 2016 (20)

PPTX
Ceph Deployment at Target: Customer Spotlight
PPTX
Ceph Deployment at Target: Customer Spotlight
PPTX
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
PPTX
Ceph Day Chicago - Ceph at work at Bloomberg
PPTX
New Ceph capabilities and Reference Architectures
PPTX
Software Defined Storage, Big Data and Ceph - What Is all the Fuss About?
PPTX
Ceph at Work in Bloomberg: Object Store, RBD and OpenStack
PDF
Linux Stammtisch Munich: Ceph - Overview, Experiences and Outlook
PDF
Sanger OpenStack presentation March 2017
PPTX
Ceph Day Melbourne - Walk Through a Software Defined Everything PoC
PDF
Peanut Butter and jelly: Mapping the deep Integration between Ceph and OpenStack
PDF
Ceph Day Berlin: Scaling an Academic Cloud
PDF
Ceph Day Berlin: Scaling an Academic Cloud
PDF
Openstack summit2013-hongkong
PDF
Red Hat Storage Day Boston - OpenStack + Ceph Storage
PDF
OpenStack Best Practices and Considerations - terasky tech day
PPTX
Walk Through a Software Defined Everything PoC
PDF
Red hat open stack and storage presentation
PDF
99.999% Available OpenStack Cloud - A Builder's Guide
PDF
Montreal OpenStack Q2 MeetUp - May 30th 2017
Ceph Deployment at Target: Customer Spotlight
Ceph Deployment at Target: Customer Spotlight
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
Ceph Day Chicago - Ceph at work at Bloomberg
New Ceph capabilities and Reference Architectures
Software Defined Storage, Big Data and Ceph - What Is all the Fuss About?
Ceph at Work in Bloomberg: Object Store, RBD and OpenStack
Linux Stammtisch Munich: Ceph - Overview, Experiences and Outlook
Sanger OpenStack presentation March 2017
Ceph Day Melbourne - Walk Through a Software Defined Everything PoC
Peanut Butter and jelly: Mapping the deep Integration between Ceph and OpenStack
Ceph Day Berlin: Scaling an Academic Cloud
Ceph Day Berlin: Scaling an Academic Cloud
Openstack summit2013-hongkong
Red Hat Storage Day Boston - OpenStack + Ceph Storage
OpenStack Best Practices and Considerations - terasky tech day
Walk Through a Software Defined Everything PoC
Red hat open stack and storage presentation
99.999% Available OpenStack Cloud - A Builder's Guide
Montreal OpenStack Q2 MeetUp - May 30th 2017
Ad

More from Thang Man (11)

PDF
OpenStack 101: Introduction & Technical Overview
PDF
Lesson 8 - Reviewing Basic Networking
PDF
Lesson 7 - Linux Shell Programming
PDF
Lesson 6 - Administering Linux System (2)
PDF
Lesson 5 - Administering Linux System (1)
PDF
Lesson 3 - Linux File System
PDF
Lesson 2 - Install Linux & Command Line Environment
PDF
Lesson 1 - Introduction to Open Source & Linux
PDF
Building Trusted Network
PDF
Overview of Linux
PDF
Firewall fundamentals
OpenStack 101: Introduction & Technical Overview
Lesson 8 - Reviewing Basic Networking
Lesson 7 - Linux Shell Programming
Lesson 6 - Administering Linux System (2)
Lesson 5 - Administering Linux System (1)
Lesson 3 - Linux File System
Lesson 2 - Install Linux & Command Line Environment
Lesson 1 - Introduction to Open Source & Linux
Building Trusted Network
Overview of Linux
Firewall fundamentals

Recently uploaded (20)

PDF
Empathic Computing: Creating Shared Understanding
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
NewMind AI Monthly Chronicles - July 2025
PPTX
Big Data Technologies - Introduction.pptx
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Modernizing your data center with Dell and AMD
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Approach and Philosophy of On baking technology
Empathic Computing: Creating Shared Understanding
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
NewMind AI Monthly Chronicles - July 2025
Big Data Technologies - Introduction.pptx
Understanding_Digital_Forensics_Presentation.pptx
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
The AUB Centre for AI in Media Proposal.docx
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Encapsulation_ Review paper, used for researhc scholars
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Modernizing your data center with Dell and AMD
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
“AI and Expert System Decision Support & Business Intelligence Systems”
Review of recent advances in non-invasive hemoglobin estimation
Unlocking AI with Model Context Protocol (MCP)
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Approach and Philosophy of On baking technology

Running OpenStack in Production - Barcamp Saigon 2016