Mitigating Common CloudStack Instance Deployment Failures
Jithin Raju
About Me
• Cloud Architect at ShapeBlue
• Involved with CloudStack and its forks since 2013.
• Citrix > Accelerite > ShapeBlue
Agenda
Common CloudStack Instance Deployment Failures
Q&A
Discussion
Insufficient Capacity
Cause:
• Capacity is fully utilized.
Mitigation:
• Add more capacity (compute/storage).
• Delete unused instances and volumes.
• Use resource limits.
• Monitor utilization.
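The "monitor utilization" step on this slide can be sketched as a simple threshold check. This is a minimal illustration, assuming capacity figures have already been collected into plain dictionaries; the field names (`name`, `used`, `total`) and the 80% threshold are hypothetical and are not taken from CloudStack.

```python
def over_threshold(capacities, threshold=0.80):
    """Return (name, utilization) pairs for resources at or above the threshold."""
    flagged = []
    for cap in capacities:
        if cap["total"] <= 0:
            continue  # skip resources with no capacity configured
        utilization = cap["used"] / cap["total"]
        if utilization >= threshold:
            flagged.append((cap["name"], round(utilization, 2)))
    return flagged


# Hypothetical sample data for illustration only.
sample = [
    {"name": "cpu", "used": 90_000, "total": 100_000},
    {"name": "memory", "used": 40, "total": 128},
    {"name": "primary_storage", "used": 850, "total": 1000},
]
print(over_threshold(sample))
```

Alerting before a resource reaches 100% leaves time to add capacity or delete unused instances and volumes.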
Insufficient Address Capacity
Cause:
• Public IPs are fully allocated.
Mitigation:
• Add a new public IP address range/subnet.
• Use resource limits.
• Plan capacity well.
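Capacity planning for public IPs comes down to counting how many addresses in a range are still free. The sketch below uses Python's standard `ipaddress` module; the function name and the sample addresses (taken from documentation-reserved ranges) are assumptions for illustration, not CloudStack output.

```python
import ipaddress


def free_ip_count(start, end, allocated):
    """Count addresses in the inclusive range [start, end] that are not allocated."""
    first = int(ipaddress.ip_address(start))
    last = int(ipaddress.ip_address(end))
    if last < first:
        raise ValueError("end address is lower than start address")
    total = last - first + 1
    # Only count allocations that actually fall inside this range.
    used = {ip for ip in allocated if first <= int(ipaddress.ip_address(ip)) <= last}
    return total - len(used)


remaining = free_ip_count(
    "203.0.113.10",
    "203.0.113.19",
    ["203.0.113.10", "203.0.113.11", "198.51.100.1"],
)
print(remaining)
```

When the remaining count falls below the number of networks expected soon, a new range can be added before deployments start to fail.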
Unable to allocate vnet
Cause:
• Guest VLAN range in the zone is fully utilized.
Mitigation:
• Extend the VLAN range.
• Use resource limits on networks.
• Plan capacity well.
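The same headroom check applies to guest VLAN IDs. This is a minimal sketch, assuming the range is written as a `start-end` string and the allocated IDs are known; both helper names are hypothetical.

```python
def parse_vlan_range(text):
    """Parse a 'start-end' string into an inclusive range of VLAN IDs."""
    start_text, end_text = text.split("-")
    start, end = int(start_text), int(end_text)
    if start > end:
        raise ValueError("range start is greater than range end")
    return range(start, end + 1)


def free_vlans(vlan_range, allocated):
    """Return the VLAN IDs in the range that are not yet allocated."""
    used = set(allocated)
    return [vlan for vlan in parse_vlan_range(vlan_range) if vlan not in used]


print(free_vlans("100-104", [100, 101, 103]))
```

An empty result is the condition under which a new network cannot be implemented and the range needs extending.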
Tag Mismatch
Causes:
• The host/storage tag is not configured correctly.
• Tagged hosts/storage are fully utilized.
Mitigation:
• Review the configuration.
• Add the tag to more hosts/storage.
• Avoid using tags if not required.
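The two causes on this slide call for different fixes, so it helps to tell them apart. The sketch below is a deliberately simplified model, not CloudStack's deployment planner: the `Host` class, its fields, and the `diagnose` function are hypothetical, and only RAM is considered.

```python
from dataclasses import dataclass, field


@dataclass
class Host:
    name: str
    free_ram_mb: int
    tags: frozenset = field(default_factory=frozenset)


def diagnose(hosts, required_tag, ram_mb):
    """Distinguish 'tag not present anywhere' from 'tagged hosts are full'."""
    tagged = [host for host in hosts if required_tag in host.tags]
    if not tagged:
        return "no host carries the tag"
    fitting = [host.name for host in tagged if host.free_ram_mb >= ram_mb]
    if not fitting:
        return "tagged hosts are full"
    return "ok: " + ", ".join(fitting)


# Hypothetical inventory for illustration only.
hosts = [
    Host("h1", 4096, frozenset({"ssd"})),
    Host("h2", 512, frozenset({"gpu"})),
]
print(diagnose(hosts, "gpu", 2048))
```

The first result points at a configuration review; the second points at adding the tag to more hosts or storage.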
Overconfidence with Over-provisioning
Cause:
• Higher used capacity compared to allocated capacity.
Mitigation:
• Use realistic values.
• Test thoroughly.
• Leave headroom.
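A short worked example shows why an optimistic over-provisioning factor runs out of road. The numbers, the function name, and the idea of a single average utilization figure are all assumptions for illustration.

```python
def overprovisioning_check(physical, factor, avg_utilization):
    """Compare expected real demand with the physical capacity.

    physical:        real capacity of the host (for example, GB of RAM)
    factor:          over-provisioning factor applied to that capacity
    avg_utilization: fraction of its allocation each instance really uses
    """
    allocatable = physical * factor
    expected_use = allocatable * avg_utilization
    return allocatable, expected_use, expected_use > physical


# 64 GB of RAM with a factor of 2 can be allocated as 128 GB.
# If instances really use 70% of what they are given, demand exceeds 64 GB.
allocatable, expected_use, contended = overprovisioning_check(64, 2.0, 0.7)
print(allocatable, expected_use, contended)
```

Measured utilization from testing gives a realistic value for the last argument, and a factor below the break-even point leaves headroom.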
Deployment Options
Cause:
• Conflicting choices.
Mitigation:
• Avoid using combinations which can’t be deployed.
• Reduce the choices.
No destination found for a deployment for VM instance
Causes:
• Hosts are disconnected.
• Lost vCenter access.
• No free capacity.
Mitigation:
• Ensure host/hardware health.
• Ensure free capacity.
• Set up hypervisor monitoring.
Capability Mismatch
Cause:
• Unable to find hosts with a suitable number of vCPUs, CPU MHz, or any other specification in the compute offering.
Mitigation:
• Review compute offerings.
• Revise the compute offerings along with hardware changes.
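A review of compute offerings after a hardware change can be sketched as a filter over host specifications. The dictionary keys and host names below are hypothetical; the 3500 MHz and 2000 MHz figures echo the example in the speaker notes.

```python
def hosts_supporting_offering(hosts, offering):
    """Return names of hosts whose core count and clock speed satisfy the offering."""
    return [
        host["name"]
        for host in hosts
        if host["cpu_cores"] >= offering["cpu_number"]
        and host["cpu_mhz"] >= offering["cpu_mhz"]
    ]


# Offering created when the hosts had 3500 MHz CPUs.
offering = {"cpu_number": 2, "cpu_mhz": 3500}
old_host = {"name": "old-host", "cpu_cores": 16, "cpu_mhz": 3500}
new_host = {"name": "new-host", "cpu_cores": 32, "cpu_mhz": 2000}

print(hosts_supporting_offering([old_host, new_host], offering))
print(hosts_supporting_offering([new_host], offering))
```

Once the old hosts are retired, the second call returns an empty list, which is the situation where the offering itself needs revising.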
Resource Limits
Cause:
• Resource limit on account or domain for instances, volumes, primary storage, public IPs, and networks.
Mitigation:
• Increase the limit.
• Free up resources.
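Because a limit at any level can block a deployment, it is useful to check all levels at once. This is a minimal sketch over hypothetical `(used, limit)` pairs; treating a negative limit as "unlimited" is an assumption of this example.

```python
def blocking_levels(usage_by_level, requested=1):
    """Return the levels whose limit would be exceeded by the request.

    usage_by_level maps a level name to a (used, limit) pair.
    A negative limit is treated as unlimited (assumption of this sketch).
    """
    return [
        level
        for level, (used, limit) in usage_by_level.items()
        if limit >= 0 and used + requested > limit
    ]


# Hypothetical instance counts for illustration only.
usage = {"account": (10, 10), "domain": (40, 100), "project": (3, -1)}
print(blocking_levels(usage))
```

The returned level shows where to increase the limit or free up resources.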
Null Pointer Exceptions
Causes:
• Manual DB changes.
• CloudStack bugs.
Mitigation:
• Avoid DB modifications.
• Apply fixes.
• Report/fix bugs.
Database Errors
Causes:
• DB server performance.
• JDBC errors.
• Errors executing statements.
• DB server filesystem filled up.
Mitigation:
• Monitor the server.
• Ensure good connectivity to the database.
• Use tested and supported MySQL versions.
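One of the causes above, a full filesystem on the DB server, is easy to watch for. The sketch below uses `shutil.disk_usage` from the Python standard library; the 85% threshold and the function names are assumptions, and the path to check depends on where the database stores its data in a given environment.

```python
import shutil


def percent_used(path):
    """Return the percentage of the filesystem holding `path` that is in use."""
    usage = shutil.disk_usage(path)
    if usage.total == 0:
        return 0.0
    return 100.0 * usage.used / usage.total


def needs_attention(path, threshold_percent=85.0):
    """True when the filesystem holding `path` is at or above the threshold."""
    return percent_used(path) >= threshold_percent


# "." is used here only so the example runs anywhere; in practice, pass the
# directory that holds the database files.
print(f"{percent_used('.'):.1f}% used")
```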
Instance Not Booting
Causes:
• Incorrect guest OS mapping.
• Unsupported controller type.
• Corrupted template/ISO.
Mitigation:
• Fix the guest OS mapping.
• Use supported controller types.
• Fix the template.
Timeouts
Causes:
• “Wait” timeouts.
• Job timeout.
Mitigation:
• Update timeouts to match the environment and use cases.
• Identify the sub-task causing the delay.
• Review underlying platform performance.
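Identifying the sub-task that causes the delay is a matter of comparing durations. This is a minimal sketch, assuming start and end times for each step have already been extracted (for example from logs) into tuples; the step names and timings are invented for illustration.

```python
def slowest_step(events):
    """Return (step_name, duration) for the longest-running step.

    events is a list of (step_name, start_seconds, end_seconds) tuples.
    """
    durations = {name: end - start for name, start, end in events}
    name = max(durations, key=durations.get)
    return name, durations[name]


# Hypothetical timeline of one deployment job.
timeline = [
    ("allocate ip", 0, 5),
    ("copy template", 5, 545),
    ("start vm", 545, 580),
]
print(slowest_step(timeline))
```

If one step dominates, reviewing the platform behind that step is usually more effective than raising the timeout.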
Issues with VR
Causes:
• Unresponsive VR.
• Filesystem filled up.
• Unreachable VR.
• Storage issues.
• CloudStack bugs.
Mitigation:
• Inspect the VR.
• Restart the network with clean-up.
• Fix/report the bug.
Q&A
DISCUSSION


Editor's Notes

  • #4: We are going to discuss 15 common instance deployment failure scenarios, their usual causes, and suggested mitigation steps.
  • #5: If you have used CloudStack, you have probably seen this error. Insufficient capacity is a generic error raised for many failures, including the obvious case where no capacity is available. Most of the time the capacity is fully utilized and the resolution is straightforward: add more compute or storage resources, depending on the situation. Deleting unused resources such as instances or volumes can also help. Another way to handle this situation is efficient use of resource limits at the account and domain levels.
  • #6: When you are out of public IPs and a new network requires one, instance deployment can fail. The usual fix is to add more public IPs. To avoid getting into this situation, use resource limits efficiently. If you know the use case and the expected public IP usage upfront, you can add capacity accordingly. If you pay attention to resource utilization, you should be able to provision new IPs before a shortage causes any failure.
  • #7: When you deploy an instance whose network still needs to be implemented, a new VLAN must be allocated. If the provisioned VLAN IDs are fully consumed, the instance deployment can fail. The typical fix is to extend the VLAN range. If you have used resource limits effectively, you will see this issue less often. If the VLAN capacity is planned well, you may not face it at all.
  • #8: The deployment planner is unable to find a host or storage that matches the tag and has enough capacity. We can solve this by reviewing the current host and storage tagging configuration and fixing it. We can avoid using tags where they are not required, or we can add the tags to more resources.
  • #9: Over-provisioning cannot magically give us more capacity than the infrastructure has. We need to use realistic over-provisioning values. Thorough testing should reveal suitable values. Also keep some headroom to avoid resource contention.
  • #10: We should avoid mutually exclusive deployment choices, such as selecting a dedicated affinity group together with an offering that requires a host or storage that is not available in the dedicated resource. We can avoid this by selecting only appropriate choices for instance deployment.
  • #11: You might see this instance deployment error when hosts are unavailable for any reason. It could be a hardware, network, or software issue. The error is also seen when there is no free capacity. You can avoid it by ensuring the health of the host and its hardware, making sure there is free capacity, and implementing hypervisor monitoring.
  • #12: I have seen this capability mismatch error during instance deployment after a server hardware upgrade or replacement. Say the older CPUs ran at 3500 MHz and the compute offerings were created based on them, so we used a CPU MHz value of 3500. If the new CPU runs at 2000 MHz, the instance deployment fails. It is always good practice to revise the compute offerings according to the hardware changes.
  • #13: Although I suggested resource limits as a solution a couple of times earlier, resource limits themselves can lead to instance deployment failure. If the instance resource limit is reached at the account, domain, or project level, the result is a failure. We do not have many options in this case: we can either increase the limit or free up resources.
  • #14: Java null pointer exceptions are another set of errors that can cause any operation to fail, including instance deployment. They mostly appear after incorrect manual DB changes, but bugs can cause them as well. If we avoid manual DB changes, we will most likely not be affected by this error. If it is a bug, we can find the fixed version and upgrade, report the bug, or even fix it ourselves if that’s an option.
  • #15: Any issue with the database can break the CloudStack installation, so it is essential to keep it free of errors. DB server performance issues, JDBC errors, and incorrect MySQL statements are the most common causes of failures such as instance deployment failure. We need to monitor the DB server to avoid service outages and ensure good network connectivity to it. We can also avoid some issues by using supported and tested MySQL versions.
  • #16: Sometimes the instance deployment job succeeds but the guest OS fails to boot. This can happen due to incorrect guest OS mappings, an unsupported guest OS type, an unsupported controller type, or a corrupted template or ISO. If you access the console, you should be able to tell what caused the issue. Depending on the cause, you can fix it by correcting the guest OS mapping, using the correct controller types, or fixing the template.
  • #17: Timeouts are another common failure scenario. We can avoid them by increasing the timeout. We can also identify the sub-tasks causing the delay and fix them. Most of the time the underlying platform is slow, so it is worth reviewing; fixing it solves the problem.
  • #18: Unless the network is configured, the instance won't be deployed. Any system issue with the VR can result in instance deployment failure. The VR may be unresponsive, its filesystem may have filled up, or it may be unreachable. There could be storage issues where the VR is hosted, or a bug causing the network configuration on the VR to fail. In these situations we should inspect the VR to identify what is causing the issue. Sometimes restarting the network with clean-up solves the issue. If a bug is the cause, it is best to report it or fix it.