SlideShare a Scribd company logo
APPLICATION MANAGEMENT IN
AWS
BEST PRACTICES
INTRODUCTION
BRAD ADAIR
▸ Director of Infrastructure Services at IQ Innovations, LLC.
▸ Have been working in IT for 12+ years in various areas ranging
from desktop support to system administration to management.
▸ AWS Certified Solutions Architect
▸ Have been working heavily in AWS for about 2.5 years.
▸ Email: brad@adair.tech
▸ Twitter: @bpadair
INTRODUCTION
APPLICATION MANAGEMENT IN AWS
▸ Public cloud in general, and AWS in particular are
changing the way that we think about infrastructure and
the way we manage the applications that run on that
infrastructure.
▸ Less permanence, more ephemeral and temporary.
▸ More purpose built and dedicated resources.
▸ Less “make it fit”
INTRODUCTION
FOUR DOMAINS
▸ Performance
▸ Security
▸ Reliability
▸ Scalability
PERFORMANCE
WHAT DO WE MEAN?
▸ What do we mean when we talk about performance?
▸ Getting as much power as possible?
▸ Getting just enough?
▸ What about growth?
PERFORMANCE
GENERAL GUIDANCE
▸ Use Trusted Advisor to find (somewhat) obvious
performance issues.
▸ Things like over-utilized instances, excessive security
group rules, and cache-hit ratio can be found here.
▸ Plan for performance to scale, not grow.
▸ Monitor, monitor, monitor.
PERFORMANCE
DATABASES
▸ Need special consideration.
▸ RDS, Dynamo, EC2 instance.
▸ If using EC2, use provisioned IOPS, and RAID-0 volumes.
▸ Do not put databases on EFS instances.
▸ Replication - yes/no - where?
PERFORMANCE
CASE-STUDY: IQ INNOVATIONS
▸ Two data centers and a public cloud provider.
▸ All Centos running on ESXi.
▸ MySQL database.
▸ Apache, Tomcat, Grails stack on app servers.
▸ 1 clients configuration: 8 servers dedicated to MySQL, 14 app servers, 1 NFS server, 2 utility
servers.
▸ Performance was terrible.
▸ Average app response time: ~600ms
▸ Average end-user response time: ~4s
▸ Constantly running out of memory and restarting
▸ Nowhere to grow
PERFORMANCE
CASE STUDY: IQ INNOVATIONS
▸ Moved to AWS. Eliminated the collocation space and other cloud provider.
▸ Still running MySQL and Centos.
▸ Databases moved to RDS. Application servers moved to EC2.
▸ Same client configuration: 6 RDS instances for databases, 4 app servers, 1 utility server,
EFS to replace SAN.
▸ Performance improved dramatically:
▸ App response time: ~80-100ms
▸ End-user response time: ~1-2s
▸ No more memory issues.
▸ Cost savings of about 50%.
SECURITY
HAVEN’T WE BEEN DOING THIS FOREVER?
▸ Yes, and a lot of existing knowledge still applies.
▸ You still need smart policies.
▸ Your application still needs to protect against common attack vectors.
▸ Some things to change with a move to AWS, however.
▸ You are no longer responsible for physical security.
▸ You are no longer responsible for hypervisor security or patching.
▸ Depending on the service you may not even be responsible for OS
security and patching.
SECURITY
BEST PRACTICES
▸ Trusted advisor. This is a recurring theme.
▸ Bastion hosts
▸ VPC
▸ Peering
▸ Security groups
▸ NACL
▸ COMMON SENSE!
SECURITY
COMMON MISTAKES
▸ Console access for everyone.
▸ Overly permissive policies.
▸ Lack of two factor authentication.
▸ Overly/Publicly exposed access keys.
▸ Access key rotation.
RELIABILITY
EASIER AND HARDER SIMULTANEOUSLY
▸ A lot of the work for reliability is done for you.
▸ It is a mistake to put too much trust in this.
▸ The tools are there, but you have to choose to use them.
▸ Architecture matters.
RELIABILITY
CRITICAL THINGS TO UNDERSTAND
▸ Availability zones
▸ Regions
▸ Difference between AZs and Regions and how they should
be used together.
▸ Replication of different services.
▸ Availability SLAs.
▸ S3 storage classes/levels
RELIABILITY
CASE STUDY: CONFIDENTIAL COMPANY
▸ Pre-AWS:
▸ Only in one data center due to cost.
▸ Had clients nationwide, but all resources were
centralized.
▸ Had to have 4 or more hours of downtime for
deployments
▸ Many SPoF including storage and network. Redundancy
was attempted but not done well.
RELIABILITY
CASE STUDY: CONFIDENTIAL COMPANY
▸ AWS Setup:
▸ Multiple VPCs spread across multiple regions to provide redundancy
and be close to customers.
▸ VPC peering to reduce single points of failure.
▸ MAZ RDS instances for databases.
▸ EFS for network based storage.
▸ Replication of databases across regions.
▸ IaC templates for VPCs to allow for rapid reproduction in other regions.
SCALABILITY
WHAT IS SCALABILITY
▸ Scalability is about more than simply adding more
resources in response to increased demand.
▸ Scalability needs to include both scaling up and scaling
down.
▸ Goal is to maximize user experience while minimizing cost.
SCALABILITY
DIFFERENT APPROACH
▸ Provision with small spikes in mind, but not growth.
▸ Scale to growth.
▸ Schedule scale downs and scale ups.
▸ Auto-scaling is your friend.
▸ Monitor, monitor, monitor. Don’t alert, alert, alert.
SCALABILITY
COMMON MISTAKES
▸ Over-provisioning.
▸ Reserving too quickly.
▸ Planning for vertical scaling as opposed to horizontal.
▸ Provisioning for growth instead of planning for it.
▸ Manual intervention.
▸ Under analysis of utilization.
QUESTIONS?

More Related Content

PPTX
Datacomm VMWare Hybrid Cloud
PPTX
AWS Cloud Disaster Recovery Plan Checklist - Are you ready?
PPTX
5 Takeaways from AWS re:Invent 2019
PDF
Paving The Way To The Hybrid Cloud
PDF
AWS Cost Optimisation Made Easy
PDF
Deploying in the Cloud: Why and How
PPTX
Make a Move to the Azure Cloud with SoftNAS
PPTX
On-Prem to All-In: How Versent Leads Successful AWS Migrations
Datacomm VMWare Hybrid Cloud
AWS Cloud Disaster Recovery Plan Checklist - Are you ready?
5 Takeaways from AWS re:Invent 2019
Paving The Way To The Hybrid Cloud
AWS Cost Optimisation Made Easy
Deploying in the Cloud: Why and How
Make a Move to the Azure Cloud with SoftNAS
On-Prem to All-In: How Versent Leads Successful AWS Migrations

Similar to Best Practices for Application Management in AWS (17)

PDF
Cloud Bursting: Leveraging the Cloud to Maintain App Performance during Peak ...
PPTX
AWS Elastic Disaster Recovery (AWS DRS) First Call Deck.pptx
PPT
Cloud computing
PDF
A real-life account of moving 100% to a public cloud
PDF
Best of re:Invent 2016 meetup presentation
PDF
Construindo Aplicacoes Web e Mobile Escalaveis na AWS
PDF
Migrating to AWS
PPTX
Cloud1 Computing 01
PPTX
Amx202 l Building Your CA Service Management Solution on AWS
PPTX
vBrownBag AWS Certified SysOps : Associate Domain 4
PPT
Cloud computing What Why How
PPTX
Introduction to cloud computing
PDF
Getting Started with AWS | AWS Tutorial for Beginners | AWS Training | Edureka
PDF
AWS SysOps Administrator Training | AWS SysOps Tutorial | Edureka
PDF
Migratory Workloads Across Clouds with Nomad
PPTX
Migrate Existing Applications to AWS without Re-engineering
PPTX
Should you keep your On-Premises NAS: Upgrade, Pay Maintenance or Public Cloud?
Cloud Bursting: Leveraging the Cloud to Maintain App Performance during Peak ...
AWS Elastic Disaster Recovery (AWS DRS) First Call Deck.pptx
Cloud computing
A real-life account of moving 100% to a public cloud
Best of re:Invent 2016 meetup presentation
Construindo Aplicacoes Web e Mobile Escalaveis na AWS
Migrating to AWS
Cloud1 Computing 01
Amx202 l Building Your CA Service Management Solution on AWS
vBrownBag AWS Certified SysOps : Associate Domain 4
Cloud computing What Why How
Introduction to cloud computing
Getting Started with AWS | AWS Tutorial for Beginners | AWS Training | Edureka
AWS SysOps Administrator Training | AWS SysOps Tutorial | Edureka
Migratory Workloads Across Clouds with Nomad
Migrate Existing Applications to AWS without Re-engineering
Should you keep your On-Premises NAS: Upgrade, Pay Maintenance or Public Cloud?
Ad

Recently uploaded (20)

PPTX
Big Data Technologies - Introduction.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
Machine Learning_overview_presentation.pptx
PDF
Machine learning based COVID-19 study performance prediction
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
A Presentation on Artificial Intelligence
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Big Data Technologies - Introduction.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
The AUB Centre for AI in Media Proposal.docx
Machine Learning_overview_presentation.pptx
Machine learning based COVID-19 study performance prediction
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Review of recent advances in non-invasive hemoglobin estimation
Programs and apps: productivity, graphics, security and other tools
The Rise and Fall of 3GPP – Time for a Sabbatical?
Agricultural_Statistics_at_a_Glance_2022_0.pdf
A Presentation on Artificial Intelligence
20250228 LYD VKU AI Blended-Learning.pptx
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Chapter 3 Spatial Domain Image Processing.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Ad

Best Practices for Application Management in AWS

  • 2. INTRODUCTION BRAD ADAIR ▸ Director of Infrastructure Services at IQ Innovations, LLC. ▸ Have been working in IT for 12+ years in various areas ranging from desktop support to system administration to management. ▸ AWS Certified Solutions Architect ▸ Have been working heavily in AWS for about 2.5 years. ▸ Email: brad@adair.tech ▸ Twitter: @bpadair
  • 3. INTRODUCTION APPLICATION MANAGEMENT IN AWS ▸ Public cloud in general, and AWS in particular are changing the way that we think about infrastructure and the way we manage the applications that run on that infrastructure. ▸ Less permanence, more ephemeral and temporary. ▸ More purpose built and dedicated resources. ▸ Less “make it fit”
  • 4. INTRODUCTION FOUR DOMAINS ▸ Performance ▸ Security ▸ Reliability ▸ Scalability
  • 5. PERFORMANCE WHAT DO WE MEAN? ▸ What do we mean when we talk about performance? ▸ Getting as much power as possible? ▸ Getting just enough? ▸ What about growth?
  • 6. PERFORMANCE GENERAL GUIDANCE ▸ Use Trusted Advisor to find (somewhat) obvious performance issues. ▸ Things like over-utilized instances, excessive security group rules, and cache-hit ratio can be found here. ▸ Plan for performance to scale, not grow. ▸ Monitor, monitor, monitor.
  • 7. PERFORMANCE DATABASES ▸ Need special consideration. ▸ RDS, Dynamo, EC2 instance. ▸ If using EC2, use provisioned IOPS, and RAID-0 volumes. ▸ Do not put databases on EFS instances. ▸ Replication - yes/no - where?
  • 8. PERFORMANCE CASE-STUDY: IQ INNOVATIONS ▸ Two data centers and a public cloud provider. ▸ All Centos running on ESXi. ▸ MySQL database. ▸ Apache, Tomcat, Grails stack on app servers. ▸ 1 clients configuration: 8 servers dedicated to MySQL, 14 app servers, 1 NFS server, 2 utility servers. ▸ Performance was terrible. ▸ Average app response time: ~600ms ▸ Average end-user response time: ~4s ▸ Constantly running out of memory and restarting ▸ Nowhere to grow
  • 9. PERFORMANCE CASE STUDY: IQ INNOVATIONS ▸ Moved to AWS. Eliminated the collocation space and other cloud provider. ▸ Still running MySQL and Centos. ▸ Databases moved to RDS. Application servers moved to EC2. ▸ Same client configuration: 6 RDS instances for databases, 4 app servers, 1 utility server, EFS to replace SAN. ▸ Performance improved dramatically: ▸ App response time: ~80-100ms ▸ End-user response time: ~1-2s ▸ No more memory issues. ▸ Cost savings of about 50%.
  • 10. SECURITY HAVEN’T WE BEEN DOING THIS FOREVER? ▸ Yes, and a lot of existing knowledge still applies. ▸ You still need smart policies. ▸ Your application still needs to protect against common attack vectors. ▸ Some things to change with a move to AWS, however. ▸ You are no longer responsible for physical security. ▸ You are no longer responsible for hypervisor security or patching. ▸ Depending on the service you may not even be responsible for OS security and patching.
  • 11. SECURITY BEST PRACTICES ▸ Trusted advisor. This is a recurring theme. ▸ Bastion hosts ▸ VPC ▸ Peering ▸ Security groups ▸ NACL ▸ COMMON SENSE!
  • 12. SECURITY COMMON MISTAKES ▸ Console access for everyone. ▸ Overly permissive policies. ▸ Lack of two factor authentication. ▸ Overly/Publicly exposed access keys. ▸ Access key rotation.
  • 13. RELIABILITY EASIER AND HARDER SIMULTANEOUSLY ▸ A lot of the work for reliability is done for you. ▸ It is a mistake to put too much trust in this. ▸ The tools are there, but you have to choose to use them. ▸ Architecture matters.
  • 14. RELIABILITY CRITICAL THINGS TO UNDERSTAND ▸ Availability zones ▸ Regions ▸ Difference between AZs and Regions and how they should be used together. ▸ Replication of different services. ▸ Availability SLAs. ▸ S3 storage classes/levels
  • 15. RELIABILITY CASE STUDY: CONFIDENTIAL COMPANY ▸ Pre-AWS: ▸ Only in one data center due to cost. ▸ Had clients nationwide, but all resources were centralized. ▸ Had to have 4 or more hours of downtime for deployments ▸ Many SPoF including storage and network. Redundancy was attempted but not done well.
  • 16. RELIABILITY CASE STUDY: CONFIDENTIAL COMPANY ▸ AWS Setup: ▸ Multiple VPCs spread across multiple regions to provide redundancy and be close to customers. ▸ VPC peering to reduce single points of failure. ▸ MAZ RDS instances for databases. ▸ EFS for network based storage. ▸ Replication of databases across regions. ▸ IaC templates for VPCs to allow for rapid reproduction in other regions.
  • 17. SCALABILITY WHAT IS SCALABILITY ▸ Scalability is about more than simply adding more resources in response to increased demand. ▸ Scalability needs to include both scaling up and scaling down. ▸ Goal is to maximize user experience while minimizing cost.
  • 18. SCALABILITY DIFFERENT APPROACH ▸ Provision with small spikes in mind, but not growth. ▸ Scale to growth. ▸ Schedule scale downs and scale ups. ▸ Auto-scaling is your friend. ▸ Monitor, monitor, monitor. Don’t alert, alert, alert.
  • 19. SCALABILITY COMMON MISTAKES ▸ Over-provisioning. ▸ Reserving too quickly. ▸ Planning for vertical scaling as opposed to horizontal. ▸ Provisioning for growth instead of planning for it. ▸ Manual intervention. ▸ Under analysis of utilization.