AWS at Scale

Editor's Notes

  • #3: Let's start with our customer's life before AWS. There were 6 clusters in a hardware DC. Each cluster had its own master MySQL database, a set of slave DBs, a set of web apps, its own portal app, tools, etc.
    Drivers for change:
      • BF preparation was too expensive and complicated.
      • Creating a new cluster or recovering from a disaster meant following a sequence of documented manual actions (runbooks).
      • A few outages during 2008 pushed us to become more redundant and showed the need for a failover DC.
    AWS technologies: none
  • #4: Two AWS DCs plus the master on-premise hardware DC.
    AWS benefits:
      • Rolling deployment: take one DC out of the pool, deploy to it, then return it.
      • Fast and "unlimited" capacity scaling, up and down.
    Lessons learned:
      • AWS is flexible and scalable.
      • A lot of infrastructure changes need to be made.
      • All components need to be cloud-ready (built to fail).
      • We need to change and adapt to the cloud.
    Moving forward: the slaves routinely fell behind when we had to ingest lots of new data, sometimes by 10 hours or more. We needed to rethink our entire stack.
    AWS technologies: EC2, EIP
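The rolling deployment mentioned in note #4 (take a DC out of the pool, deploy, put it back) might look roughly like the sketch below. It is a toy model rather than the team's actual tooling: the `Pool` class, the `deploy` callback, and the DC names are all hypothetical.

```python
from typing import Callable, List


class Pool:
    """Hypothetical stand-in for a traffic pool (e.g. a DNS or balancer rotation)."""

    def __init__(self, members: List[str]) -> None:
        self.active = list(members)

    def remove(self, dc: str) -> None:
        self.active.remove(dc)

    def add(self, dc: str) -> None:
        self.active.append(dc)


def rolling_deploy(pool: Pool, deploy: Callable[[str], None]) -> List[str]:
    """Deploy to one DC at a time so the remaining DCs keep serving traffic."""
    order = list(pool.active)  # snapshot, because the pool changes while we loop
    for dc in order:
        if len(pool.active) < 2:
            raise RuntimeError("refusing to drain the last active DC")
        pool.remove(dc)  # stop sending traffic to this DC
        deploy(dc)       # if this raises, the DC stays out of rotation
        pool.add(dc)     # return the upgraded DC to the pool
    return order
```

For example, `rolling_deploy(Pool(["aws-1", "aws-2", "onprem"]), print)` visits each DC once while the other two stay in rotation.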
  • #5: Full platform and organizational re-architecture (Agile, DevOps, ...).
    Re-architecture drivers:
      • App and organizational changes.
      • The next level of flexibility, performance, and reliability.
      • Solving our multi-region replication problem.
      • Getting rid of our individual clusters.
    App/Org changes:
      • The monolithic Java app was broken up into a set of small services, each supported by a decentralized engineering team.
      • The teams were responsible for the entire service life cycle, from Development to QA to Operations.
      • Engineering adopted Agile as its development methodology; previously we were waterfall-driven.
      • Shorter release cycle: from a coordinated release every 8-12 weeks to once per week, with a coordinated release possible at any time.
    Tools changes:
      • Puppet adoption.
      • Zabbix/Nagios replaced by Datadog.
      • Distributed logging replaced by centralized logging.
    Data stack: for our DB system of record, we chose Cassandra for its multi-region replication abilities (DynamoDB did not have this feature) and cloud-native operational qualities. ElasticSearch replaced Solr for similar reasons.
    AWS technologies:
      • IaC: CloudFormation
      • 3 VPCs (dev/qa/prod), 3 regions with 3 AZs each
      • Public and private ELBs
      • AutoScaling
      • EBS (early adopter of big volumes)
      • MySQL RDS
      • Route53
      • SWF/SQS/SNS/SES
  • #6: ECS: a clustered Docker container orchestration service.
    Early ECS adoption:
      • Early ECS with an HAProxy balancer layer + Consul (service discovery) + Consul-template (dynamic balancing via HAProxy) + Registrator (service registration in Consul) + a custom deployment tool (based on Thor + AWS Ruby SDK).
      • Complex and hard to manage and troubleshoot.
      • Extra cost for the additional layers.
      • Missing features.
    Current ECS implementation:
      • ALB with host-based and URL-based target rules.
      • Clear and simple deployment process via YAML CFN templates.
      • New features such as Docker labels.
      • Still missing: multiple ELBs and service discovery.
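To make the "host-based and URL-based target rules" in note #6 concrete, here is a minimal sketch of how such rules could be evaluated: the lowest priority number wins, and a default target catches everything else. It is not the ALB implementation. The `Rule` type, target group names, and hostnames are hypothetical, and `fnmatchcase` only approximates ALB wildcard matching.

```python
from dataclasses import dataclass
from fnmatch import fnmatchcase
from typing import List, Optional


@dataclass(frozen=True)
class Rule:
    priority: int               # lower number is evaluated first
    target_group: str           # hypothetical target group name
    host: Optional[str] = None  # host pattern, e.g. "api.example.com"
    path: Optional[str] = None  # path pattern, e.g. "/images/*"


def route(rules: List[Rule], host: str, path: str, default: str) -> str:
    """Return the target group of the first rule whose conditions all match."""
    for rule in sorted(rules, key=lambda r: r.priority):
        # Hostnames are compared case-insensitively, paths case-sensitively.
        host_ok = rule.host is None or fnmatchcase(host.lower(), rule.host.lower())
        path_ok = rule.path is None or fnmatchcase(path, rule.path)
        if host_ok and path_ok:
            return rule.target_group
    return default
```

With rules for `api.example.com` + `/tokenize/*`, `api.example.com` alone, and `/images/*`, a request to `www.example.com/images/a.png` would land on the images target group and anything unmatched on the default.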
  • #7: Lambda + API Gateway + EFS + ECS example.
    Task: on request, run an ML TensorFlow image trainer for a particular image class.
      • AWS Batch does not support EFS, sadly.
      • API Gateway + Lambda can launch a task on the ECS cluster, but what if there are no free resources? In that case Lambda also increases the ASG size.
      • CloudWatch decreases the ASG size.
    Tokenizer ECS demo.
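The "launch a task, and grow the ASG if the cluster is full" idea from note #7 could be sketched as below. This is one possible shape for the Lambda logic, not the code from the demo. The `ecs` and `autoscaling` arguments are assumed to be boto3 clients (or test doubles with the same methods), the cluster, task definition, and ASG names are placeholders, and treating failure reasons that start with `RESOURCE:` as "no free capacity" is an assumption about how ECS reports it.

```python
from typing import Any, Dict


def launch_or_scale(ecs: Any, autoscaling: Any, cluster: str,
                    task_definition: str, asg_name: str) -> Dict[str, Any]:
    """Try to start one ECS task; if the cluster is full, grow the ASG by one."""
    result = ecs.run_task(cluster=cluster, taskDefinition=task_definition, count=1)
    if result.get("tasks"):
        return {"status": "started", "taskArn": result["tasks"][0]["taskArn"]}

    # Assumption: placement failures for lack of capacity carry a
    # reason such as "RESOURCE:MEMORY" or "RESOURCE:CPU".
    reasons = [f.get("reason", "") for f in result.get("failures", [])]
    if not any(r.startswith("RESOURCE:") for r in reasons):
        return {"status": "failed", "reasons": reasons}

    group = autoscaling.describe_auto_scaling_groups(
        AutoScalingGroupNames=[asg_name])["AutoScalingGroups"][0]
    desired = group["DesiredCapacity"]
    if desired >= group["MaxSize"]:
        return {"status": "at_capacity", "reasons": reasons}

    # Ask for one more instance; the caller retries the task once it joins.
    autoscaling.set_desired_capacity(
        AutoScalingGroupName=asg_name,
        DesiredCapacity=desired + 1,
        HonorCooldown=False,
    )
    return {"status": "scaling", "desired": desired + 1}
```

Passing the clients in as arguments keeps the function easy to exercise without AWS access. Scaling back down is left to CloudWatch alarms, as in the note.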
  • #8: Data Pipeline example.
    Flow: customer's actions -> S3 Parquet data -> EMR cluster -> Spark steps (joins, sorts, aggregations) -> S3 Parquet data -> Hive indexing -> ES
    DP features:
      • Preconditions
      • Intermediate actions (scripts)
      • AMI with frameworks
      • Spot instances
      • Flexible scaling
    Additional talk, cost saving:
      • Spot/Reserved instances
      • T2 instances
      • S3 policies + storage types
      • Resource inspection
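The "Spark steps (joins, sorts, aggregations)" stage in note #8 can be illustrated with a toy stand-in. The sketch below uses plain Python lists and dicts instead of Spark and Parquet so that it runs anywhere. The record fields (`customer_id`, `action`, `segment`) are hypothetical and not taken from the customer's data.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def spark_like_steps(actions: Iterable[dict],
                     customers: Dict[str, dict]) -> List[Tuple[str, int]]:
    """Join actions to customers, count actions per segment, sort by count."""
    # Join: attach each action to its customer's segment; drop unknown customers.
    joined = [
        (customers[a["customer_id"]]["segment"], a["action"])
        for a in actions
        if a["customer_id"] in customers
    ]
    # Aggregate: count actions per segment.
    counts: Dict[str, int] = defaultdict(int)
    for segment, _action in joined:
        counts[segment] += 1
    # Sort: busiest segment first, segment name as a tie-breaker.
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
```

In the real pipeline each of these steps would run as a Spark step on the EMR cluster, reading from and writing to S3.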