SlideShare a Scribd company logo
Hope is Not a Strategy
June 3, 2021
Automating efficient resource utilization for SREs
Welcome! We’ll get started in a few minutes...
Today’s StormForge presenters
● Product mktg professional,
former developer
● 30 years industry experience,
including Dynatrace, Altiris,
EDS
● Based in Ann Arbor, MI
RICH BENTLEY
Sr. Director,
Product Marketing
● Sales Engineer, former IT
leader
● 24 years experience: Finserv,
Biotech, Juniper Networks,
Tintri
● Based in the Bay Area, CA
ERWIN DARIA
Principal Sales
Engineer
What does SRE stand for?
Simply Restart Everything
A
Senior Rebooting Engineer
B
Sleep Rarely Ever
C
Software Ruins Evening
D
Seriously Regretting Everything
E
Your job is incredibly difficult
Pressure Stakes are high
Uncharted waters Massive complexity
Cloud efficiency
trade-offs
Application
Performance
Time & Effort
Cost/Resource
Utilization
Achieving peak efficiency means...
Application
Performance Time & Effort
Cost/Resource
Utilization
...improving SLI performance
↑ Throughput
↑ Availability
↓ Latency
↓ Error rate
To consistently meet or
exceed SLOs...
...at the lowest
possible cost...
...with the least
amount of effort.
What is the impact of poor performance for your business?
Revenues
War rooms
Customer sat
Customer retention
Conversions
Slowed migration/
transformation
Brand/reputation
Delayed initiatives
Productivity Loss:
dev, ops, business users
Abandonment
Nearly half of cloud spend is wasted
Source: StormForge Cloud Waste Survey https://www.stormforge.io/survey-
report/cloud-waste-survey-findings/
Developer burnout is a big problem
Source: D2IQ, Kubernetes in the Enterprise: Uncovering Challenges & Opportunities
38% of developers and
architects claim their work
makes them feel
extremely burnt out
51% of developers and
architects say building
cloud native applications
makes them want to find a
new job
Why is it so hard?
10
APPLICATION
PERFORMANCE
COST/RESOURCE
UTILIZATION
AVAILABILITY
Container 1
CPU Memory
Requests
Limits
Requests
Limits
Replicas
Container 2
CPU Memory
Requests
Limits
Requests
Limits
Replicas
Application
Settings
JVM
Heap
size
Garbage
collection
Container 3
CPU Memory
Requests
Limits
Requests
Limits
Replicas
Shuffle
file
buffer
Reducer
max
size
What Tools Are You Using Today?
11
TRIAL-AND-ERROR PERFORMANCE &
LOAD TESTING
K8S AUTO-SCALING
(HPA)
MONITORING/
OBSERVABILITY TOOLS
Easiest way to get started!
Great way to waste
engineering time
K8s expertise is hard to find!
Important step for validating
non-functional requirements
Helps to understand scaling
behavior of app
Doesn’t tell you how to
address issues
Dynamically scales # of pods
Still requires configuration
to optimize
Doesn’t account for app-
specific configuration settings
Good for identifying and
troubleshooting problems
in production
Reactive by nature - by
the time you find a
problem it’s already
impacted your users
12
So, how do we
get there?
Automating efficient resource utilization
13
SRE approach
Applying SW engineering principles to optimization
Automated
Leverage machine learning
Empowered dev teams
Focused on innovation
Managed risk
Understand the trade-offs
TO THIS:
FROM THIS:
Sysadmin approach
Operating and maintaining
Manual
Resource-intensive manual trial-and-error
Dev teams on their own
Stuck in the weeds of K8s
Blind risk
No sense of trade-offs
Proactive
Build efficiency into the release process
Reactive
Running a service & responding to events
“My goal is to automate myself
into my next job.”
SRE = Software Ruins Evening?
15
Hope Is Not A Strategy: Automating Efficient Resource Utilization for SREs
Shift Optimization Left
Build continuous optimization into your CI/CD pipeline
17
The StormForge platform
APPLICATION OPTIMIZATION
PERFORMANCE TESTING
Create tests quickly with
performance testing as a service
Automate testing as code in CI/CD
workflow
Execute realistic tests with open
workload model
ML-powered Rapid Experimentation
Engine
No upfront training data required
Automatically implement optimal
config based on your goals
Identify high-risk configs to avoid
APPLICATION PARAMETERS
CPU, MEMORY, REPLICAS
OUTCOMES
LATENCY, THROUGHPUT,
COST, RELIABILITY
FEEDBACK LOOP
CONTINUOUS OPTIMIZATION
Test Case
Business Objectives
Your App
in Your
Environment
Rapid
Experimentation
Engine
Visualized
Business
Objective
Trade-offs
Optimal
Configuration
How does it work?
Product
Demo
Large travel website achieves 50% cost-efficiency
improvement with StormForge
Initial config after
extensive manual
work
Selected config
reduced cost by 50%
w. no impact on
performance
● Team of 9 focused on
manual optimization prior
to StormForge
● Minimal dev and test
environment due to private
cloud resource constraints
● 50% resource utilization
improvement with no
performance degradation
● Free up private cloud
resources for additional QA
environments
See the cloud waste survey results:
https://www.stormforge.io/cloud-waste-report
Schedule a Demo:
www.stormforge.io/request-a-demo/
Sign Up for Free:
www.stormforge.io/get-started/
info@stormforge.io @StormForgeIO
Next Steps
Event Feedback?
events@stormforge.io
Find all these assets and more at
www.stormforge.io/next-steps/

More Related Content

PDF
PDF
eXtreme programming (XP) - An Overview
PPT
XP Explained
ODP
Agile Injection, Varberg
PPT
Xp presentation 2003
PPT
Extreme & pair programming Slides ppt
PPTX
Xp(Xtreme Programming) presentation
PPTX
Continuous integration
eXtreme programming (XP) - An Overview
XP Explained
Agile Injection, Varberg
Xp presentation 2003
Extreme & pair programming Slides ppt
Xp(Xtreme Programming) presentation
Continuous integration

What's hot (17)

PPT
Resource#2
PPT
Managing distributed teams_webinar_v2_october_2011
PPTX
Extreme programming - a quick and agile overview !
PPT
extreme programming
PPTX
Going extreme-with-extreme-programming
PPTX
DevOps Torino Meetup - SRE Concepts
PPTX
Estimating and planning Agile projects
PDF
Upgrade Preparation Best Practices & Templates | INNOVATE16
PDF
Visual studio 2010 alm keynote sgp
PDF
Managing technical debt - Dan Nicola - Florin Cardasim
PDF
Sre summary
PDF
Enough about Process, Let’s Use Patterns
PDF
PDF
Prioritizing Your Product Backlog
PDF
Incorporating Learning and Expected Cost of Change
PDF
Project Economics
PPTX
MS Cloud Day - Cloud Computing – A Crash Course for Architects
Resource#2
Managing distributed teams_webinar_v2_october_2011
Extreme programming - a quick and agile overview !
extreme programming
Going extreme-with-extreme-programming
DevOps Torino Meetup - SRE Concepts
Estimating and planning Agile projects
Upgrade Preparation Best Practices & Templates | INNOVATE16
Visual studio 2010 alm keynote sgp
Managing technical debt - Dan Nicola - Florin Cardasim
Sre summary
Enough about Process, Let’s Use Patterns
Prioritizing Your Product Backlog
Incorporating Learning and Expected Cost of Change
Project Economics
MS Cloud Day - Cloud Computing – A Crash Course for Architects
Ad

Similar to Hope Is Not A Strategy: Automating Efficient Resource Utilization for SREs (20)

PDF
Accelerate Application Migration - August 5, 2020
PPTX
Rethinking Risk-Based Project Management in the Emerging IT initiatives.pptx
DOCX
SUMAN_BALU_PROFILE_LATEST
DOCX
SUMAN_BALU_PROFILE_LATEST
PPTX
Apm andre santos
PPT
Outsourcing With Agile
PPT
Reducing Cost With Agile
PDF
Using Lean Thinking to Identify and Address Delivery Pipeline Bottlenecks
PPT
Risk Management
ODP
2008 09 06 Eric Ries Haas Columbia Customer Development Engineering
PDF
굿 소프트웨어 컴퍼니로의 여정(Journey To Be a Good Software Company)
PDF
RUC 2003 reducing time to market using follow-the-sun techniques
PPTX
Unit 9 and Unit 10.pptx
PPT
Phoenix User Group Slides
PPTX
PureApp Presentation
DOCX
Chethan Updated Resume
PPTX
Strategies for Implementing Aras Innovator
PPTX
CWIN17 london delivering devops and release automation in fs - duncan bradf...
PDF
Accelerate and Streamline Performance Testing with AI-powered Test Automation...
PPT
Agile Pmi 102108 Final
Accelerate Application Migration - August 5, 2020
Rethinking Risk-Based Project Management in the Emerging IT initiatives.pptx
SUMAN_BALU_PROFILE_LATEST
SUMAN_BALU_PROFILE_LATEST
Apm andre santos
Outsourcing With Agile
Reducing Cost With Agile
Using Lean Thinking to Identify and Address Delivery Pipeline Bottlenecks
Risk Management
2008 09 06 Eric Ries Haas Columbia Customer Development Engineering
굿 소프트웨어 컴퍼니로의 여정(Journey To Be a Good Software Company)
RUC 2003 reducing time to market using follow-the-sun techniques
Unit 9 and Unit 10.pptx
Phoenix User Group Slides
PureApp Presentation
Chethan Updated Resume
Strategies for Implementing Aras Innovator
CWIN17 london delivering devops and release automation in fs - duncan bradf...
Accelerate and Streamline Performance Testing with AI-powered Test Automation...
Agile Pmi 102108 Final
Ad

Recently uploaded (20)

DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PPTX
Construction Project Organization Group 2.pptx
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PDF
composite construction of structures.pdf
PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PPTX
Internet of Things (IOT) - A guide to understanding
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PPTX
web development for engineering and engineering
DOCX
573137875-Attendance-Management-System-original
PPT
Mechanical Engineering MATERIALS Selection
PPTX
UNIT 4 Total Quality Management .pptx
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PDF
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
PPTX
bas. eng. economics group 4 presentation 1.pptx
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
Construction Project Organization Group 2.pptx
Embodied AI: Ushering in the Next Era of Intelligent Systems
composite construction of structures.pdf
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
Model Code of Practice - Construction Work - 21102022 .pdf
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
Internet of Things (IOT) - A guide to understanding
CYBER-CRIMES AND SECURITY A guide to understanding
web development for engineering and engineering
573137875-Attendance-Management-System-original
Mechanical Engineering MATERIALS Selection
UNIT 4 Total Quality Management .pptx
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
bas. eng. economics group 4 presentation 1.pptx

Hope Is Not A Strategy: Automating Efficient Resource Utilization for SREs

  • 1. Hope is Not a Strategy June 3, 2021 Automating efficient resource utilization for SREs Welcome! We’ll get started in a few minutes...
  • 2. Today’s StormForge presenters ● Product mktg professional, former developer ● 30 years industry experience, including Dynatrace, Altiris, EDS ● Based in Ann Arbor, MI RICH BENTLEY Sr. Director, Product Marketing ● Sales Engineer, former IT leader ● 24 years experience: Finserv, Biotech, Juniper Networks, Tintri ● Based in the Bay Area, CA ERWIN DARIA Principal Sales Engineer
  • 3. What does SRE stand for? Simply Restart Everything A Senior Rebooting Engineer B Sleep Rarely Ever C Software Ruins Evening D Seriously Regretting Everything E
  • 4. Your job is incredibly difficult Pressure Stakes are high Uncharted waters Massive complexity
  • 6. Achieving peak efficiency means... Application Performance Time & Effort Cost/Resource Utilization ...improving SLI performance ↑ Throughput ↑ Availability ↓ Latency ↓ Error rate To consistently meet or exceed SLOs... ...at the lowest possible cost... ...with the least amount of effort.
  • 7. What is the impact of poor performance for your business? Revenues War rooms Customer sat Customer retention Conversions Slowed migration/ transformation Brand/reputation Delayed initiatives Productivity Loss: dev, ops, business users Abandonment
  • 8. Nearly half of cloud spend is wasted Source: StormForge Cloud Waste Survey https://www.stormforge.io/survey- report/cloud-waste-survey-findings/
  • 9. Developer burnout is a big problem Source: D2IQ, Kubernetes in the Enterprise: Uncovering Challenges & Opportunities 38% of developers and architects claim their work makes them feel extremely burnt out 51% of developers and architects say building cloud native applications makes them want to find a new job
  • 10. Why is it so hard? 10 APPLICATION PERFORMANCE COST/RESOURCE UTILIZATION AVAILABILITY Container 1 CPU Memory Requests Limits Requests Limits Replicas Container 2 CPU Memory Requests Limits Requests Limits Replicas Application Settings JVM Heap size Garbage collection Container 3 CPU Memory Requests Limits Requests Limits Replicas Shuffle file buffer Reducer max size
  • 11. What Tools Are You Using Today? 11 TRIAL-AND-ERROR PERFORMANCE & LOAD TESTING K8S AUTO-SCALING (HPA) MONITORING/ OBSERVABILITY TOOLS Easiest way to get started! Great way to waste engineering time K8s expertise is hard to find! Important step for validating non-functional requirements Helps to understand scaling behavior of app Doesn’t tell you how to address issues Dynamically scales # of pods Still requires configuration to optimize Doesn’t account for app- specific configuration settings Good for identifying and troubleshooting problems in production Reactive by nature - by the time you find a problem it’s already impacted your users
  • 12. 12 So, how do we get there?
  • 13. Automating efficient resource utilization 13 SRE approach Applying SW engineering principles to optimization Automated Leverage machine learning Empowered dev teams Focused on innovation Managed risk Understand the trade-offs TO THIS: FROM THIS: Sysadmin approach Operating and maintaining Manual Resource-intensive manual trial-and-error Dev teams on their own Stuck in the weeds of K8s Blind risk No sense of trade-offs Proactive Build efficiency into the release process Reactive Running a service & responding to events
  • 14. “My goal is to automate myself into my next job.”
  • 15. SRE = Software Ruins Evening? 15
  • 17. Shift Optimization Left Build continuous optimization into your CI/CD pipeline 17
  • 18. The StormForge platform APPLICATION OPTIMIZATION PERFORMANCE TESTING Create tests quickly with performance testing as a service Automate testing as code in CI/CD workflow Execute realistic tests with open workload model ML-powered Rapid Experimentation Engine No upfront training data required Automatically implement optimal config based on your goals Identify high-risk configs to avoid
  • 19. APPLICATION PARAMETERS CPU, MEMORY, REPLICAS OUTCOMES LATENCY, THROUGHPUT, COST, RELIABILITY FEEDBACK LOOP CONTINUOUS OPTIMIZATION Test Case Business Objectives Your App in Your Environment Rapid Experimentation Engine Visualized Business Objective Trade-offs Optimal Configuration How does it work?
  • 21. Large travel website achieves 50% cost-efficiency improvement with StormForge Initial config after extensive manual work Selected config reduced cost by 50% w. no impact on performance ● Team of 9 focused on manual optimization prior to StormForge ● Minimal dev and test environment due to private cloud resource constraints ● 50% resource utilization improvement with no performance degradation ● Free up private cloud resources for additional QA environments
  • 22. See the cloud waste survey results: https://www.stormforge.io/cloud-waste-report Schedule a Demo: www.stormforge.io/request-a-demo/ Sign Up for Free: www.stormforge.io/get-started/ info@stormforge.io @StormForgeIO Next Steps Event Feedback? events@stormforge.io Find all these assets and more at www.stormforge.io/next-steps/