Multigenerational
Observability and
Security for Evolving
Cloud Operations
August 2024
2
Patrick Hannah
CTO
CloudHesive
Todd Williams
Senior Partner Manager
Datadog
● It’s easier to build on AWS than it was 5 or even 10 years ago
○ Great!
● A common challenge among organizations is staying current
○ Architecture, Operations and Security
● How do you create a balance?
○ Development of new feature/functionality
○ Addressing risk and debt
● Let’s dig in!
Introduction
3
4
75%
of organizations will anchor
their digital transformation on
the cloud by 2026
Source: Gartner (Apr 2023)
5
Gartner Forecast: Public Cloud Services, Worldwide - 2010-2016, 4Q12 Update; 2011-2017, 4Q13 Update; 2012-2018, 4Q14 Update; 2013-2019, 4Q15 Update; 2014-2020, 4Q16 Update;
2015-2021, 4Q17 Update; 2016-2022, 4Q18 Update; 2017-2023, 4Q19 Update; 2018-2024, 4Q20 Update; 2019-2025, 4Q21 Update; 2020-2026, 4Q22 Update; 2021-2027, 4Q23 Update.
Gartner Market Databook - 4Q12 Update; 4Q13 Update; 4Q14 Update; 4Q15 Update; 4Q16 Update; 4Q17 Update; 4Q18 Update; 4Q19 Update; 4Q20 Update; 4Q21 Update; 4Q22 Update; 4Q23
Cloud migration and digital transformation
Cloud spend continues to grow rapidly
6
The problem: an explosion of complexity
Diversity of technologies in use
Frequency of release
Scale in number of computing units
Number of people involved
Where are you on your journey?
● Small team
● Building an app
● To get customers
● Now you have customers
● And those customers have unstated, but assumed, expectations
○ Customers are not always the ones paying for the service
7
Datadog x CloudHesive: better together
8
Log Management Tool
Infrastructure Monitoring Tool
Application Monitoring Tool
Synthetics Tool
Cloud Security Management
Network Performance
Monitoring
Hybrid Workloads
Migrated Workloads
Containerized Workloads
Serverless Workloads
Why do we work better
together?
9
Common customer value drivers
Enhance customer
experience
Enable operational
scalability and
cost reduction
Reduce
operational,
security and
compliance risk
Accelerate digital
transformation
10
11
Datadog is a single platform
for all your observability &
security needs
12
CI Visibility
Test Visibility
Intelligent Test
Runner
Continuous Testing
Cloud Security
Management
Application Security
Management
Software
Composition
Analysis
Cloud SIEM
Log Management
Observability
Pipelines
Audit Trail
Log Forwarding
Error Tracking
Sensitive
Data Scanner
Synthetics
Mobile App Testing
Browser Real User
Monitoring
Mobile Real User
Monitoring
Session Replay
Application
Performance
Monitoring
Distributed Tracing
Continuous Profiler
Database
Monitoring
Universal Service
Monitoring
Data Streams
Monitoring
Data Jobs
Monitoring
LLM Observability
Infrastructure
Monitoring
Containers
Serverless
Network
Performance
Monitoring
Network Device
Monitoring
Metrics
Cloud Cost
Management
Cloudcraft
Incident
Management
Event Management
Workflow
Automation
App Builder
Shared Platform Services
Dashboards, CoScreen, Teams, Agent, OpenTelemetry, Notebooks, Service Catalog, IDE Plugins, ChatOps, SLOs, Case Management
UNIFIED METRICS, LOGS, TRACES, SESSIONS
AI
Natural Language Querying, Root Cause Analysis, Anomaly Detection, Impact Analysis, Proactive Alerts, Autonomous Investigations, Bits AI
Infrastructure, Applications, Digital Experience, Logs, Security, Software Delivery, Cloud Service Management
Datadog Platform
750+ INTEGRATIONS
Datadog helps CloudHesive’s customers make
sense of their data
13
Migration Operations
Modernization
Deep visibility into complex
environments (e.g., on-prem,
hybrid, multi-cloud)
Remove undifferentiated heavy
lifting of the security and
operations of your workloads; focus
on your product
Maintain high observability of
rapidly evolving infrastructure (e.g.,
AI, ML, containers, serverless)
Solving customer
challenges together
14
15
Workload 1 - Serverless Architecture
Workload 1 - Serverless Services Map
16
What are we seeing?
● Technology Transformation
○ Broad set of technologies
○ Evolving workload architectures
● Observability
○ Pre-cloud
○ Native observability
○ Hybrid
● Well Architected - Common Challenges
○ Validation
○ Parameters
○ Monitoring, Response, Root Cause Analysis
● Release Safety
○ Test Automation
○ Release Automation
○ Post Release Monitoring
○ Rollback Capabilities
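The release-safety bullets above (post-release monitoring feeding a rollback capability) can be sketched as a small decision function. This is only an illustration under stated assumptions: the function name, the inputs, and the thresholds are hypothetical and do not come from the deck or from any Datadog or AWS API.

```python
def should_roll_back(baseline_errors, baseline_requests,
                     release_errors, release_requests,
                     max_ratio=2.0, min_requests=100):
    """Return True when the post-release error rate looks clearly worse.

    All thresholds are illustrative assumptions, not recommended values.
    """
    # Too little post-release traffic to judge: keep watching instead.
    if release_requests < min_requests:
        return False
    baseline_rate = baseline_errors / max(baseline_requests, 1)
    release_rate = release_errors / release_requests
    # Use a small floor so a zero-error baseline does not make any
    # single post-release error trigger a rollback.
    floor = 0.001
    return release_rate > max(baseline_rate, floor) * max_ratio
```

In practice the counts would come from whatever monitoring is already in place; the point is only that the rollback trigger is an explicit, testable rule rather than a judgment call made during an incident.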
17
As an application owner, what’s expected?
18
● Product/Development
○ Responsiveness
○ Accuracy
○ Allocation
● Security and Compliance
○ Libraries
○ Infrastructure
○ Code
○ Intellectual Property
○ Events
○ Vendors, Licenses, SaaS, Software, NF Systems (DevOps), People Backgrounds, Training, Phishing –
Corporate and App
● Infrastructure
○ Availability
○ Performance – Sync
○ Performance – Async
As an application owner, what’s expected?
● Customer
○ Trials
○ Conversions
○ Attrition
○ Growth
○ NPS
● Financial
○ Budget to Plan
○ Margin
○ Vendors, Licenses, SaaS, Software, NF Systems (DevOps), People Backgrounds,
Training, Phishing People – Corporate and App
19
How can you measure this?
20
● Tiering
○ Portfolio Health
○ Customer Specific
○ Resource Specific
● Cohorting
○ Geo Customers and Teams
○ Line of Business Customers and Teams
● Vendor Management
○ Customer Margin
○ Non Customer Margin/Utilization
● Actions
○ ROI Proving
○ Churn Detection
○ Continuous Improvement/Optimization Pipeline
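As one way to make the cohorting and churn-detection bullets above concrete, the sketch below groups customers by cohort and flags those whose usage dropped sharply between two periods. The data layout, the function name, and the 50% drop threshold are assumptions made for illustration only.

```python
from collections import defaultdict


def flag_churn_risk(usage, drop_threshold=0.5):
    """Group at-risk customers by cohort.

    usage: iterable of (cohort, customer, previous_period, current_period)
    tuples, where the two period values are usage counts (hypothetical).
    A customer is flagged when current usage fell by more than
    drop_threshold relative to the previous period.
    """
    at_risk = defaultdict(list)
    for cohort, customer, previous, current in usage:
        # Customers with no prior usage have no baseline to drop from.
        if previous > 0 and current < previous * (1 - drop_threshold):
            at_risk[cohort].append(customer)
    return dict(at_risk)
```

A cohort here could be a geography or a line of business, matching the two cohorting dimensions listed on the slide.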
21
Workload 1 - Serverless - Feature Usage
Workload 1 - Serverless - Usage
22
What are common challenges?
23
● Infosec
○ Sprawl (Contracts, Software, Licenses, Services, Hardware), Auth, Unique Risks/Threats
● Data
○ Generative AI
○ Machine to Machine
● Platforming
○ Compute is a commodity
○ Containerization as a default
○ Serverless as a way forward
● Stack consolidation (Stackolidation)
Well Architected Framework
● Pillars
○ Operational Excellence
○ Security
○ Reliability
○ Performance Efficiency
○ Cost Optimization
○ Sustainability
● Lenses
○ SaaS
24
Information Security
25
● Frameworks
○ SOC 1/2 Type 1/2
○ PCI DSS
○ FedRAMP / CMMC
○ HIPAA / HITRUST
● Processes
○ Information
○ Risks
○ Policies
○ Procedures
○ Controls
○ Assessment/Audit/Testing
○ People
■ Onboarding, Offboarding, Change
■ Permissions: Privileged Access, Business Need
■ Credentials: Strength and Factors
■ Training
■ Background
■ Phishing
■ Vendors
■ Software/Services
What do customers often miss?
● Ingress Security Group
● Egress Security Group (Internet)
● Security Groups to/from other Services (AWS and On Premises)
● Security of the Environment
● Security of supporting servers (Active Directory)
● Security of other network-accessible resources (Web Servers)
● User Permissions (Non-Local Admin, Local Admin, Global Admin)
● Access of the environment (PKI Cert, PKI PIV, Network, MFA)
● The rest of the AWS Account? The rest of the AWS Account! (Services, APIs)
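The first items in the list above (ingress and egress security groups) lend themselves to an automated check. The sketch below scans data shaped like the `SecurityGroups` list returned by the EC2 DescribeSecurityGroups API and reports ingress rules open to the whole internet. It works on plain dictionaries and makes no AWS calls; the function name and the sample values in the usage note are made up for illustration.

```python
def open_ingress_rules(security_groups):
    """Return (group_id, from_port, to_port) for world-open ingress rules.

    security_groups: list of dicts shaped like the SecurityGroups entries
    of an EC2 DescribeSecurityGroups response.
    """
    findings = []
    for group in security_groups:
        for perm in group.get("IpPermissions", []):
            cidrs = [r.get("CidrIp") for r in perm.get("IpRanges", [])]
            cidrs += [r.get("CidrIpv6") for r in perm.get("Ipv6Ranges", [])]
            if "0.0.0.0/0" in cidrs or "::/0" in cidrs:
                # FromPort/ToPort are absent when the rule covers all traffic.
                findings.append(
                    (group["GroupId"], perm.get("FromPort"), perm.get("ToPort"))
                )
    return findings
```

For example, a group with ID `sg-example` (hypothetical) that allows port 22 from `0.0.0.0/0` would be reported as `("sg-example", 22, 22)`. The same loop applied to `IpPermissionsEgress` would cover the egress bullet.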
26
Workload 1 - Serverless - Security Events
27
Revenue to Usage to Cost Attribution
● Revenue
○ Hierarchy of Needs
● Usage
○ Typically tied to revenue if you are offering a SaaS based product
● Cost Attribution
○ Labor
■ Capitalized
○ Software
■ Opex vs. Capex
○ Services
■ Third Party
● Okayish
■ AWS
● With serverless (or well managed servers) margin should be linear, or better, with scale of
usage
○ Optimization at Scale
○ Savings Plans
28
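The revenue-to-usage-to-cost chain above can be illustrated with a small margin calculation. This is a sketch under a simplifying assumption that shared cost is allocated to each customer in proportion to usage; the function name and the allocation rule are not from the deck.

```python
def margin_by_customer(revenue, usage, shared_cost):
    """Allocate shared cost by usage share and return margin per customer.

    revenue and usage are dicts keyed by customer; shared_cost is a single
    total (for example a monthly bill). Proportional allocation is an
    assumption made for this sketch.
    """
    total_usage = sum(usage.values())
    margins = {}
    for customer, rev in revenue.items():
        share = usage.get(customer, 0) / total_usage if total_usage else 0.0
        cost = shared_cost * share
        # Margin expressed as a fraction of that customer's revenue.
        margins[customer] = (rev - cost) / rev if rev else 0.0
    return margins
```

Tracking this number as usage grows is one way to check the claim that margin stays linear or improves with scale.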
Cost Optimization Priority Funnel
● Inappropriate Resources (EC2 versus Fargate)
● Rightsizing (Compute Optimizer/CloudWatch Memory) – EC2 Unused/Underused
● EC2-Other (EBS Unused, Underused)
● Trusted Advisor (EBS Unused)
● Egress
● Well Architected Pillars
● Enterprise Agreement – Customer
○ Customer
● Enterprise Agreement – Service
○ Org - CloudFront
● Service Tiering
○ Org
● Savings Plans
○ EC2, Lambda, Fargate
● Reserved Instances
○ EC2, RDS, ElastiCache
29
Workload 1 - Serverless - Cost Attribution
30
Sustainability in Technology
● Industry Goals
○ AWS Goals – Water Positive 2030, 100% Renewable Energy 2025
■ Partner Impact – 1 of 13 domains in MSP Audit focuses on sustainability
■ Industry Impact – 1 of 6 pillars in Well Architected Framework focuses on sustainability
■ Customer Impact – Proactive (planning) and reactive (actual consumption) visibility into a
workload’s Carbon Footprint
● Organizational Goals
○ Our Goals – Influence and impact our customers through leadership
● Our Unique Position
○ Cross section of customers
○ Influenced Impact
○ Direct Impact
● Sustainability in technology starts with optimization (cost, performance, etc.) – it doesn’t end there
○ Defining operational parameters – how “fast” does “it” need to be?
○ Service selection (which can influence, and be influenced by, cost optimization objectives) – e.g., running 24
hours a day to serve a workday-only application
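The last bullet above can be made concrete with simple arithmetic. Assuming, purely for illustration, that a workday application is needed 8 hours a day, 5 days a week, an always-on resource is in use for 40 of 168 weekly hours, roughly 24% of the time.

```python
HOURS_PER_WEEK = 24 * 7  # 168


def needed_fraction(hours_per_day=8, days_per_week=5):
    """Fraction of the week a workday-only application is actually needed.

    The 8x5 schedule is an assumed example, not a figure from the deck.
    """
    return hours_per_day * days_per_week / HOURS_PER_WEEK


def idle_fraction(hours_per_day=8, days_per_week=5):
    """Fraction of the week an always-on resource sits unused."""
    return 1 - needed_fraction(hours_per_day, days_per_week)
```

The idle fraction is what scheduling, scaling to zero, or a serverless service selection would aim to recover, for both cost and sustainability.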
31
Cloud Workload Lifecycle Management
● Workload
● Architecture
● Monitoring
● Automation
● Processes
● Integration
32
Workload + Architecture Drives Service
Selection
● Containers
○ Container File
○ Versioning
○ Multi-threaded/Single-task
○ Minutes to Days
○ Per VM/Per Hour
● Virtual Machines
○ AMI
○ Patching
○ Multi-threaded/Multi-task
○ Hours to Months
○ Per VM/Per Hour
● Functions/Services
○ Code
○ Versioning
○ Single-threaded/Single-task
○ Microseconds to Seconds
○ Per Memory/Second/Per Request
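The lifetime bands on this slide (microseconds to seconds, minutes to days, hours to months) suggest a first-pass selection rule. The sketch below is a deliberately crude heuristic: the numeric cut-offs are assumptions chosen for illustration, the slide's bands overlap, and real selection would also weigh packaging, patching, concurrency, and billing.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def suggest_compute(expected_lifetime_seconds):
    """Map an expected unit-of-work lifetime to a compute style.

    Cut-offs (one minute, one week) are hypothetical, not from the deck.
    """
    if expected_lifetime_seconds < 60:
        return "functions"          # microseconds to seconds
    if expected_lifetime_seconds < 7 * SECONDS_PER_DAY:
        return "containers"         # minutes to days
    return "virtual machines"       # hours to months
```

For example, a 200 ms request handler maps to functions, a one-hour batch job to containers, and a process expected to run for a month to virtual machines.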
33
How does multigenerational observability play a
role?
● Ease of implementation - cloud service
● Support for integrations - 110 AWS services
● Consideration for hyperscalers
● Common interface
● Decoupling from your planes
34
Workload 1 - Serverless - API Usage
35
Workload 1 - Serverless - Lambda Usage
36
37
Workload 1 - Serverless - Recent Troubleshooting
Detection, confirmation, validation of defects and fixes - escaped and undetected; root cause analysis
3 Ways: Logs, APM Traces, Error Tracking
38
Workload 1 - Serverless - Recent
Troubleshooting
39
Workload 2 - EC2 Fixed, EC2 ASG, K8S, EKS, Net
40
Workload 2 - Topology
Workload 2 - Networking
41
Customer Example - Architecture
● Notable percentage of Managed Services incidents could have been avoided through up-front
architecture
○ ~9 EC2 instances (NGINX, Front End, Back End, Database) × 4 hours/instance/month of
care and feeding = 108,000.00 USD/Year @ 250.00 USD/Hour
● That’s customer cost; what about opportunity cost?
○ Opportunity for tremendous customer value (customer saves 108,000.00 USD/Year) and
provides us an opportunity to be more strategic with our partner (moving up the stack)
● What can we do? None of the above systems need to be servers
○ Increases the customers we can touch without a direct correlation to headcount
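The annual figure in this example follows directly from the numbers on the slide: 9 instances, 4 hours per instance per month, 12 months, at 250.00 USD per hour. A short sketch of the arithmetic (the function name is illustrative):

```python
def annual_care_cost(instances, hours_per_instance_per_month, rate_usd_per_hour):
    """Yearly labor cost of caring for and feeding a fleet of servers."""
    monthly_hours = instances * hours_per_instance_per_month
    return monthly_hours * 12 * rate_usd_per_hour


# Figures from the slide: 9 instances, 4 hours each per month, 250 USD/hour.
slide_example = annual_care_cost(9, 4, 250.00)
```

With the slide's inputs this is 36 hours a month, 432 hours a year, and 108,000 USD a year.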
42
Customer Example - Monitoring
● Previous Example entails hundreds of monitorable events and metrics, with a
composite required to understand state
○ Interesting events feed into event driven automation
● Eliminating the instances shifts the focus of monitoring to customer outcomes
○ Increasing the scope of automated data collection eliminates manual
checking but introduces complex correlation engines (people); outcome-based
monitoring minimizes the need for that correlation and increases positive
customer sentiment
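The idea of a composite "required to understand state" can be sketched as a worst-state-wins rollup over many individual signals. The state names, their ordering, and the function name are assumptions for illustration and do not describe any specific monitoring product's behavior.

```python
# Severity ordering for the hypothetical states used in this sketch.
SEVERITY = {"ok": 0, "warn": 1, "alert": 2}


def composite_state(signals):
    """Roll many per-resource signals up into one state; the worst wins.

    signals: dict mapping a signal name to "ok", "warn", or "alert".
    An empty dict is treated as "ok".
    """
    return max(signals.values(), key=SEVERITY.__getitem__, default="ok")
```

With hundreds of instance-level signals, this rollup is what a person otherwise performs by hand; an outcome-based check (for example, "can a customer complete a transaction") replaces many inputs with one.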
43
Customer Example - Automation
● Previous example also has numerous automation touch points (AWS
Services, Operating System, Services, etc.)
○ A move to serverless drops this number to practically none
● Automation skills shift to development automation
○ Provides a consistent experience within and across customers, and
again increases the value of our impact to our customers without a
direct tie to headcount
44
Conclusion
● AWS and Datadog continue to increase the breadth and depth of their service offerings
○ I wish it did that
○ I didn’t know I needed that
● It’s easier to get started today than it was yesterday
○ Simplicity
○ Support
○ Cost
● Build with these in mind
○ Evolving workloads
○ Well Architected
○ Observability
● Conclusion
○ Consider sustainability when choosing an approach – Maslow’s Hammer
○ Don’t forget about team enablement
○ Limited by your imagination and ability to execute
45
Questions?
Ask your questions in the Q&A box below
47
Getting started with Datadog and CloudHesive
Sign up for our Partner Free Trial
Set up a follow-up call
Multigenerational Observability
and Security for Evolving Cloud
Operations
Resource slides
49
Datadog product icons
57
INFRASTRUCTURE
Infrastructure
Monitoring
Network Performance
Monitoring
Network Device
Monitoring
Container Monitoring
Serverless
Cloud Cost
Management
Cloudcraft
LOGS
Log Management
Sensitive Data Scanner
Audit Trail
Observability Pipelines
APPLICATIONS
Application
Performance
Monitoring
Universal Service
Monitoring
Continuous Profiler
Database Monitoring
Data Streams
Monitoring
Service Catalog
Dynamic
Instrumentation
SECURITY
Software Composition
Analysis
Application Security
Management
Cloud Security
Management
Cloud SIEM
AIOPS
Event Management
Watchdog
Bits AI
OpenTelemetry
Workflow Automation
CoScreen
Dashboards
Alerts
Integrations
IDE Plugins
API
PLATFORM CAPABILITIES
SOFTWARE DELIVERY
CI Pipeline Visibility
Test Visibility &
Intelligent Test Runner
Continuous Testing
DIGITAL EXPERIENCE
Browser Real User
Monitoring
Mobile Real User
Monitoring
Synthetic Monitoring
Mobile App Testing
Session Replay
Error Tracking
SERVICE MANAGEMENT
Incident Management
Case Management
Service Level
Objectives

CloudHesive x Datadog Multi Generational Observability

  • 1. Multigenerational Observability and Security for Evolving Cloud Operations August 2024
  • 3. ● It’s easier to build on AWS than it was 5 or even 10 years ago ○ Great! ● A common challenge among organizations is staying current ○ Architecture, Operations and Security ● How do you create a balance ○ Development of new feature/functionality ○ Addressing risk and debt ● Let’s dig in! Introduction 3
  • 4. 4 75% of our organizations will anchor their digital transformation on the cloud by 2026 Source: Gartner (Apr 2023)
  • 5. 5 Gartner Forecast: Public Cloud Services, Worldwide - 2010-2016, 4Q12 Update; 2011-2017, 4Q13 Update; 2012-2018, 4Q14 Update; 2013-2019, 4Q15 Update; 2014-2020, 4Q16 Update; 2015-2021, 4Q17 Update; 2016-2022, 4Q18 Update; 2017-2023, 4Q19 Update; 2018-2024, 4Q20 Update; 2019-2025, 4Q21 Update; 2020-2026, 4Q22 Update; 2021-2027, 4Q23 Update.. Gartner Market Databook - 4Q12 Update; 4Q13 Update; 4Q14 Update; 4Q15 Update; 4Q16 Update; 4Q17 Update; 4Q18 Update; 4Q19 Update; 4Q20 Update; 4Q21 Update; 4Q22 Update; 4Q23 Cloud migration and digital transformation Cloud spend continues to grow rapidly
  • 6. 6 The problem: an explosion of complexity Diversity of technologies in use Frequency of release Scale in number of computing units Number of people involved
  • 7. Where are you at on your journey? ● Small team ● Building an app ● To get customers ● Now you have customers ● And those customers have unstated, but expected, expectations ○ Customers are not always the ones paying for the service 7
  • 8. Datadog x CloudHesive: better together 8 Log Management Tool Infrastructure Monitoring Tool Application Monitoring Tool Synthetics Tool Cloud Security Management Network Performance Monitoring Hybrid Workloads Migrated Workloads Containerized Workloads Serverless Workloads
  • 9. Why do we work better together? 9
  • 10. Common customer value drivers Enhance customer experience Enable operational scalability and cost reduction Reduce operational, security and compliance risk Accelerate digital transformation 10
  • 11. 11 Datadog is a single platform for all your observability & security needs
  • 12. 12 CI Visibility Test Visibility Intelligent Test Runner Continuous Testing Cloud Security Management Application Security Management Software Composition Analysis Cloud SIEM Log Management Observability Pipelines Audit Trail Log Forwarding Error Tracking Sensitive Data Scanner Synthetics Mobile App Testing Browser Real User Monitoring Mobile Real User Monitoring Session Replay Application Performance Monitoring Distributed Tracing Continuous Profiler Database Monitoring Universal Service Monitoring Data Streams Monitoring Data Jobs Monitoring LLM Observability Infrastructure Monitoring Containers Serverless Network Performance Monitoring Network Device Monitoring Metrics Cloud Cost Management Cloudcraft Incident Management Event Management Workflow Automation App Builder Shared Platform Services Dashboards CoScreen Teams Agent OpenTelemetry Notebooks Service Catalog IDE Plugins ChatOps SLOs Case ● ● ● ● ● ● ● ● ● ● Management UNIFIED METRICS, LOGS, TRACES, SESSIONS AI Natural Language Querying Root Cause Analysis Anomaly Detection Impact Analysis Proactive Alerts Autonomous Investigations Bits ● ● ● ● ● ● AI Infrastructur e Applications Digital Experience Logs Security Software Delivery Cloud Service Managemen t Datadog Platform 750+ INTEGRATIONS
  • 13. Datadog helps CloudHesive’s customers make sense of their data 13 Migration Operations Modernization Deep visibility into complex environments (e.g., on-perm, hybrid, multi-cloud) Remove undifferentiated heavy lifting of the security and operations of your workloads; focus on your product Maintain high observability of rapidly evolving infrastructure (e.g., AI, ML, containers, serverless)
  • 15. 15 Workload 1 - Serverless Architecture
  • 16. Workload 1 - Serverless Services Map 16
  • 17. What are we seeing? ● Technology Transformation ○ Broad set of technologies ○ Evolving workload architectures ● Observability ○ Pre-cloud ○ Native observability ○ Hybrid ● Well Architected - Common Challenges ○ Validation ○ Parameters ○ Monitoring, Response, Root Cause Analysis ● Release Safety ○ Test Automation ○ Release Automation ○ Post Release Monitoring ○ Rollback Capabilities 17
  • 18. As an application owner, what’s expected? 18 ● Product/Development ○ Responsiveness ○ Accuracy ○ Allocation ● Security and Compliance ○ Libraries ○ Infrastructure ○ Code ○ Intellectual Property ○ Events ○ Vendors, Licenses, SaaS, Software, NF Systems (DevOps), People Backgrounds, Training, Phishing – Corporate and App ● Infrastructure ○ Availability ○ Performance – Sync ○ Performance – Async
  • 19. As an application owner, what’s expected? ● Customer ○ Trials ○ Conversions ○ Attrition ○ Growth ○ NPS ● Financial ○ Budget to Plan ○ Margin ○ Vendors, Licenses, SaaS, Software, NF Systems (DevOps), People Backgrounds, Training, Phishing People – Corporate and App 19
  • 20. How can you measure this? 20 ● Tiering ○ Portfolio Health ○ Customer Specific ○ Resource Specific ● Cohorting ○ Geo Customers and Teams ○ Line of Business Customers and Teams ● Vendor Management ○ Customer Margin ○ Non Customer Margin/Utilization ● Actions ○ ROI Proving ○ Churn Detection ○ Continuous Improvement/Optimization Pipeline
  • 21. 21 Workload 1 - Serverless - Feature Usage
  • 22. Workload 1 - Serverless - Usage 22
  • 23. What are common challenges? 23 ● Infosec ○ Sprawl (Contracts, Software, Licenses, Services, Hardware), Auth, Unique Risks/Threats ● Data ○ Generative AI ○ Machine to Machine ● Platforming ○ Compute is a commodity ○ Containerization as a default ○ Serverless as a way forward ● Stack consolidation (Stackolidation)
  • 24. Well Architected Framework ● Pillars ○ Operational Excellence ○ Security ○ Reliability ○ Performance Efficiency ○ Cost Optimization ○ Sustainability ● Lenses ○ SaaS 24
  • 25. Information Security 25 ● Frameworks ○ SOC 1/2 Type 1/2 ○ PCI DSS ○ FedRAMP / CMMC ○ HIPAA / HITRUST ● Processes ○ Information ○ Risks ○ Policies ○ Procedures ○ Controls ○ Assessment/Audit/Testing ○ People ■ Onboarding, Offboarding, Change ■ Permissions: Privileged Access, Business Need ■ Credentials: Strength and Factors ■ Training ■ Background ■ Phishing ■ Vendors ■ Software/Services
  • 26. What do customers often miss? ● Ingress Security Group ● Egress Security Group (Internet) ● Security Groups to/from other Services (AWS and On Premises) ● Security of the Environment ● Security of supporting servers (Active Directory) ● Security of other network-accessible resources (Web Servers) ● User Permissions (Non-Local Admin, Local Admin, Global Admin) ● Access of the environment (PKI Cert, PKI PIV, Network, MFA) ● The rest of the AWS Account? The rest of the AWS Account! (Services, APIs) 26
  • 27. Workload 1 - Serverless - Security Events 27
  • 28. Revenue to Usage to Cost Attribution ● Revenue ○ Hierarchy of Needs ● Usage ○ Typically tied to revenue if you are offering a SaaS based product ● Cost Attribution ○ Labor ■ Capitalized ○ Software ■ Opex vs. Capex ○ Services ■ Third Party ● Okayish ■ AWS ● With serverless (or well managed servers) margin should be linear, or better with scale of usage ○ Optimization at Scale ○ Savings Plans 28
  • 29. Cost Optimization Priority Funnel ● Inappropriate Resources (EC2 versus Fargate) ● Rightsizing (Compute Optimizer/CloudWatch Memory) – EC2 Unused/Underused ● EC2-Other (EBS Unused, Underused) ● Trusted Advisor (EBS Unused) ● Egress ● Well Architected Pillars ● Enterprise Agreement – Customer ○ Customer ● Enterprise Agreement – Service ○ Org - CloudFront ● Service Tiering ○ Org ● Savings Plans ○ EC2, Lambda, Fargate ● Reserved Instances ○ EC2, RDS, ElastiCache 29
  • 30. Workload 1 - Serverless - Cost Attribution 30
  • 31. Sustainability in Technology ● Industry Goals ○ AWS Goals – Water Positive 2030, 100% Renewable Energy 2025 ■ Partner Impact – 1 of 13 domains in MSP Audit focuses on sustainability ■ Industry Impact – 1 of 6 pillars in Well Architected Framework focuses on sustainability ■ Customer Impact – Proactive (planning) and reactive (actual consumption) visibility into a workload’s Carbon Footprint ● Organizational Goals ○ Our Goals – Influence and impact our customers through leadership ● Our Unique Position ○ Cross section of customers ○ Influenced Impact ○ Direct Impact ● Sustainability in technology starts with optimization (cost, performance, etc.) – it doesn’t end there ○ Defining operational parameters – how “fast” does “it” need to be? ○ Service selection (which can be influenced by/influences cost optimization objectives) – running 24 hours a day servicing a work-day application 31
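The last bullet on slide 31 (a server running 24 hours a day for a work-day application) can be put into rough numbers. This is a minimal sketch, assuming an 8-hour, 5-day work week; the deck does not state a schedule, and the function name `busy_fraction` is illustrative only.

```python
# Assumption: the application is only used during an 8-hour, 5-day work week.
HOURS_PER_WEEK = 24 * 7


def busy_fraction(hours_per_day: float, days_per_week: int) -> float:
    """Share of the week an always-on server is actually serving work."""
    return (hours_per_day * days_per_week) / HOURS_PER_WEEK


busy = busy_fraction(hours_per_day=8, days_per_week=5)
print(f"busy {busy:.1%}, idle {1 - busy:.1%}")
```

Under these assumptions the server is idle for roughly three quarters of the week, which is the kind of gap that service selection (scheduled scaling, containers, or functions) is meant to close.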
  • 32. Cloud Workload Lifecycle Management ● Workload ● Architecture ● Monitoring ● Automation ● Processes ● Integration 32
  • 33. Workload + Architecture Drives Service Selection ● Containers ○ Container File ○ Versioning ○ Multi-threaded/Single-task ○ Minutes to Days ○ Per VM/Per Hour ● Virtual Machines ○ AMI ○ Patching ○ Multi-threaded/Multi-task ○ Hours to Months ○ Per VM/Per Hour ● Functions/Services ○ Code ○ Versioning ○ Single-threaded/Single-task ○ Microseconds to Seconds ○ Per Memory/Second/Per Request 33
  • 34. How does multigenerational observability play a role? ● Ease of implementation - cloud service ● Support for integrations - 110 AWS services ● Consideration for hyperscalers ● Common interface ● Decoupling from your planes 34
  • 35. Workload 1 - Serverless - API Usage 35
  • 36. Workload 1 - Serverless - Lambda Usage 36
  • 37. 37 Recent Troubleshooting Workload 1 - Serverless Detection, confirmation, validation of defects and fixes - escaped and undetected; root cause analysis 3 Ways: Logs, APM Traces, Error Tracking
  • 38. 38 Workload 1 - Serverless - Recent Troubleshooting
  • 39. 39 Workload 2 - EC2 Fixed, EC2 ASG, K8S, EKS, Net
  • 40. 40 Workload 2 - Topology
  • 41. Workload 2 - Networking 41
  • 42. Customer Example - Architecture ● Notable percentage of Managed Services incidents could have been avoided through up-front architecture ○ ~9 EC2 instances (NGINX, Front End, Back End, Database) at 4 hours/instance/month of care and feeding is 108,000.00 USD/Year @ 250.00 USD/Hour ● That’s customer cost, what about opportunity cost? ○ Opportunity for tremendous customer value (customer saves 108,000.00 USD/Year) and provides us an opportunity to be more strategic with our partner (moving up the stack) ● What can we do? None of the above systems need to be servers ○ Increases the customers we can touch without a direct correlation to headcount 42
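The 108,000 USD/year figure on slide 42 can be checked with a few lines of arithmetic. This is a minimal sketch, assuming the 4 hours apply per instance per month; the function and parameter names are illustrative, not from the deck.

```python
def annual_care_cost(
    instances: int,
    hours_per_instance_per_month: float,
    hourly_rate_usd: float,
) -> float:
    """Yearly labor cost of caring for and feeding a fleet of servers."""
    months_per_year = 12
    return instances * hours_per_instance_per_month * months_per_year * hourly_rate_usd


# Figures from the slide: ~9 EC2 instances, 4 hours each per month, 250 USD/hour.
cost = annual_care_cost(instances=9, hours_per_instance_per_month=4, hourly_rate_usd=250.0)
print(f"{cost:,.2f} USD/year")
```

Changing `instances` shows why removing servers, rather than tuning them, moves the number most directly.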
  • 43. Customer Example - Monitoring ● Previous example entails hundreds of monitorable events and metrics, with a composite required to understand state ○ Interesting events feed into event-driven automation ● Eliminating the instances changes the monitoring focus to customer outcomes ○ Increasing the scope of automated data collection eliminates manual checking but introduces complex correlation engines (people); outcome-based monitoring minimizes the need for these and increases positive customer sentiment 43
  • 44. Customer Example - Automation ● Previous example also has numerous automation touch points (AWS Services, Operating System, Services, etc.) ○ A move to serverless drops this number to practically none ● Automation skills shifted to development automation ○ Provides a consistent experience intra and inter customer, and again increases the value of our impact to our customers without a direct tie to headcount 44
  • 45. Conclusion ● AWS and Datadog continue to increase the breadth and depth of their service offerings ○ I wish it did that ○ I didn’t know I needed that ● It’s easier to get started today than it was yesterday ○ Simplicity ○ Support ○ Cost ● Build with these in mind ○ Evolving workloads ○ Well Architected ○ Observability ● Conclusion ○ Consider sustainability when choosing an approach – Maslow’s Hammer ○ Don’t forget about team enablement ○ Limited by your imagination and ability to execute 45
  • 46. Questions? Ask your questions in the Q&A box below
  • 47. 47 Sign up for our Partner Free Trial Getting started with Datadog and CloudHesive Set up a follow-up call
  • 48. Multigenerational Observability and Security for Evolving Cloud Operations
  • 57. Datadog product icons 57 INFRASTRUCTURE Infrastructure Monitoring Network Performance Monitoring Network Device Monitoring Container Monitoring Serverless Cloud Cost Management Cloudcraft LOGS Log Management Sensitive Data Scanner Audit Trail Observability Pipelines APPLICATIONS Application Performance Monitoring Universal Service Monitoring Continuous Profiler Database Monitoring Data Streams Monitoring Service Catalog Dynamic Instrumentation SECURITY Software Composition Analysis Application Security Management Cloud Security Management Cloud SIEM AIOPS Event Management Watchdog Bits AI OpenTelemetry Workflow Automation CoScreen Dashboards Alerts Integrations IDE Plugins API PLATFORM CAPABILITIES SOFTWARE DELIVERY CI Pipeline Visibility Test Visibility & Intelligent Test Runner Continuous Testing DIGITAL EXPERIENCE Browser Real User Monitoring Mobile Real User Monitoring Synthetic Monitoring Mobile App Testing Session Replay Error Tracking SERVICE MANAGEMENT Incident Management Case Management Service Level Objectives

Editor's Notes

  • #1: Suggested talk track: Thank you for your time today. We’re excited that you’re interested in or are already a part of the Datadog and CloudHesive partnership. After this presentation, you’ll have a better understanding of the key trends impacting the observability space and how Datadog and CloudHesive can help your customers overcome these macro trends and IT challenges to ultimately accelerate business results. You’ll also understand how our partnership can support your growth.
  • #2: Todd is a Senior Partner Manager at Datadog. Prior to Datadog, he spent 5 years working at a DevSecOps consulting organization, where he led the Observability practice.
  • #4: Purpose: Align Datadog’s mission to that of the partner and invite them to talk about complex technology undertakings, like migrating to the cloud, increased adoption of microservices architectures, and promoting DevSecOps for improved application security. Suggested talk track: Now, we all understand technology’s critical role in a company’s operations, and what we’ve seen over the past few years in the industry is that the trend towards digital transformation and cloud migration is only accelerating. In fact, Gartner predicts that 75% of organizations will anchor their digital transformation on the cloud by 2026. At Datadog, we've seen this same rise from our own customers across the board in adopting cloud-native technologies to improve uptime, resource utilization, and time to market. (Click Next Slide)
  • #5: Suggested talk track: Let’s start with cloud migration and digital transformation. Beautiful chart with Gartner data. Sustained rate of migration over the past few years, expected to continue into the foreseeable future. Worth noting that Gartner expects spend on public cloud to exceed $1tr by 2027. And yet, it will be only 18% of global tech spend. Why is this happening? Because as a company you absolutely have to! You have to interact with your customers online. You have to differentiate from the competition through innovation. You have to run into the cloud to get agility, short time to value, and ops efficiency. To be honest, you also have to lean into tech to hire the best and brightest engineers. In the end, this modernization leads to better business outcomes. All of this was true over the past decade, and we expect it to be even more pronounced in the age of AI, where being digital and in the cloud are true prerequisites to adoption. Growth is great, but it doesn’t come without its challenges, isn’t that right, Patrick? (click to next slide)
  • #6: Suggested talk track: When you lean into tech innovation, you are faced with an explosion of complexity (go through the charts)
  • #8: Suggested talk track: How are Datadog’s products used by CloudHesive’s customers? Datadog takes metrics, traces, and logs from business transactions, private cloud, on-prem, mobile apps, SaaS integrations, and more and aggregates it all into dashboards to show all of that data in one place.
  • #10: Most customers come to Datadog and begin with statements like: “We’re looking for a better solution for managing our logs,” “We need code-level visibility into application performance,” or “Our legacy monitoring solutions don’t give us enough visibility into our Cloud environments.” These pointed statements are certainly very relevant and important, but they’re just a symptom of a larger business problem. That’s why our GTM teams spend time with customers to learn what value drivers are most critical for them and why. Partners like CloudHesive are already very much ingrained in their customers’ businesses to help address these exact same value drivers. So with our shared focus on helping our customers, it makes for a very powerful partnership. [click to next slide]
  • #11: Suggested talk track: Datadog differentiator: Our mission here at Datadog is simple: to be the single unified platform for all your observability & security needs.
  • #12: Suggested talk track: We give you this end-to-end observability on a single unified platform, powered by shared services and AI/ML. From infra monitoring to APM to Real User Monitoring, Log Management, Security all the way to Developer Experience (monitoring code not only in production, but also as it’s being built). [Todd: hit on multigenerational aspect.] Datadog is built as a platform but allows for modular access to these various SKUs, so customers can use only what they need. Not only for the traditional 3 pillars, but we’re also seeing the next generation of customers looking to add real-time Security data alongside their performance data. So, there are a lot of capabilities within the platform. And we rely on partners like CloudHesive to help fully realize the value of Datadog. [click to next slide]
  • #13: Suggested talk track: Datadog helps CloudHesive customers make sense of their data – our Observability & security solutions “turn on the lights” to help them pinpoint errors and lead to faster troubleshooting. In particular, Datadog’s top two superpowers for [insert partner name] are migration and modernization. Migration: Datadog provides deep visibility into complex environments. Datadog is also cloud-agnostic and can ingest data from on-prem and hybrid cloud environments. Datadog can also track and secure every stage of your cloud migration to give your teams the confidence to migrate to the cloud. Modernization: Datadog also rapidly evolves as you adopt new services and grow with [insert partner name]. As a result, we evolve with our customers