KEDA - Kubernetes Event-Driven Autoscaler, Part 1/4 - CPU and Memory Triggers on Azure Kubernetes Service
Introduction
Some of the top organizations in the world use KEDA to operate production at a gigantic scale in the cloud, scaling on practically any metric imaginable from almost any metrics provider.
The example given here uses Azure Kubernetes Service (AKS), but you can also use KEDA with hardened Kubernetes clusters.
In this article I'll explain what Kubernetes administrators can do with KEDA (Kubernetes Event-driven Autoscaling) and how to get started.
Prerequisite
KEDA uses the Kubernetes Metrics Server, which is not deployed by default on some Kubernetes distributions (such as EKS on AWS), to collect basic CPU and memory metrics. Limits must also be included in the resources section of the applicable Kubernetes Pods (at a minimum); when only limits are set, Kubernetes defaults the requests to the same values.
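As a quick sanity check before installing KEDA, you can confirm the Metrics Server is working (output will vary by cluster):
# If the Metrics Server is running, these return live CPU/memory usage
kubectl top nodes
kubectl top pods -n kube-system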
What is KEDA?
Kubernetes Event-driven Autoscaling (KEDA) is an autoscaling method similar to the built-in Kubernetes Horizontal Pod Autoscaler (HPA). In fact, KEDA still utilizes the HPA under the hood in order to do its magic.
You can access the official KEDA website at https://keda.sh. Since KEDA offers excellent documentation, installation is simple. KEDA is an open-source Cloud Native project supported by Microsoft, and Azure AKS (Azure's Kubernetes service) offers complete support for it. Large organizations like Microsoft, Zapier, Alibaba Cloud, and many more use it in production at a huge scale.
Why KEDA is a game-changer
KEDA allows Kubernetes to scale pod replicas from zero to any number, based on metrics like the queue depth of a message queue, requests per second, scheduled cron jobs, custom metrics from your own application logging, and pretty much any other metric you can think of. The built-in Kubernetes HPA cannot accomplish this with ease. The full list of scaling providers that KEDA supports is available in the KEDA documentation.
How KEDA operates
KEDA monitors metrics from an external metrics provider, such as Azure Monitor, and scales workloads according to the metric's value. It communicates directly with the system that provides the metrics, and it runs as a single-pod Kubernetes operator that monitors continually.
From my slides, available here: https://docs.google.com/presentation/d/1IlFl_5d_mHQbEfpINMnB2jxhzrPq__PWs6HD-HQUWiA/edit?usp=sharing
How to set up KEDA
The easiest method of installing KEDA is to use the Helm chart:
helm repo add kedacore https://kedacore.github.io/charts
helm repo update
kubectl create namespace keda
helm install keda kedacore/keda --namespace keda
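Once the chart is installed, it's worth verifying that the KEDA components came up (exact pod names vary by version):
# The KEDA operator and metrics API server pods should be Running
kubectl get pods -n keda
# KEDA's CRDs, including ScaledObject, should now be registered
kubectl get crd | grep keda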
Setting up scaling
After KEDA is deployed and running in the cluster, you must create a manifest file to specify what to scale based on, when to scale, and how to scale. Below, I offer guidance on configuring scaling based on the common CPU and memory metrics.
You can find a list of all supported scaling sources and types in the documentation here.
You can also deploy a sample workload first, like the one provided with the Azure Kubernetes Service quickstart.
After deploying, you will get an external IP:
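For example (assuming the azure-vote-front service from the AKS quickstart; substitute your own service name), you can watch for the external IP like this:
# Watch the service until an EXTERNAL-IP is assigned
kubectl get service azure-vote-front --watch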
Similar to deployments, scaling is described in a manifest YAML file, with ScaledObject as the kind.
You can list the existing scaled objects:
kubectl get scaledobject
And a scaled object can be deleted with this command:
kubectl delete scaledobject <name of scaled object>
You can try commands like this:
kubectl get ns
or
kubectl get namespaces
Let's talk about scaling infrastructure (basic triggers)
Memory Scaling
A rule may be configured to scale based on the amount of memory used by the container inside the pod.
You can find the documentation on memory scaling here.
Make sure you have completed all the Helm installation steps shown above.
These kinds of values can serve as the basis for scaling:
- Utilization: the target value is the average of the resource metric across all relevant pods, represented as a percentage of the pods' requested value for that resource.
- AverageValue: the target value is the average of the metric across all relevant pods (a quantity).
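As a concrete illustration (my own numbers): if a pod requests 100Mi of memory and the Utilization target is 50, the autoscaler tries to keep average usage around 50Mi per pod, adding replicas when the average rises above that and removing them when it falls below.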
Memory Utilization is used in the manifest example below:
apiVersion: keda.sh/v1alpha1
kind: ScaledObject
metadata:
  name: memory-scaledobject
  namespace: default
spec:
  scaleTargetRef:
    name: sandboxapi
  triggers:
  - type: memory
    metadata:
      type: Utilization
      value: "50"
This example creates a rule that scales pods according to memory usage.
scaleTargetRef refers to the deployment that is to be scaled.
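Assuming the manifest is saved as memory-scaledobject.yaml, you can apply it and confirm KEDA picked it up:
# Create the ScaledObject in the cluster
kubectl apply -f memory-scaledobject.yaml
# The new object should appear with its target and triggers
kubectl get scaledobject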
CPU Scaling
Scaling can also take into account the CPU usage of the container inside the pod.
The same kinds of values as for memory can serve as the basis for scaling: Utilization (a percentage of the pods' requested value) and AverageValue (a quantity averaged over all relevant pods).
Utilization is used in the manifest example below:
apiVersion: keda.sh/v1alpha1
kind: ScaledObject
metadata:
  name: cpu-scaledobject
  namespace: default
spec:
  scaleTargetRef:
    name: sandboxapi
  triggers:
  - type: cpu
    metadata:
      type: Utilization
      value: "50"
This example creates a rule that scales pods according to CPU usage.
scaleTargetRef refers to the deployment that is to be scaled.
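Under the hood, KEDA creates a standard HPA for each ScaledObject, conventionally named keda-hpa-<scaledobject-name>; you can inspect it to see the current and target utilization:
# KEDA manages this HPA for you; don't edit it directly
kubectl get hpa keda-hpa-cpu-scaledobject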
For more reference, see the YAML manifest below, where I explain the specifications under spec.
This will give you an idea of the manifests and also of the resources that are considered.
# a working example of resources with specified limits
spec:
  containers:
  - name: app
    image: images.my-company.example/app:v4
    resources:
      limits:
        memory: "128Mi"
        cpu: "500m"
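As mentioned in the prerequisites, Utilization targets are calculated against requests, and Kubernetes defaults requests to the limits when only limits are set. To make the baseline explicit, you can declare requests as well (the values here are illustrative):
spec:
  containers:
  - name: app
    image: images.my-company.example/app:v4
    resources:
      requests: # Utilization targets are computed against these
        memory: "64Mi"
        cpu: "250m"
      limits:
        memory: "128Mi"
        cpu: "500m"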
Trigger Information
The CPU trigger, which scales based on CPU metrics, is specified as follows:
triggers:
- type: cpu
  metadata:
    # Required
    type: Utilization # Allowed types are 'Utilization' or 'AverageValue'
    value: "60"
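For comparison, an AverageValue trigger targets an absolute quantity per pod rather than a percentage (a sketch based on standard HPA semantics; the value is illustrative):
triggers:
- type: cpu
  metadata:
    type: AverageValue
    value: "500m" # target an average of 500 millicores per pod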
List of parameters
- type: the target type, either Utilization (a percentage of the pods' requested resource) or AverageValue (an absolute quantity averaged over the pods). Required.
- value: the target value to maintain, e.g. "60" for 60% Utilization. Required.
Example with comments as explanation:
apiVersion: keda.sh/v1alpha1
kind: ScaledObject
metadata:
  name: cpu-scaledobject
  namespace: default # you can choose any namespace
spec:
  scaleTargetRef:
    name: my-deployment
  triggers:
  - type: cpu
    metadata:
      type: Utilization # Allowed types are 'Utilization' or 'AverageValue'
      value: "50"
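To see the scaling in action, you can generate some load and watch the replica count (a minimal sketch adapted from the standard Kubernetes HPA walkthrough; it assumes my-deployment is exposed by a Service of the same name):
# Generate continuous HTTP load against the target service
kubectl run load-generator --rm -it --image=busybox --restart=Never -- /bin/sh -c "while true; do wget -q -O- http://my-deployment; done"
# In another terminal, watch the replica count change
kubectl get deployment my-deployment --watch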
Thus, it's easy to set up utilization-based scaling for a deployment, but getting function- and event-based triggers right in the world of microservices is much harder.
There are still many more challenges to be addressed.
Conclusion
In this article, I discussed KEDA, how it can make it simple for you to build highly scalable apps on Kubernetes, and how to get started. There are many scalers available, so you can scale according to your precise needs.
Feel free to comment on this page if you have any queries.