How does multiple replicas/pods scale Kubernetes? How does multiple replicas/pods scale Kubernetes? kubernetes kubernetes

How does multiple replicas/pods scale Kubernetes?


Kubernetes has couple of scaling mechanisms. Horizontal Pod Autoscaler being the basic, but not the only one.

With HPA you can spin up additional PODs according to some metrics (most commonly cpu and memory). At some point you will hit a moment when your cluster nodes do not have enough resources to satisfy resource requirements of your pods (you will have pods in Pending state due to lack of nodes available for scheduling).

At that point a Cluster Autoscaler can kick in and ie. scale AWS ASG (or some other cloud-ish node pool) to add new node to the cluster and make space for the pending pod(s)