Kubernetes Pod Scheduling and Resource Optimization Guide

Dargslan Team | April 12, 2026 | Updated: April 20, 2026

Kubernetes scheduling determines where your pods run. Understanding node affinity, taints and tolerations, resource requests/limits, and autoscaling is critical for running production workloads efficiently.

Resource Requests and Limits

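Every production pod should specify resource requests and limits. Requests are the amount the scheduler reserves on a node when it places the pod. Limits are the ceiling enforced at runtime: a container that hits its CPU limit is throttled, and one that exceeds its memory limit is OOM-killed.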

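apiVersion: apps/v1
kind: Deployment
metadata:
  name: api-server
spec:
  replicas: 3
  selector:               # required in apps/v1; must match the template labels
    matchLabels:
      app: api-server
  template:
    metadata:
      labels:
        app: api-server
    spec:
      containers:
      - name: api
        image: myapp:latest
        resources:
          requests:       # reserved by the scheduler when placing the pod
            cpu: 250m
            memory: 256Mi
          limits:         # enforced at runtime
            cpu: 500m
            memory: 512Mi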

Sizing Guidelines

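Requests: Set to the average resource usage (what the app normally needs). The scheduler places pods based on requests, not limits
Limits: Start at 2x the request to leave room for spikes. Exceeding the memory limit triggers an OOMKill; exceeding the CPU limit only throttles the container
CPU: 1000m = 1 CPU core. Start with 100m-250m for most microservices
Memory: Monitor actual usage with kubectl top pods (requires metrics-server) and adjust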
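
As a worked example of the guidelines above, here is a minimal sketch of a resources block. The usage figures are hypothetical (a container that averages about 200m CPU and 300Mi of memory); substitute what you actually observe.

```yaml
# Hypothetical observation: ~200m CPU and ~300Mi memory on average
resources:
  requests:
    cpu: 200m       # average observed CPU
    memory: 300Mi   # average observed memory
  limits:
    cpu: 400m       # 2x the CPU request
    memory: 600Mi   # 2x the memory request
```

Because the requests are lower than the limits, a pod configured this way falls into the Burstable QoS class.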

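Node Affinity

Node affinity constrains which nodes a pod can run on, based on node labels. A required rule is a hard filter: if no node matches, the pod stays Pending. A preferred rule is a soft preference with a weight from 1 to 100. In the example below, pods must run in one of two zones and, where possible, on nodes carrying the custom label node-type=compute-optimized. IgnoredDuringExecution means that pods already running are not evicted if node labels change later.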

spec:
  affinity:
    nodeAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
        nodeSelectorTerms:
        - matchExpressions:
          - key: topology.kubernetes.io/zone
            operator: In
            values:
            - eu-central-1a
            - eu-central-1b
      preferredDuringSchedulingIgnoredDuringExecution:
      - weight: 80
        preference:
          matchExpressions:
          - key: node-type
            operator: In
            values:
            - compute-optimized

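Taints and Tolerations

A taint repels pods from a node unless they carry a matching toleration. A toleration only permits scheduling onto the tainted node; it does not attract the pod there, so pair it with node affinity or a nodeSelector when the workload must run on those nodes.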

# Taint a node so that only pods tolerating gpu=true are scheduled on it
kubectl taint nodes gpu-node-1 gpu=true:NoSchedule

# Pod with toleration
spec:
  tolerations:
  - key: "gpu"
    operator: "Equal"
    value: "true"
    effect: "NoSchedule"
  containers:
  - name: ml-training
    image: tensorflow/tensorflow:latest-gpu
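
A toleration only allows a pod onto the tainted node; it does not stop the scheduler from placing that pod elsewhere. One way to pin the workload to GPU nodes is to combine the toleration with a nodeSelector, sketched below. The gpu=true node label is an assumption for illustration: taints are not labels, so the node has to be labeled separately.

```yaml
# Assumes the node was also labeled, e.g.: kubectl label nodes gpu-node-1 gpu=true
spec:
  nodeSelector:
    gpu: "true"            # hypothetical label; restricts the pod to GPU nodes
  tolerations:
  - key: "gpu"             # matches the taint gpu=true:NoSchedule
    operator: "Equal"
    value: "true"
    effect: "NoSchedule"
  containers:
  - name: ml-training
    image: tensorflow/tensorflow:latest-gpu
```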

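Horizontal Pod Autoscaler

The HorizontalPodAutoscaler adjusts the replica count of a workload to keep observed metrics near their targets. A Utilization target is a percentage of the containers' resource requests, so the target Deployment must set requests for every resource used as a metric, and the cluster needs a metrics source such as metrics-server. When several metrics are listed, the HPA computes a replica count for each and uses the largest. The stabilization windows delay scaling decisions so that short spikes and dips do not cause flapping.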

apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: api-hpa
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: api-server
  minReplicas: 2
  maxReplicas: 20
  metrics:
  - type: Resource
    resource:
      name: cpu
      target:
        type: Utilization
        averageUtilization: 70
  - type: Resource
    resource:
      name: memory
      target:
        type: Utilization
        averageUtilization: 80
  behavior:
    scaleUp:
      stabilizationWindowSeconds: 60
    scaleDown:
      stabilizationWindowSeconds: 300
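
For reference, the replica count the HPA aims for follows the formula given in the Kubernetes documentation. The worked line uses hypothetical numbers (4 replicas averaging 90% CPU utilization against the 70% target above).

```latex
\text{desiredReplicas} = \left\lceil \text{currentReplicas} \times \frac{\text{currentMetricValue}}{\text{desiredMetricValue}} \right\rceil

% Worked example (hypothetical): 4 replicas at 90% average CPU, target 70%
\left\lceil 4 \times \frac{90}{70} \right\rceil = \lceil 5.14 \rceil = 6
```

The result is then clamped to the range set by minReplicas and maxReplicas.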

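Pod Disruption Budgets

A PodDisruptionBudget limits how many pods of a service can be down at once during voluntary disruptions such as node drains and cluster upgrades; it does not protect against node failures. Set exactly one of minAvailable or maxUnavailable. Keep minAvailable below the number of replicas you normally run: if the HPA above scales api-server down to its minReplicas of 2, minAvailable: 2 blocks every eviction and node drains stall, whereas maxUnavailable: 1 still lets one pod move at a time.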

apiVersion: policy/v1
kind: PodDisruptionBudget
metadata:
  name: api-pdb
spec:
  minAvailable: 2    # or maxUnavailable: 1
  selector:
    matchLabels:
      app: api-server

Production Best Practices

Always set resource requests and limits
Use pod anti-affinity to spread replicas across nodes
Configure PodDisruptionBudgets for critical services
Set up HPA with both CPU and memory metrics
Use topology spread constraints for zone-aware scheduling
Monitor with kubectl top nodes and kubectl top pods
Right-size containers using VPA recommendations
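
The anti-affinity and topology spread items above can be sketched as a pod template fragment. This is a minimal illustration, assuming the pods carry the label app: api-server; both rules are written as soft preferences so that pods still schedule when an even spread is not possible.

```yaml
# Pod template fragment; assumes the pods are labeled app: api-server
spec:
  topologySpreadConstraints:
  - maxSkew: 1                                 # zones may differ by at most one pod
    topologyKey: topology.kubernetes.io/zone
    whenUnsatisfiable: ScheduleAnyway          # soft constraint; DoNotSchedule makes it hard
    labelSelector:
      matchLabels:
        app: api-server
  affinity:
    podAntiAffinity:
      preferredDuringSchedulingIgnoredDuringExecution:
      - weight: 100
        podAffinityTerm:
          topologyKey: kubernetes.io/hostname  # prefer one replica per node
          labelSelector:
            matchLabels:
              app: api-server
```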
