Kubernetes v1.33: Job's Backoff Limit Per Index Goes GA

Link
2025-05-13 ~1 min read kubernetes.io #kubernetes

⚡ TL;DR

Kubernetes v1.33: Job's Backoff Limit Per Index Goes GA About backoff limit per index How backoff limit per index works Example Learn more Get involved In Kubernetes v1.33, the Backoff Limit Per Index feature reaches general availability (GA). This blog describes the Backoff Limit Per Index feature and its benefits.

📝 Summary

Kubernetes v1.33: Job's Backoff Limit Per Index Goes GA About backoff limit per index How backoff limit per index works Example Learn more Get involved In Kubernetes v1.33, the Backoff Limit Per Index feature reaches general availability (GA). This blog describes the Backoff Limit Per Index feature and its benefits. When you run workloads on Kubernetes, you must consider scenarios where Pod failures can affect the completion of your workloads. Ideally, your workload should tolerate transient failures and continue running. To achieve failure tolerance in a Kubernetes Job, you can set the spec. backoffLimit field. This field specifies the total number of tolerated failures. spec. backoffLimit However, for workloads where every index is considered independent, like embarassingly parallel workloads - the spec. backoffLimit field is often not flexible enough. For example, you may choose to run multiple suites of integration tests by representing each suite as an index within an Indexed Job. In that setup, a fast-failing index (test suite) is likely to consume your entire budget for tolerating Pod failures, and you might not be able to run the other indexes.