@fabioaraujopt

Description

This PR makes the backoffLimit configurable for Velero's Helm job resources, allowing users to adjust retry limits based on their environment needs.

Motivation

The current hardcoded backoffLimit of 3 may be insufficient in environments with:

  • Slow image registries or long image pull times
  • Network latency issues
  • Resource constraints causing temporary pod failures
  • Image pull authentication delays

This is particularly problematic for the velero-upgrade-crds job, which runs as a Helm pre-install/pre-upgrade hook: if the job exceeds its backoff limit, the installation fails.

Changes

  • Added upgradeCRDsJob.backoffLimit configuration (default: 3)
  • Added cleanUpCRDsJob.backoffLimit configuration (default: 3)
  • Added labelNamespaceJob.backoffLimit configuration (default: 3)
  • Updated all job templates to use these configurable values
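
As a rough illustration of the template change, a Job template could read the limit from values as in the sketch below. This is not the chart's actual template: the metadata name is a placeholder and all other Job fields are omitted. Only the `upgradeCRDsJob.backoffLimit` key and the pre-install/pre-upgrade hook come from this PR description.

```yaml
# Hypothetical excerpt of the upgrade-crds Job template (illustrative only).
apiVersion: batch/v1
kind: Job
metadata:
  name: {{ .Release.Name }}-upgrade-crds   # placeholder name, an assumption
  annotations:
    "helm.sh/hook": pre-install,pre-upgrade
spec:
  # Previously a literal 3; now taken from values.yaml (default: 3).
  backoffLimit: {{ .Values.upgradeCRDsJob.backoffLimit }}
```

Reading the value directly, with the default living in values.yaml, also lets an explicit `backoffLimit: 0` pass through. A template that used `| default 3` instead would replace 0 with 3, because Helm's `default` function treats 0 as empty.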

Backward Compatibility

✅ All defaults are set to 3, maintaining current behavior.

Usage

Users can now override the backoff limit in their values:

upgradeCRDsJob:
  backoffLimit: 10

cleanUpCRDsJob:
  backoffLimit: 5

labelNamespaceJob:
  backoffLimit: 5

Testing

  • Helm template rendering works correctly
  • Default values maintain backward compatibility
  • Custom values are properly applied to job specs
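
For the custom-values check, the rendered upgrade-crds Job would be expected to carry the overridden limit. The fragment below is a sketch of the relevant part of the output, assuming `upgradeCRDsJob.backoffLimit: 10` as in the Usage section. It is not captured output, and all other fields are omitted.

```yaml
# Expected shape of the rendered Job spec (illustrative, not captured output).
apiVersion: batch/v1
kind: Job
spec:
  backoffLimit: 10   # from upgradeCRDsJob.backoffLimit in the user's values
```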

Related Issues

Addresses scenarios where users encounter BackoffLimitExceeded errors during Velero installation.

Add configurable backoffLimit for upgrade-crds, cleanup-crds, and label-namespace jobs.
Defaults to 3 to maintain backward compatibility.

This allows users to increase the retry limit for jobs that may fail due to
transient issues like image pull delays, network latency, or resource constraints.

Fixes: vmware-tanzu/helm-charts#<issue-number> (if applicable)

Configuration:
- upgradeCRDsJob.backoffLimit: Controls retries for the pre-install/upgrade CRD job
- cleanUpCRDsJob.backoffLimit: Controls retries for the pre-delete CRD cleanup job
- labelNamespaceJob.backoffLimit: Controls retries for the namespace labeling job

Signed-off-by: Cursor AI Assistant <[email protected]>
@jenting
Collaborator

jenting commented Jan 16, 2026

@fabioaraujopt please help bump chart version, thank you.

@fabioaraujopt
Author

> @fabioaraujopt please help bump chart version, thank you.

fixed
