feat(helm): add api-server rollout restart cronjob by Subham-KRLX · Pull Request #67569 · apache/airflow

Subham-KRLX · 2026-05-26T17:36:13Z

This PR adds Helm chart support for periodic API server rollout restarts on Kubernetes.

Problem:
Long-running uvicorn processes in the API server can accumulate stale resources. Kubernetes provides kubectl rollout restart as a native mechanism, but the Airflow Helm chart has no built in support for scheduling this.

Fix:
Adds a new apiServer.rolloutRestart configuration section with a CronJob, ServiceAccount, Role, and RoleBinding following the exact same pattern as the existing databaseCleanup CronJob.

closes: #61432

Was generative AI tooling used to co-author this PR?

Yes — Claude(For pr description)

jscheffl

We had similar PR recently. I did not like it.

If a long running process has a problem then there should be other means. A rolling restart is for me only a workaround and should not be/get a permanent feature in Helm chart therefore.

See also https://airflow.apache.org/docs/apache-airflow/stable/faq.html#how-to-prevent-api-server-memory-growth

Subham-KRLX · 2026-05-26T17:49:14Z

We had similar PR recently. I did not like it.

If a long running process has a problem then there should be other means. A rolling restart is for me only a workaround and should not be/get a permanent feature in Helm chart therefore.

See also https://airflow.apache.org/docs/apache-airflow/stable/faq.html#how-to-prevent-api-server-memory-growth

I have marked this PR as a draft for now while I look into how we can better support/document the Gunicorn rolling restarts within the Helm chart instead of forcing a full pod rollout I will update the PR if there's a better native Helm integration we can add.

Miretpl · 2026-05-26T18:17:57Z

We had similar PR recently. I did not like it.

If a long running process has a problem then there should be other means. A rolling restart is for me only a workaround and should not be/get a permanent feature in Helm chart therefore.

See also https://airflow.apache.org/docs/apache-airflow/stable/faq.html#how-to-prevent-api-server-memory-growth

+1. Handling of worker restarts should be implemented in the API server itself if it is needed.

Subham-KRLX · 2026-05-27T05:16:19Z

We had similar PR recently. I did not like it.

If a long running process has a problem then there should be other means. A rolling restart is for me only a workaround and should not be/get a permanent feature in Helm chart therefore.

See also https://airflow.apache.org/docs/apache-airflow/stable/faq.html#how-to-prevent-api-server-memory-growth

+1. Handling of worker restarts should be implemented in the API server itself if it is needed.

Agree a rolling restart is a workaround and should not be a permanent chart feature. This PR adds an opt-in CronJob disabled by default so teams can use it as a short term mitigation.

Miretpl · 2026-05-27T19:07:52Z

Agree a rolling restart is a workaround and should not be a permanent chart feature. This PR adds an opt-in CronJob disabled by default so teams can use it as a short term mitigation.

I would be really against adding workaround features to the chart. We have, e.g., PostgreSQL within the chart currently, which was meant only for development purposes, and there are teams which are using it for production. I believe that we should not encourage users to use this particular workaround by implementing it and making it easy to use. If there is a team which will need to do it, they could just create the CronJob definition and apply it to the Kubernetes cluster.

jscheffl · 2026-05-27T19:23:35Z

Agree a rolling restart is a workaround and should not be a permanent chart feature. This PR adds an opt-in CronJob disabled by default so teams can use it as a short term mitigation.

I would be really against adding workaround features to the chart. We have, e.g., PostgreSQL within the chart currently, which was meant only for development purposes, and there are teams which are using it for production. I believe that we should not encourage users to use this particular workaround by implementing it and making it easy to use. If there is a team which will need to do it, they could just create the CronJob definition and apply it to the Kubernetes cluster.

Then - if really somebody needs it - would propose to add this being a Kustomize Layer example we can add to the repo with some docs how to apply but not adding this to main chart.

Subham-KRLX requested review from bugraoz93, hussein-awala, jedcunningham and jscheffl as code owners May 26, 2026 17:36

boring-cyborg Bot added the area:helm-chart Airflow Helm Chart label May 26, 2026

feat(helm): add api-server rollout restart cronjob (apache#61432)

7e10d46

Subham-KRLX force-pushed the feat/api-server-rollout-restart-cronjob branch from 4e4c5a6 to 7e10d46 Compare May 26, 2026 17:40

Subham-KRLX changed the title ~~Feat/api server rollout restart cronjob~~ feat(helm): add api-server rollout restart cronjob May 26, 2026

jscheffl reviewed May 26, 2026

View reviewed changes

Subham-KRLX mentioned this pull request May 26, 2026

Helm chart support for periodic API server rollout restarts on Kubernetes #61432

Open

1 task

Subham-KRLX marked this pull request as draft May 26, 2026 17:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(helm): add api-server rollout restart cronjob#67569

feat(helm): add api-server rollout restart cronjob#67569
Subham-KRLX wants to merge 1 commit into
apache:mainfrom
Subham-KRLX:feat/api-server-rollout-restart-cronjob

Subham-KRLX commented May 26, 2026 •

edited

Loading

Uh oh!

jscheffl left a comment

Uh oh!

Subham-KRLX commented May 26, 2026

Uh oh!

Miretpl commented May 26, 2026 •

edited

Loading

Uh oh!

Subham-KRLX commented May 27, 2026

Uh oh!

Miretpl commented May 27, 2026

Uh oh!

jscheffl commented May 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Subham-KRLX commented May 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Was generative AI tooling used to co-author this PR?

Uh oh!

jscheffl left a comment

Choose a reason for hiding this comment

Uh oh!

Subham-KRLX commented May 26, 2026

Uh oh!

Miretpl commented May 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Subham-KRLX commented May 27, 2026

Uh oh!

Miretpl commented May 27, 2026

Uh oh!

jscheffl commented May 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Subham-KRLX commented May 26, 2026 •

edited

Loading

Miretpl commented May 26, 2026 •

edited

Loading