
Implement RestartAllContainers #134345

Merged
k8s-ci-robot merged 5 commits into kubernetes:master from yuanwang04:restart-pod
Nov 11, 2025

Conversation

@yuanwang04
Contributor

@yuanwang04 yuanwang04 commented Sep 30, 2025

What type of PR is this?

/kind feature

What this PR does / why we need it:

Implement RestartAllContainers KEP: kubernetes/enhancements#5532

Which issue(s) this PR is related to:

KEP: kubernetes/enhancements#5532

Special notes for your reviewer:

Does this PR introduce a user-facing change?

Allows restarting all containers when a source container exits with a matching restart policy rule. This is an alpha feature behind the RestartAllContainersOnContainerExit feature gate.

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

- [KEP]: https://github.com/kubernetes/enhancements/issues/5532
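For context, a pod using this feature might look roughly like the sketch below. This is illustrative only: the rule shape follows the existing container restart rules API, while the `RestartAllContainers` action name and its exact field placement are assumptions based on KEP-5532, not something confirmed by this PR.

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: restart-all-demo
spec:
  containers:
  - name: source
    image: busybox
    restartPolicy: OnFailure
    # Hypothetical rule shape: when this container exits with code 42,
    # restart every container in the pod. Requires the alpha
    # RestartAllContainersOnContainerExit feature gate on the kubelet.
    restartPolicyRules:
    - action: RestartAllContainers
      exitCodes:
        operator: In
        values: [42]
  - name: worker
    image: busybox
```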

@k8s-ci-robot
Contributor

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

@k8s-ci-robot k8s-ci-robot added do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. do-not-merge/release-note-label-needed Indicates that a PR should not merge because it's missing one of the release note labels. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. do-not-merge/needs-kind Indicates a PR lacks a `kind/foo` label and requires one. do-not-merge/needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. needs-priority Indicates a PR lacks a `priority/foo` label and requires one. labels Sep 30, 2025
@k8s-ci-robot k8s-ci-robot added area/kubelet kind/api-change Categorizes issue or PR as related to adding, removing, or otherwise changing an API sig/apps Categorizes an issue or PR as relevant to SIG Apps. sig/node Categorizes an issue or PR as relevant to SIG Node. labels Sep 30, 2025
@k8s-ci-robot k8s-ci-robot removed the do-not-merge/needs-kind Indicates a PR lacks a `kind/foo` label and requires one. label Sep 30, 2025
@github-project-automation github-project-automation Bot moved this to Needs Triage in SIG Apps Sep 30, 2025
@k8s-ci-robot k8s-ci-robot removed do-not-merge/needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. labels Sep 30, 2025
@SergeyKanzhelev
Member

/cc

marking for review

@k8s-ci-robot k8s-ci-robot added size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. and removed size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. labels Oct 6, 2025
@yuanwang04 yuanwang04 force-pushed the restart-pod branch 3 times, most recently from 78eefbd to 3bb33af on October 8, 2025 at 00:21
@thockin
Member

thockin commented Nov 7, 2025

/approve for api

@yuanwang04
Contributor Author

/retest-required

// discovery and inference logic.
var AllFeatures = []nodedeclaredfeatures.Feature{
inplacepodresize.Feature,
restartallcontainers.Feature,
Member

/cc @pravk03
Another customer!

Member

@tallclair tallclair left a comment


Is there any way that the pod could transition out of a Failed / Succeeded state with this feature? I'm having a hard time reasoning through the code paths to convince myself that that's not the case.

Comment thread pkg/kubelet/container/helpers.go Outdated
Comment thread pkg/kubelet/container/helpers.go Outdated
Comment thread pkg/kubelet/kuberuntime/kuberuntime_manager.go Outdated
Comment on lines +1042 to +1043
// Kill and remove containers in reverse order. Source containers (which exited and triggered
// RestartAllContainers) are removed last.
Member

Why does the order matter? Why are source containers separated out?

Member

You resolved this, but didn't answer the question?

@tallclair
Member

Before this change, was it possible for a pod to transition from the Running to Pending phase?

@yuanwang04
Contributor Author

@tallclair it's impossible for a pod to transition out of a Failed / Succeeded state. This is enforced by kubelet.generateAPIPodStatus function: https://github.com/kubernetes/kubernetes/blob/master/pkg/kubelet/kubelet_pods.go#L1871-L1885

Before this change, was it possible for a pod to transition from the Running to Pending phase?

I think there is no check / validation forbidding it. It may happen if a sidecar gets restarted before the regular containers start running. However, I do think the pod should be kept in the Running state during the RestartAllContainers action. I did some refactoring and added unit tests to ensure the pod status remains Running during this restart.
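The sticky-terminal-phase invariant referenced above (enforced by kubelet's generateAPIPodStatus) can be sketched as follows. This is a simplified illustration of the invariant, not the actual kubelet code; the type and function names here are invented for the example.

```go
package main

import "fmt"

type PodPhase string

const (
	PodPending   PodPhase = "Pending"
	PodRunning   PodPhase = "Running"
	PodSucceeded PodPhase = "Succeeded"
	PodFailed    PodPhase = "Failed"
)

// guardPhase illustrates the invariant: once a pod has reached a terminal
// phase (Failed or Succeeded), the status generator keeps reporting that
// phase no matter what the runtime now claims.
func guardPhase(oldPhase, newPhase PodPhase) PodPhase {
	if oldPhase == PodFailed || oldPhase == PodSucceeded {
		return oldPhase
	}
	return newPhase
}

func main() {
	fmt.Println(guardPhase(PodFailed, PodRunning))    // Failed: terminal phase is sticky
	fmt.Println(guardPhase(PodRunning, PodSucceeded)) // Succeeded: non-terminal phases may advance
}
```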

Comment thread pkg/kubelet/kuberuntime/kuberuntime_manager.go Outdated
Comment thread pkg/kubelet/kuberuntime/kuberuntime_manager.go
Comment thread pkg/kubelet/kuberuntime/kuberuntime_manager.go Outdated
Comment thread pkg/kubelet/kuberuntime/kuberuntime_manager.go Outdated
Comment thread pkg/kubelet/kuberuntime/kuberuntime_manager.go
Comment thread pkg/kubelet/status/generate.go
Comment thread pkg/kubelet/kuberuntime/kuberuntime_manager.go Outdated
return
}
}
if err := m.removeContainer(ctx, containerInfo.containerID.ID); err != nil {
Member

removeContainer also removes the container logs. Is that desired? Normally logs aren't removed when a container restarts.

return
}
}
if err := m.removeContainer(ctx, containerInfo.containerID.ID); err != nil {
Member

removeContainer triggers topology manager to remove the topology assignment for the container. In the case of pod-level resource managers, I think we want to make sure that this doesn't reset the pod-level topology assignment.

/cc @KevinTMtz @ffromani

Contributor

Thanks for the ping. Will read the KEP and the code and incorporate in the pod-level resource manager integration work.

Contributor

Naive question, because I'm not into this KEP's details: do we run admission again then? Removing the topology assignment MAY mean that a pod which used to run with alignment guarantees loses them on restart, which seems undesirable. If this is real, we should probably add at least an admission check: if the topology manager is enforcing, a pod with the RestartAllContainers flag enabled (at a glance this seems to surface at the pod spec level?) should not be admitted.

Member

RestartAllContainers does not seem to affect the overall pod; only its status might become Pending, which is not a concern (the CPU and memory managers' reconcileState treats a pending pod as active). As for clearing the topology manager's state, it does not appear to produce any secondary effects (the CPU/memory managers are the effective source of truth), and it seems right to clean up the container ID mappings.

As an action item, RestartAllContainers should add a test verifying that the restart flow does not affect the resource managers' functionality (the overall CPU and memory manager features, not anything specific to PodLevelResourceManagers).

I created the following document with my findings and the discussion that I had with @yuanwang04, @ndixita and @SergeyKanzhelev: PodLevelResourceManagers & RestartAllContainers.

Any work that has to be done to address any situation created between PodLevelResourceManagers and other features can be followed in #136481.
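The concern in this thread — removing one container's topology entry without disturbing sibling or pod-scoped state — can be illustrated with a small sketch. The names and data layout below are invented for illustration; the real topology manager's internal state is richer (bitmask-based NUMA assignments).

```go
package main

import "fmt"

// affinity stands in for a NUMA affinity hint.
type affinity string

// topologyState sketches per-pod, per-container hint bookkeeping:
// pod UID -> container name -> affinity.
type topologyState struct {
	hints map[string]map[string]affinity
}

// removeContainer drops one container's entry while leaving the pod's
// other container entries (and any pod-scoped bookkeeping keyed by the
// pod UID) untouched. Removing from an unknown pod is a no-op.
func (s *topologyState) removeContainer(podUID, container string) {
	if containers, ok := s.hints[podUID]; ok {
		delete(containers, container)
	}
}

func main() {
	s := &topologyState{hints: map[string]map[string]affinity{
		"pod-1": {"source": "node0", "worker": "node0"},
	}}
	s.removeContainer("pod-1", "source")
	fmt.Println(len(s.hints["pod-1"])) // 1: the sibling container's hint survives
}
```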

Contributor

It might be worth adding the reasoning behind removing in-memory topology manager state when a container is removed as a note in the RestartAllContainers KEP, @yuanwang04.

Contributor

And just to make sure we are not missing any critical details: @ffromani @swatisehgal, by any chance are you aware of why the in-memory topology manager state is retained for containers after the topology manager has finished making its affinity decisions? We could very well have cleared the memory used to store that state.

Member

Adding a reference to a comment relevant to this discussion from the Pod Level Resource Managers KEP PR: kubernetes/enhancements#5775 (comment).

@k8s-ci-robot
Contributor

@tallclair: GitHub didn't allow me to request PR reviews from the following users: KevinTMtz.

Note that only kubernetes members and repo collaborators can review this PR, and authors cannot review their own PRs.

Details

In response to this:

removeContainer triggers topology manager to remove the topology assignment for the container. In the case of pod-level resource managers, I think we want to make sure that this doesn't reset the pod-level topology assignment.

/cc @KevinTMtz @ffromani

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

logger.V(3).Info("Removing container before pod restarts", "containerName", cName, "containerID", containerInfo.containerID, "pod", klog.KObj(pod))
removeContainerResult := kubecontainer.NewSyncResult(kubecontainer.RemoveContainer, cName)
result.AddSyncResult(removeContainerResult)
if containerInfo.kill {
Member

I noticed that killContainersWithSyncResult kills the containers in parallel. Would that be preferred here?
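For reference, the fan-out pattern being asked about (killing containers concurrently, in the style of killContainersWithSyncResult) looks roughly like the helper below. This is a generic sketch, not the kubelet's actual code; `killAll` and its signature are invented for the example.

```go
package main

import (
	"fmt"
	"sync"
)

// killAll runs the kill function for every container concurrently and
// collects per-container errors, mirroring a parallel fan-out (sketch only).
func killAll(names []string, kill func(string) error) []error {
	var wg sync.WaitGroup
	errs := make([]error, len(names))
	for i, name := range names {
		wg.Add(1)
		go func(i int, name string) {
			defer wg.Done()
			errs[i] = kill(name) // each goroutine writes only its own slot
		}(i, name)
	}
	wg.Wait()
	return errs
}

func main() {
	errs := killAll([]string{"sidecar", "main"}, func(name string) error {
		return nil // a real implementation would call the runtime here
	})
	fmt.Println(len(errs)) // 2: one result slot per container
}
```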

@SergeyKanzhelev
Member

/hold
/milestone v1.35

(as per https://kubernetes.slack.com/archives/C2C40FMNF/p1762622388020479)

Let's close on final comments and merge. @yuanwang04 please unhold at will whenever there is an lgtm

Member

@tallclair tallclair left a comment


/lgtm
/approve

Comment thread test/e2e_node/restart_all_containers_test.go Outdated
@k8s-ci-robot
Contributor

LGTM label has been added.

Details: Git tree hash: 12ea3a573e06c086e46571468ab1867d0c9a23a5

@k8s-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: SergeyKanzhelev, tallclair, thockin, yuanwang04

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details: Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@yuanwang04
Contributor Author

/unhold

@tallclair
Member

/lgtm

@k8s-ci-robot
Contributor

LGTM label has been added.

Details: Git tree hash: 06922a6b32b3b49288f8e3cfc497591609c98401

@k8s-ci-robot
Contributor

k8s-ci-robot commented Nov 11, 2025

@yuanwang04: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name: pull-kubernetes-node-e2e-containerd-alpha-features
Commit: 8305cbb (link)
Required: false
Rerun command: /test pull-kubernetes-node-e2e-containerd-alpha-features

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

@yuanwang04
Contributor Author

/retest-required

@kannon92
Contributor

#135277

Looks like this is failing in the serial lanes.


Labels

api-review Categorizes an issue or PR as actively needing an API review. approved Indicates a PR has been approved by an approver from all required OWNERS files. area/kubelet area/test cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. kind/api-change Categorizes issue or PR as related to adding, removing, or otherwise changing an API lgtm "Looks good to me", indicates that a PR is ready to be merged. priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. release-note Denotes a PR that will be considered when it comes time to generate release notes. sig/apps Categorizes an issue or PR as relevant to SIG Apps. sig/node Categorizes an issue or PR as relevant to SIG Node. sig/testing Categorizes an issue or PR as relevant to SIG Testing. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. triage/accepted Indicates an issue or PR is ready to be actively worked on.

Projects

Status: API review completed, 1.35
Archived in project
