api: document force-delete effect of TerminationGracePeriodSeconds=0 #112564

pohly · 2022-09-19T11:56:08Z

What type of PR is this?

/kind documentation

What this PR does / why we need it:

A pod with Spec.TerminationGracePeriodSeconds set to zero will get force-deleted even when the client doing the delete doesn't explicitly ask for it. That follows from

kubernetes/pkg/registry/core/pod/strategy.go, lines 151 to 171 in 0f582f7:

    // user has specified a value
    if options.GracePeriodSeconds != nil {
    	period = *options.GracePeriodSeconds
    } else {
    	// use the default value if set, or deletes the pod immediately (0)
    	if pod.Spec.TerminationGracePeriodSeconds != nil {
    		period = *pod.Spec.TerminationGracePeriodSeconds
    	}
    }
    // if the pod is not scheduled, delete immediately
    if len(pod.Spec.NodeName) == 0 {
    	period = 0
    }
    // if the pod is already terminated, delete immediately
    if pod.Status.Phase == api.PodFailed || pod.Status.Phase == api.PodSucceeded {
    	period = 0
    }

    if period < 0 {
    	period = 1
    }

choosing TerminationGracePeriodSeconds as the value for GracePeriodSeconds when nothing is chosen explicitly.

This effect was not obvious from the documentation of the field and might be something that users should avoid.
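The selection logic quoted above can be modeled in isolation to make the effect visible. The following is only a sketch: `podInfo`, `effectiveGracePeriod`, and `int64Ptr` are hypothetical names introduced for illustration, not part of Kubernetes, and the initial value of `period` is assumed to be zero based on the "deletes the pod immediately (0)" comment in the snippet.

```go
package main

import "fmt"

// podInfo is a hypothetical stand-in for the few Pod fields the quoted code reads.
type podInfo struct {
	TerminationGracePeriodSeconds *int64 // models pod.Spec.TerminationGracePeriodSeconds
	NodeName                      string // models pod.Spec.NodeName
	Terminated                    bool   // true if the phase is PodFailed or PodSucceeded
}

// effectiveGracePeriod mirrors the quoted selection logic. requested models
// options.GracePeriodSeconds; nil means the client did not choose a value.
func effectiveGracePeriod(requested *int64, pod podInfo) int64 {
	var period int64 // assumption: starts at 0, as the quoted comment suggests
	if requested != nil {
		period = *requested
	} else if pod.TerminationGracePeriodSeconds != nil {
		period = *pod.TerminationGracePeriodSeconds
	}
	// Unscheduled or already terminated pods are deleted immediately.
	if pod.NodeName == "" || pod.Terminated {
		period = 0
	}
	if period < 0 {
		period = 1
	}
	return period
}

func int64Ptr(v int64) *int64 { return &v }

func main() {
	pod := podInfo{TerminationGracePeriodSeconds: int64Ptr(0), NodeName: "node-1"}

	// A plain delete (no grace period requested) ends up with period 0 ...
	fmt.Println(effectiveGracePeriod(nil, pod)) // prints 0
	// ... which is the same result as an explicit force delete.
	fmt.Println(effectiveGracePeriod(int64Ptr(0), pod)) // prints 0

	// With a spec value of 1, a plain delete keeps a one-second grace period.
	pod.TerminationGracePeriodSeconds = int64Ptr(1)
	fmt.Println(effectiveGracePeriod(nil, pod)) // prints 1
}
```

In this model a plain delete and an explicit zero-second delete are indistinguishable once the pod spec carries a zero, which is the behavior the PR wants documented.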

Does this PR introduce a user-facing change?

NONE

A pod with Spec.TerminationGracePeriodSeconds set to zero will get force-deleted even when the client doing the delete doesn't explicitly ask for it. That follows from https://github.com/kubernetes/kubernetes/blob/0f582f7c3f504e807550310d00f130cb5c18c0c3/pkg/registry/core/pod/strategy.go#L151-L171 choosing TerminationGracePeriodSeconds as the value for GracePeriodSeconds when nothing is chosen explicitly. This effect was not obvious from the documentation of the field and might be something that users should avoid.

pohly · 2022-09-19T11:57:33Z

pkg/apis/core/types.go

-	// signal (no opportunity to shut down).
+	// signal (no opportunity to shut down) and also turns all Pod deletions for the pod into
+	// force-deletes (apiserver removes the Pod object immediately). Forced deletions can be potentially
+	// disruptive for some workloads and their Pods.


"Forced deletions ..." is a copy of the advice from https://kubernetes.io/docs/concepts/workloads/pods/pod-lifecycle/#pod-termination-forced.

It's debatable whether it should get repeated here.

Even the comment above your change is not exactly true:

> The value zero indicates stop immediately via the kill signal (no opportunity to shut down).

The doc reads:

> Setting the grace period to 0 forcibly and immediately deletes the Pod from the API server. If the pod was still running on a node, that forcible deletion triggers the kubelet to begin immediate cleanup.
> [...]
> When a force deletion is performed, the API server does not wait for confirmation from the kubelet that the Pod has been terminated on the node it was running on. It removes the Pod in the API immediately so a new Pod can be created with the same name. On the node, Pods that are set to terminate immediately will still be given a small grace period before being force killed.

So here's my suggestion:

The value zero indicates that the pod should be forced-deleted immediately without waiting for confirmation
that it has been terminated. On the node, pods that are set to terminate immediately will still be given a
small grace period before being force killed. Forced deletions can be potentially disruptive for some workloads
and their Pods.

I think that explains both what happens on the apiserver and the node, and also gives the warning you're looking for, wdyt?

So "The value zero indicates stop immediately via the kill signal (no opportunity to shut down)." (from the original documentation) is really just plain wrong?

> On the node, pods that are set to terminate immediately will still be given a small grace period before being force killed.

Where does this small grace period come from? Can it be configured?

I don't know anything but what I've seen in the doc you linked :-) I doubt it can be configured. But I think that's almost irrelevant, what matters most is that it's removed from the apiserver without confirmation.

What problem are you trying to solve? It sounds like you need some policy to prevent terminationGracePeriod from being set to 0.

> I don't know anything but what I've seen in the doc you linked :-)

Those docs also might be wrong... If we want the API documentation to be correct, we probably need to determine what the implementation in kubelet actually does.

> What problem are you trying to solve?

I wanted test pods to be killed immediately by kubelet but without also enabling a force delete because I wanted to go through the normal pod deletion process. From the documentation it sounded like TerminationGracePeriodSeconds: 0 would do that ("The value zero indicates stop immediately via the kill signal"), but then I discovered that a Delete(..., metav1.DeleteOptions{}) (i.e. a normal delete) acted like a force-delete (Delete(..., metav1.DeleteOptions{GracePeriodSeconds: &zero})).

This was surprising. I want to avoid that surprise for others, either:

- by fixing the documentation to describe accurately what TerminationGracePeriodSeconds does, or
- by making the implementation behave as implied by the documentation (control kill period in kubelet, without the force-delete side effect).

> If someone wants it to be one, why wouldn't they set it to one?

They might want it to be zero, without realizing the full implications. Fixing the documentation would help here because in practice, a delay of one second is close enough - users just need to know it.

@klueska do you know how quickly kubelet kills pods when TerminationGracePeriodSeconds: 0? Is it immediately (because the Pod object is gone) or after a certain grace period?

It would be counter-intuitive if pods got killed more slowly for TerminationGracePeriodSeconds: 0 than for TerminationGracePeriodSeconds: 1.

k8s-ci-robot · 2022-09-19T12:00:11Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: pohly
Once this PR has been reviewed and has the lgtm label, please assign liggitt for approval by writing /assign @liggitt in a comment. For more information see: The Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-triage-robot · 2022-09-19T12:18:31Z

This PR may require API review.

If so, when the changes are ready, complete the pre-review checklist and request an API review.

Status of requested reviews is tracked in the API Review project.

leilajal · 2022-09-20T20:05:54Z

/assign @apelisse
/triage accepted

pohly · 2022-09-20T20:25:47Z

FWIW, I also wouldn't mind changing the behavior so that if Spec.TerminationGracePeriodSeconds is used as default and happens to be zero, then one would be used instead of zero. I'd find that less surprising, but it probably has to be considered an API change.
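The alternative behavior described in this comment could look roughly like the sketch below. It is an illustration only, not the change proposed in #102025: `proposedGracePeriod` is a hypothetical function, and the overrides for unscheduled or already terminated pods and the clamp for negative values from the existing code are left out to keep the focus on the defaulting step.

```go
package main

import "fmt"

// proposedGracePeriod sketches the suggested defaulting rule: an explicitly
// requested grace period is honored as is (including zero), but a zero that
// comes from the pod spec default is raised to one second.
func proposedGracePeriod(requested, specDefault *int64) int64 {
	if requested != nil {
		return *requested // the client asked for this value, so keep it
	}
	if specDefault != nil {
		if *specDefault == 0 {
			return 1 // avoid an implicit force delete
		}
		return *specDefault
	}
	return 0 // assumption: no value anywhere still means immediate deletion
}

func main() {
	zero, thirty := int64(0), int64(30)

	fmt.Println(proposedGracePeriod(nil, &zero))   // prints 1: default zero becomes one
	fmt.Println(proposedGracePeriod(&zero, &zero)) // prints 0: explicit force delete still works
	fmt.Println(proposedGracePeriod(nil, &thirty)) // prints 30: other defaults are unchanged
}
```

Under this rule a force delete would only happen when the client asks for it, at the cost of changing what a zero in the pod spec means for existing users.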

apelisse · 2022-09-21T17:39:26Z

> FWIW, I also wouldn't mind changing the behavior so that if Spec.TerminationGracePeriodSeconds is used as default and happens to be zero, then one would be used instead of zero. I'd find that less surprising, but it probably has to be considered an API change.

If someone wants it to be one, why wouldn't they set it to one?

aojea · 2022-09-22T06:35:59Z

/assign @smarterclayton

he was trying to clarify this too #102025

it seems the bot closed it

pohly · 2022-09-22T09:36:28Z

Thanks @aojea. That other PR shows that the current behavior is unintentional and should be changed.

Let's put this doc update on hold and instead see whether we can finish the implementation update.

/hold

k8s-triage-robot · 2022-12-21T10:14:13Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle stale
Mark this issue or PR as rotten with /lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

pohly · 2022-12-21T16:58:52Z

No progress on #102025 but as I think that that's the right solution I'll close this one here.

/close

k8s-ci-robot · 2022-12-21T16:58:58Z

@pohly: Closed this PR.

k8s-ci-robot added the area/code-generation label Sep 19, 2022

k8s-ci-robot requested review from caesarxuchao and liggitt September 19, 2022 12:00

k8s-ci-robot assigned apelisse Sep 20, 2022

k8s-ci-robot added the triage/accepted label and removed the needs-triage label Sep 20, 2022

k8s-ci-robot assigned smarterclayton Sep 22, 2022

pohly mentioned this pull request Sep 22, 2022

Prevent pods from defaulting to zero second grace periods #102025

Closed

k8s-ci-robot added the do-not-merge/hold label Sep 22, 2022

k8s-ci-robot added the lifecycle/stale label Dec 21, 2022

k8s-ci-robot closed this Dec 21, 2022
