runtime/v2: cleanup dead shim before delete bundle #4538
Conversation
Build succeeded.
Force-pushed 2e52f91 to 6a82da4
Build succeeded.
fuweid: ping @crosbymichael @AkihiroSuda @cpuguy83 @mikebrow PTAL~
estesp
left a comment
LGTM
mikebrow
left a comment
LGTM
thaJeztah: @fuweid I see you tagged ttrpc v1.0.2 (https://github.com/containerd/ttrpc/releases/tag/v1.0.2); can you update the vendor.conf to use that tag?
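For context, the requested change is roughly a one-line update — a sketch, assuming containerd's `import-path revision` vendor.conf format (the previously pinned revision, whatever commit it was, gets replaced by the tag):

```
# vendor.conf: pin ttrpc to the v1.0.2 release tag instead of a raw commit
github.com/containerd/ttrpc v1.0.2
```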
Force-pushed 6a82da4 to 4b05d03
Build succeeded.
fuweid: @thaJeztah sorry for the late update. I updated the vendor.conf; PTAL. Thanks!
estesp
left a comment
LGTM
This commit seems to be causing a regression: #4769
The shim delete action needs bundle information to clean up resources
created by the shim. If the dead-shim cleanup runs after the bundle has
been deleted, some of those resources may leak.

The ttrpc client's UserOnCloseWait() can ensure that resources are
cleaned up before the bundle is deleted, synchronizing task deletion
with dead-shim cleanup. This might slow down task deletion, but it
guarantees cleanup and avoids the EBUSY case during umount. For
example, a sandbox container such as Kata or Firecracker might have
mount points over the rootfs. If containerd handles task deletion and
dead-shim cleanup in parallel, task deletion can hit EBUSY during
umount and fail to clean up the bundle, which makes things worse.

Also update cleanupAfterDeadshim to make sure it is only called after
the shim has disconnected. In some cases the shim's create fails for
some reason even though runc-create has already moved runc-init into
the ready state. If containerd doesn't call shim delete, the runc-init
process leaks and holds the cgroup, which leaves the pod stuck
terminating :(.

Signed-off-by: Wei Fu [email protected]
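For readers unfamiliar with the mechanism: below is a minimal sketch of the ordering this commit enforces, assuming the ttrpc v1.0.2 API (`ttrpc.WithOnClose` and `Client.UserOnCloseWait`). The `connectShim`/`deleteTask` wrappers and the `deleteBundle` callback are illustrative stand-ins, not containerd's actual runtime/v2 code.

```go
// Package shimclient sketches the cleanup-before-bundle-delete ordering.
package shimclient

import (
	"context"
	"log"
	"net"

	"github.com/containerd/ttrpc"
)

// connectShim registers the dead-shim cleanup as the ttrpc on-close
// callback, so it can only run once the shim connection is actually gone.
func connectShim(conn net.Conn, cleanupAfterDeadShim func()) *ttrpc.Client {
	return ttrpc.NewClient(conn, ttrpc.WithOnClose(cleanupAfterDeadShim))
}

// deleteTask serializes dead-shim cleanup with bundle deletion instead of
// letting the two race.
func deleteTask(ctx context.Context, client *ttrpc.Client, deleteBundle func() error) error {
	// ... the shim "delete" RPC would happen here, while the bundle exists ...

	// Block until the on-close callback (dead-shim cleanup) has finished.
	// This is what avoids EBUSY: mounts over the rootfs are torn down
	// before we try to remove the bundle directory.
	if err := client.UserOnCloseWait(ctx); err != nil {
		log.Printf("waiting for shim cleanup: %v", err)
	}

	// Only now is it safe to delete the bundle.
	return deleteBundle()
}
```

The trade-off named in the commit message falls out of the sketch: UserOnCloseWait blocks task deletion until the cleanup callback completes, trading a little latency for a guarantee that nothing still holds mounts under the bundle when it is removed.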