Pod stuck in ContainerCreating: Unit ...slice already exists #102676
Labels: kind/bug, kind/regression, priority/critical-urgent, release-blocker, sig/node, triage/accepted
What happened:
Pods get stuck in ContainerCreating with errors like "Unit ...slice already exists"
(when using cgroupDriver: systemd)
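To check whether a node is hitting this, it can help to scan kubelet logs or pod events for the message. The following is only a rough sketch: the function name, the regular expression, and the sample lines are illustrative assumptions, not taken from the kubelet or runc sources, and the exact wording of the message may differ between versions.

```python
import re
from typing import Iterable, List

# Assumed message shape: "Unit <name>.slice already exists".
# Adjust the pattern if the log format on your nodes differs.
UNIT_EXISTS_RE = re.compile(r"Unit (\S+\.slice) already exists")


def find_conflicting_slices(lines: Iterable[str]) -> List[str]:
    """Return distinct slice names from 'already exists' errors, in first-seen order."""
    seen: List[str] = []
    for line in lines:
        for name in UNIT_EXISTS_RE.findall(line):
            if name not in seen:
                seen.append(name)
    return seen


if __name__ == "__main__":
    # Hypothetical log lines, made up for illustration only.
    sample = [
        "Warning FailedCreatePodSandBox: Unit kubepods-podabc.slice already exists.",
        "Normal Scheduled: Successfully assigned default/web to node-1",
        "Warning FailedCreatePodSandBox: Unit kubepods-podabc.slice already exists.",
    ]
    for name in find_conflicting_slices(sample):
        print(name)
```

Feeding it the output of `journalctl -u kubelet` or `kubectl get events -A` (one line per entry) gives a quick list of the slices involved.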
What you expected to happen:
No such errors
How to reproduce it (as minimally and precisely as possible):
I don't know for sure.
Anything else we need to know?:
This was introduced in k8s in #102147 and backported to 1.21 in #102196, so it needs to be fixed in both master and release-1.21.
RH BZ: https://bugzilla.redhat.com/show_bug.cgi?id=1965545
The cause is a regression in runc/libcontainer: opencontainers/runc#2996
The fix is in opencontainers/runc#2997, which should make its way into runc 1.0.0 GA.
Currently there is a DNM PR to bump runc to the version with the fix: #102508, but we have decided (#102250 (comment)) to wait until the release.