Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[release/1.7] cri: implement RuntimeConfig rpc #11346

Open
wants to merge 6 commits into
base: release/1.7
Choose a base branch
from

Conversation

marquiz
Copy link
Contributor

@marquiz marquiz commented Feb 6, 2025

This PR backports #8722 to v1.7. It implements the CRI runtime part of Kubernetes KEP-4033 Discover cgroup driver from CRI. The motivation for the backport is to expedite graduation of the Kubernetes KEP to GA.

It consists of multiple commits:

  1. Bump cri-api to v0.28.9
  2. cri: implement RuntimeConfig rpc (Implement RuntimeConfig CRI call #8722)
  3. remove uses of github.com/runc/libcontainer/cgroups
    Cherry-picks relevant parts from remove uses of github.com/runc/libcontainer/cgroups #9045 (7d0ab4f)
  4. Move to use github.com/containerd/log
    Cherry-picks relevant parts from Use github.com/containerd/log #9086 (508aa3a)
  5. cri: support cgroup driver detection for legacy config types
    Adapts the mainline PR to work with v1.7 config
  6. cri: take runc options from the api directory
    Ditch usage of deprecated package, makes linter happy

@k8s-ci-robot
Copy link

Hi @marquiz. Thanks for your PR.

I'm waiting for a containerd member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@marquiz
Copy link
Contributor Author

marquiz commented Feb 6, 2025

/assign @mikebrow
/hold

@dosubot dosubot bot added area/cri Container Runtime Interface (CRI) kind/feature labels Feb 6, 2025
@marquiz marquiz force-pushed the backport-cgroup-driver branch 2 times, most recently from f06d9b1 to 347d02c Compare February 6, 2025 17:46
marquiz and others added 6 commits February 6, 2025 19:49
Contains RuntimeConfig rpc for communicating the cgroup driver to the
CRI client.

Signed-off-by: Markus Lehtonen <[email protected]>
The rpc only reports one field, i.e. the cgroup driver, to kubelet.
Containerd determines the effective cgroup driver by looking at all
runtime handlers, starting from the default runtime handler (the rest in
alphabetical order), and returning the cgroup driver setting of the
first runtime handler that supports one. If no runtime handler supports
cgroup driver (i.e. has a config option for it) containerd falls back to
auto-detection, returning systemd if systemd is running and cgroupfs
otherwise.

This patch implements the CRI server side of Kubernetes KEP-4033:
https://github.com/kubernetes/enhancements/tree/master/keps/sig-node/4033-group-driver-detection-over-cri

Signed-off-by: Markus Lehtonen <[email protected]>
(cherry picked from commit ed47d6b)
runc considers libcontainer to be "unstable" (not for external use),
so we try not to use it. Commit ed47d6b
brought back the dependency on other parts of libcontainer, but looks to
be only depending on a single utility, which in itself was borrowed from
github.com/coreos/go-systemd to not introduce CGO code in the same package.

This patch copies the version from github.com/coreos/go-systemd (adding
proper attribution, although the function is pretty trivial).

runc is in process of moving the libcontainer/user package to an external
module, which means we can remove the dependency on libcontainer entirely
in the near future. There is one more use of `libcontainer` in our vendor
tree; it looks like CDI is depending on one utility (devices.DeviceFromPath);
https://github.com/containerd/containerd/blob/a943033a8b6d8dd94a171c6e9528ef413b5b22f3/vendor/github.com/container-orchestrated-devices/container-device-interface/pkg/cdi/container-edits_unix.go#L38

We should remove the dependency on that utility, and add a CI check to
prevent bringing it back.

Signed-off-by: Sebastiaan van Stijn <[email protected]>
(cherry picked from commit 7d0ab4f)
Signed-off-by: Markus Lehtonen <[email protected]>
Add github.com/containerd/log to go.mod

Signed-off-by: Derek McGowan <[email protected]>
(cherry picked from commit 508aa3a)
Signed-off-by: Markus Lehtonen <[email protected]>
Stop using the deprecated "runtime/v2/runc/options" package, makes
linter happy.

Signed-off-by: Markus Lehtonen <[email protected]>
@marquiz marquiz force-pushed the backport-cgroup-driver branch from 347d02c to c82fd82 Compare February 6, 2025 17:49
@marquiz
Copy link
Contributor Author

marquiz commented Feb 7, 2025

Worked for me
/unhold

/cc @AkihiroSuda @dmcgowan

@AkihiroSuda
Copy link
Member

The motivation for the backport is to expedite graduation of the Kubernetes KEP to GA.

How does this backport relate to the graduation of the KEP?

@marquiz
Copy link
Contributor Author

marquiz commented Feb 7, 2025

How does this backport relate to the graduation of the KEP?

The GA graduation criteria in the KEP (in it's current form) states that the --cgroup-driver flag from the kubelet will be removed. This makes it mandatory to use a CRI runtime that supports the RuntimeConfig rpc (to tell the cgroup driver to use). Backporting would "unbreak" users of v1.7.

@AkihiroSuda
Copy link
Member

Thanks, this should be backported to v1.6 too?

@marquiz
Copy link
Contributor Author

marquiz commented Feb 7, 2025

Thanks, this should be backported to v1.6 too?

If you ask me, yes :) I can do that if you think it would be appropriate

EDIT: I took a quick look and there's a big hurdle. v1.6 depends on cri-api v0.25 and also supports cri-api v1alpha2 which is not present in cri-api v0.28 (in which version the rpc is introduced.

@mikebrow
Copy link
Member

mikebrow commented Feb 7, 2025

FYI #11294 changed 1.7 from active to LTS 3months faster than I expected.. Will need to do an exception to the rule to get this important change in.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Status: Needs Triage
Development

Successfully merging this pull request may close these issues.

6 participants