KEP for flagz page for Kubernetes Components by richabanker · Pull Request #4831 · kubernetes/enhancements

richabanker · 2024-09-06T23:49:33Z

One-line PR description: Add a KEP for flagz page in Kubernetes components.

Issue link: Flagz for Kubernetes Components #4828

dgrisonnet · 2024-09-19T16:40:51Z

/assign

dgrisonnet · 2024-09-25T19:32:12Z

+
+### Data Format and versioning
+
+Initially, the flagz page will exclusively support a plain text format for responses. We will implement these endpoints using versioned URLs, like /v1/flagz, to ensure future compatibility. This versioning strategy allows us to seamlessly introduce structured data formats (e.g., JSON) in the future, through a distinct endpoint such as /v2/flagz, without disrupting existing implementations.


Wouldn't it be better to have the endpoints in the form of /flagz/v1? Flagz is the subdomain that we want to version, if we were to put the version first, that would be confusing with the component version as it would then be at the top-level.

Yes you're right, I got that the other way round. Fixed it now, thanks!

One thing I was wondering since then is whether going with a query parameter might not be better. Something like:

/flagz?version=1 /flags?version=2

wdyt?

This was also raised in the statusz KEP PR. I am ok with either, whichever seems the best to indicate the version for the endpoint.

Updated the KEP to use query params for specifying the version.

dgrisonnet · 2024-09-25T19:37:23Z

+that might indicate a serious problem?
+-->
+
+- apiserver_request_duration_seconds metric that will tell us if there's a spike in request latency of kube-apiserver that might indicate that the flagz endpoint is interfering with the component's core functionality or causing delays in processing other requests


IIRC you won't get this metric for free, you'll have to make sure that the handler for the flagz endpoint is instrumented.

Umm unsure about how flagz handler relates with this metric which I thought is enabled by default? The idea was to use this metric to monitor general load of requests that apiserver handles and use that as a signal to check if flagz is consuming all system resources causing delays in apiserver's capability to serve other resource requests..

though yeah I guess that wouldnt be a clean signal to detect issues with the flagz endpoint in a deterministic way.. Do you suggest introducing a new metric that keeps track of request latency for just /flagz requests ?

I forgot to submit my comment on your PR, but you already dealt with that concern of mine in your POC: kubernetes/kubernetes#127581 (comment)

richabanker · 2024-09-26T20:48:43Z

cc @johnbelamaric for PRR. Hi John, could you please review this KEP for prod-readiness whenever you get a chance? Thanks a lot!

richabanker · 2024-09-26T20:52:07Z

/assign @johnbelamaric

whoops meant to add as assignee

johnbelamaric · 2024-09-26T21:00:38Z

/assign

thockin · 2024-10-07T19:41:30Z

+
+1. **No sensitive data exposed**
+
+    We will ensure that no sensitive data is exposed through flagz and that access to this endpoint is gated by using system-monitoring group.


how? Do flag definitions allow us to express the idea that the flag is "sensitive" ?

I dont think there currently is support for marking flags like that. So we would need to introduce a way to express this.
cc @cjcullen who I just reached out to for his thoughts on the same. Also @liggitt

Do we need to push that up to pflag? One could argue that falgs should NEVER have sensitive information, since they are visible via ps.

Do we need to push that up to pflag?

Or.. maybe we could create a separate set of "sensitive flags" and while printing out the values in the flagz handler, redact the values for those flags?

One could argue that falgs should NEVER have sensitive information, since they are visible via ps

That's indeed true. But if we still feel that exposing this info in an endpoint will be easier for attackers to get hold of, we can impose some restrictions on what data to expose in the endpoint.

Do we have any examples of these sorts of flags?

Perhaps the TLS related flags.. (also waiting for someone from sig-auth to help answer that..)

I'm not aware any flags in long-lived components (kube-apiserver, kubelet, scheduler, controller-manager, etc) that contain sensitive information directly. For things like TLS or credentials, they always point at files which contain the data.

There are shorter-lived invocations that take flags containing sensitive information (like kubeadm init with --certificate-key or --token), and components built that bind client-go user flags (like kubectl) can specify a token credential on the command-line using --token. Both of those have the option to consume those values from a file as well.

pflag does have the ability to annotate flags, which we apparently use like this to mark some flags as "classified":

// AddSecretAnnotation add secret flag to Annotation. func (f FlagInfo) AddSecretAnnotation(flags *pflag.FlagSet) FlagInfo { flags.SetAnnotation(f.LongName, "classified", []string{"true"}) return f }

and then filter out here when building an annotation to include in API objects when the client chooses to record the change cause in an annotation (--record)

parseFunc := func(flag *pflag.Flag, value string) error { flags = flags + " --" + flag.Name if set, ok := flag.Annotations["classified"]; !ok || len(set) == 0 { flags = flags + "=" + value } else { flags = flags + "=CLASSIFIED" } return nil }

Great! Thanks for pointing out the pflag annotation feature. That's perfect for displaying the non-CLASSIFIED flags.

dgrisonnet · 2024-10-07T20:58:04Z

+
+#### Request
+* Method: **GET** 
+* Endpoint: **v1/flagz**


this should be updated once we've agreed on the versioning format we want to follow

mattbailey

shadow prod readiness review

Just some nits/clerical (checkboxes), otherwise PRR looks good.

johnbelamaric

Some minor points on PRR, but looks OK to me.

johnbelamaric · 2024-10-08T22:19:23Z

PRR looks good, I will wait for sig approval to use the magic word

dgrisonnet · 2024-10-09T16:48:19Z

/lgtm
/approve

johnbelamaric · 2024-10-09T17:30:48Z

/approve

k8s-ci-robot · 2024-10-09T17:30:57Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: dgrisonnet, johnbelamaric, richabanker

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details

Needs approval from an approver in each of these files:

~~keps/prod-readiness/OWNERS~~ [johnbelamaric]
~~keps/sig-instrumentation/OWNERS~~ [dgrisonnet,johnbelamaric]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Sep 6, 2024

k8s-ci-robot requested review from logicalhan and mrbobbytables September 6, 2024 23:49

richabanker mentioned this pull request Sep 6, 2024

Flagz for Kubernetes Components #4828

Open

30 tasks

k8s-ci-robot assigned dgrisonnet Sep 19, 2024

richabanker mentioned this pull request Sep 24, 2024

Add flagz endpoint for apiserver kubernetes/kubernetes#127581

Merged

dgrisonnet reviewed Sep 25, 2024

View reviewed changes

mrbobbytables removed their request for review September 25, 2024 19:40

k8s-ci-robot assigned johnbelamaric Sep 26, 2024

thockin reviewed Oct 7, 2024

View reviewed changes

dgrisonnet reviewed Oct 7, 2024

View reviewed changes

Comment thread keps/sig-instrumentation/4828-component-flagz/README.md Outdated

mattbailey reviewed Oct 8, 2024

View reviewed changes

Comment thread keps/sig-instrumentation/4828-component-flagz/README.md Outdated

Comment thread keps/sig-instrumentation/4828-component-flagz/README.md

Comment thread keps/sig-instrumentation/4828-component-flagz/README.md

johnbelamaric reviewed Oct 8, 2024

View reviewed changes

KEP for flagz page for Kubernetes Components

47dbde4

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Oct 9, 2024

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Oct 9, 2024

k8s-ci-robot merged commit 6bdbc9a into kubernetes:master Oct 9, 2024

k8s-ci-robot added this to the v1.32 milestone Oct 9, 2024


		### Data Format and versioning

		Initially, the flagz page will exclusively support a plain text format for responses. We will implement these endpoints using versioned URLs, like /v1/flagz, to ensure future compatibility. This versioning strategy allows us to seamlessly introduce structured data formats (e.g., JSON) in the future, through a distinct endpoint such as /v2/flagz, without disrupting existing implementations.


		1. No sensitive data exposed

		We will ensure that no sensitive data is exposed through flagz and that access to this endpoint is gated by using system-monitoring group.

Conversation

richabanker commented Sep 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dgrisonnet commented Sep 19, 2024

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

richabanker commented Sep 26, 2024

Uh oh!

richabanker commented Sep 26, 2024

Uh oh!

johnbelamaric commented Sep 26, 2024

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

richabanker Oct 8, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

liggitt Oct 8, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

mattbailey left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

johnbelamaric left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

johnbelamaric commented Oct 8, 2024

Uh oh!

dgrisonnet commented Oct 9, 2024

Uh oh!

johnbelamaric commented Oct 9, 2024

Uh oh!

k8s-ci-robot commented Oct 9, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

richabanker commented Sep 6, 2024 •

edited

Loading

richabanker Oct 8, 2024 •

edited

Loading

liggitt Oct 8, 2024 •

edited

Loading