Refactor container stats collection #7469
Conversation
Hi @danielye11. Thanks for your PR. I'm waiting for a containerd member to verify that this patch is reasonable to test. If it is, they should reply with the verification command. Once the patch is verified, the new status will be reflected by the label. I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
pkg/cri/store/container/container.go
c := s.containers[id]
c.Stats = newContainerStats
if c.Stats != nil {
Why do we need to check if c.Stats is nil here? Shouldn't it be initialized previously?
I believe this checks for the case where the old stats are nil here.
Also get a memory panic if I take it out
Hmm, where do we actually assign c.Stats? Is it ever assigned?
As far as I can see, we are only returning *containerstorestats.ContainerStats in generatedContainerMetrics, but we are not actually storing the stats on the container object. Is that right? If that's the case, then c.Stats will always be nil...
Yep, good point -- actual caching of the metrics should be fixed now.
CI is failing as all of your commits are missing valid
Your commit messages and PR description don't really provide much context for the changes you'd like to contribute to containerd. Can you describe more of what you're trying to achieve? Is there an associated issue for this PR with more information?

Updated description of PR.
Left a few small comments, but looking really good. Maybe time to move this out of draft?
Change how ListPodSandboxStats and ListContainerStats retrieve metrics. Previously, metrics were fetched from the cgroup and returned directly via a runtime struct (ListContainerStats). With the need for Prometheus exporting, we will now collect these metrics and cache them, fetching metrics from the cache and exporting them to whichever metric type we need. Cache the metrics added in the stats object of the container and sandbox, and revamp getUsageNanoCores to no longer cache metrics inside that function. Add and update tests based on these changes.

Signed-off-by: Daniel Ye <[email protected]>
samuelkarp left a comment
@danielye11, thanks for opening this PR! I took a first pass at it and left a few comments, but also I have two more overall comments:
- I think I'm missing the caching and expiration/eviction behavior here. I do see where stats are stored (in the container.Store), but it looks like they're never updated and also never read.
- It would be helpful for me as a reviewer to break this up into at least two commits: one where the structural changes (types, refactoring) are done and a second with the functional changes (new data + caching behavior).
ContainerCPUStats
ContainerMemoryStats
ContainerFileSystemStats
}
What's the rationale for struct embedding here?
These are all stats whose additional metrics will eventually be returned in Prometheus format. The rationale is to allow adding the additional caching logic in the future; since stats and metrics will be set in the same place (reading the same Linux cgroup file), we can read from the container stats store or sandbox stats store.
Maybe my question wasn't clear. What's the reason for embedding versus using fields?
// embedding
type ContainerStats struct {
ContainerCPUStats
ContainerMemoryStats
ContainerFileSystemStats
}
// fields
type ContainerStats struct {
CPU ContainerCPUStats
Memory ContainerMemoryStats
FileSystem ContainerFileSystemStats
}

// Timestamp in nanoseconds at which the information was collected. Must be > 0.
Timestamp int64
Each of these three structs (ContainerCPUStats, ContainerMemoryStats, and ContainerFileSystemStats) contains a Timestamp field. Since all three are then embedded into ContainerStats, only one of the three Timestamp fields will be visible as ContainerStats.Timestamp; the other two will be shadowed. This can be a source of confusing behavior; if we do need to have this embedding pattern, please move Timestamp out of the individual structs and up into ContainerStats directly.
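A minimal reproduction of the Timestamp concern, with simplified stand-ins for the PR's structs. Note that in Go, duplicate field names in structs embedded at the same depth actually make the promoted selector ambiguous (a compile error) rather than silently shadowed, which is exactly the kind of confusing behavior the review warns about:

```go
package main

import "fmt"

// Simplified stand-ins for the PR's stats structs, each with its own
// Timestamp field at the same embedding depth.
type ContainerCPUStats struct{ Timestamp int64 }
type ContainerMemoryStats struct{ Timestamp int64 }

type ContainerStats struct {
	ContainerCPUStats
	ContainerMemoryStats
}

func main() {
	cs := ContainerStats{}
	cs.ContainerCPUStats.Timestamp = 1 // must qualify with the embedded type name
	cs.ContainerMemoryStats.Timestamp = 2
	// cs.Timestamp = 3 // would not compile: "ambiguous selector cs.Timestamp"
	fmt.Println(cs.ContainerCPUStats.Timestamp, cs.ContainerMemoryStats.Timestamp)
}
```

Moving a single Timestamp field up into ContainerStats, as suggested, removes the ambiguity entirely.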
Timestamp int64
// Cumulative CPU usage (sum across all cores) since object creation.
UsageCoreNanoSeconds uint64
// Total CPU usage (sum of all cores) averaged over the sample window.
What's the sample window?
type ContainerCPUStats struct {
	// Timestamp in nanoseconds at which the information was collected. Must be > 0.
	Timestamp int64
Existing code using the ContainerStats.Timestamp field sees a time.Time. This code changes it to an int64. Is there a specific reason why? Generally time.Time would be preferred as a standard type for representing time.
CRI fields use int64; I figured it would be easier than converting.
Since this is an internal struct that needs to be converted on the way out anyway, I think it'd make more sense to centralize the conversion and maintain the standard time.Time here.
WorkingSetBytes uint64
// Available memory for use. This is defined as the memory limit - workingSetBytes.
AvailableBytes uint64
// Total memory in use. This includes all memory regardless of when it was accessed.
I'm not sure I can parse "when it was accessed". Does this mean you're tracking the total number of bytes over the lifetime of a container (so maybe you mean the max usage)?
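The AvailableBytes relationship stated in the field comments above can be sketched directly. availableBytes is an illustrative helper under that stated definition, not containerd code:

```go
package main

import "fmt"

// availableBytes follows the field comment: available memory is the memory
// limit minus the working set. The clamp guards the unsigned subtraction.
func availableBytes(limitBytes, workingSetBytes uint64) uint64 {
	if workingSetBytes > limitBytes {
		return 0 // clamp rather than underflow
	}
	return limitBytes - workingSetBytes
}

func main() {
	fmt.Println(availableBytes(1024, 256))
}
```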
case *v1.Metrics:
	if metrics.Memory != nil && metrics.Memory.Usage != nil {
		workingSetBytes := getWorkingSet(metrics.Memory)

func (c *criService) setContainerStats(containerID string, isSandbox bool, cs *containerstorestats.ContainerStats) error {
This function appears to do nothing if there is already a stats.ContainerStats attached to the container or sandbox. Is that intentional?
}

cpuStats, err := c.cpuContainerStats(meta.ID, false /* isSandbox */, s, protobuf.FromTimestamp(stats.Timestamp))
c.setContainerStats(meta.ID, false, &cs)
It's odd to store a pointer to a struct, and then continue to mutate the struct (lines 115 and 121). It looks like this breaks the intent of the mutex in the sandbox.Store and container.Store, opening up the possibility of a data race.
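One way to address the race concern above is to store a value copy under the store's lock, so later mutation of the caller's struct cannot bypass the mutex. Store and ContainerStats here are simplified stand-ins, not containerd's actual types:

```go
package main

import (
	"fmt"
	"sync"
)

// ContainerStats is a simplified stand-in for the cached stats struct.
type ContainerStats struct{ UsageNanoCores uint64 }

// Store guards its cache with a mutex and holds values, not pointers, so the
// lock fully protects the cached data.
type Store struct {
	mu    sync.Mutex
	stats map[string]ContainerStats
}

func (s *Store) SetStats(id string, cs ContainerStats) {
	s.mu.Lock()
	defer s.mu.Unlock()
	s.stats[id] = cs // copies the struct under the lock
}

func (s *Store) GetStats(id string) ContainerStats {
	s.mu.Lock()
	defer s.mu.Unlock()
	return s.stats[id]
}

func main() {
	st := &Store{stats: map[string]ContainerStats{}}
	cs := ContainerStats{UsageNanoCores: 1}
	st.SetStats("c1", cs)
	cs.UsageNanoCores = 99 // later mutation does not affect the cached copy
	fmt.Println(st.GetStats("c1").UsageNanoCores)
}
```

If a pointer were stored instead, the caller's later writes would race with any reader that fetched the same pointer from the store, which is the problem the reviewer points out.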
if isSandbox {
	sandbox, err := c.sandboxStore.Get(containerID)
	if err != nil {
		return fmt.Errorf("failed to get sandbox container: %s: %w", containerID, err)
	}
case *v2.Metrics:
	if metrics.Memory != nil {
		workingSetBytes := getWorkingSetV2(metrics.Memory)
oldStats = sandbox.Stats
It doesn't look like setContainerStats is ever called with isSandbox set to true. Can you remove the parameter and the codepath?
}

func (c *criService) cpuContainerStats(ID string, isSandbox bool, stats interface{}, timestamp time.Time) (*runtime.CpuUsage, error) {
func (c *criService) generatedCPUContainerStats(ID string, isSandbox bool, stats interface{}, timestamp time.Time) (*containerstorestats.ContainerCPUStats, error) {
The ID and isSandbox parameters appear unused. Can you remove them?
return &cs, nil
}

func (c *criService) generatedMemoryContainerStats(ID string, stats interface{}, timestamp time.Time) (*containerstorestats.ContainerMemoryStats, error) {
The ID parameter appears unused. Can you remove it?
Thanks for the review; I will address the comments and split up the PR. Just some more context: for this PR we don't specifically need more advanced caching logic. There's not a big regression from adding more structs to the container store, and it will help a lot for adding Prometheus exporting for extra pod and container metrics (which will be put in the container stats and sandbox stats stores).
This PR is stale because it has been open 90 days with no activity. This PR will be closed in 7 days unless new comments are made or the stale label is removed.

This PR was closed because it has been stalled for 7 days with no activity.
Update the logic for container stats collection, now storing the stats on the container object. This is in preparation for the CRI changes described in KEP-2371, which add a new gRPC call that returns unstructured Prometheus metrics. These will not be returned via a runtime struct; instead they will be added to stats. The additional unstructured metrics described in the above KEP will also be modified and retrieved from the container stats cache.
Directly tied to this issue