-
Notifications
You must be signed in to change notification settings - Fork 40.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Vertical Pod Autoscaler to autoscaling/v2beta1 #63797
Conversation
adf2996
to
b396dd5
Compare
Is this ready for review? |
314992a
to
fc4a756
Compare
@thockin Yes, the api is ready for review. |
/lgtm from the sig-autoscaling point of view. @DirectXMan12 - last chance to veto :). |
hold on, I had some comments that never got sent |
pkg/apis/autoscaling/types.go
Outdated
|
||
var ( | ||
// Configured indicates whether the VPA recommender was able to load a valid VPA spec. | ||
Configured VerticalPodAutoscalerConditionType = "Configured" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
do we need this if we have validation instead?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Right. Can be removed.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done.
pkg/apis/autoscaling/types.go
Outdated
// UpdateModeAuto means that autoscaler assigns resources on pod creation | ||
// and additionally can update them during the lifetime of the pod, | ||
// including evicting / rescheduling the pod. | ||
UpdateModeAuto UpdateMode = "Auto" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This should be "AutoRecreate", so that we can add "AutoInPlace" when we have in-place updates, and not be confusing (recreate is still valuable for things that need to do some resource-based config on creation, like environments that pre-set a heap size limit).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ok.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good point. I suggest we actually may want both. I think many users would want to configure the best (least intrusive) actuation method available. Once we have in-place updates, "auto" will automatically get promoted to in-place without rewriting configs. WDYT?
pkg/apis/autoscaling/types.go
Outdated
Name string | ||
// Whether autoscaler is enabled for the container. Defaults to "On". | ||
// +optional | ||
Mode ContainerScalingMode |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
do we want to just make this enabled
and be a boolean?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There might be some other options added in the future. I would stay with enum.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we name the options in a less "this should be a bool" way ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I can imagine potentially more interesting policies in future, if users need it, e.g. "OnlyScaleUp". However I can't think of better names for the current options..
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"Auto" and "Off" ?
pkg/apis/autoscaling/types.go
Outdated
} | ||
|
||
// PodResourcePolicy controls how autoscaler computes the recommended resources | ||
// for containers belonging to the pod. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
// PodResourcePolicy controls how autoscaler computes the recommended resources
// or containers belonging to the pod. There must either be an entry for every named pod,
// or a wildcard entry.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done.
pkg/apis/autoscaling/types.go
Outdated
) | ||
|
||
// RecommendedPodResources is the recommendation of resources computed by | ||
// autoscaler. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
... It contains a recommendation for each container in the pod
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done.
(just some minor nits) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I only reviewed the API parts. A few relatively minor comments overall.
pkg/apis/autoscaling/types.go
Outdated
type VerticalPodAutoscalerStatus struct { | ||
// The time when the status was last refreshed. | ||
// +optional | ||
LastUpdateTime metav1.Time |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you explain the need for this? It's not present in any other APIs
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
agree... heartbeat-style fields tend to cause scale issues (write traffic persists even in steady state)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The purpose was to let the user know about the freshness of the recommendation. But I understand your concern and I'm ok with deleting it.
|
||
// VerticalPodAutoscalerCondition describes the state of | ||
// a VerticalPodAutoscaler at a certain point. | ||
type VerticalPodAutoscalerCondition struct { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we could consider some common definition of Condition. Several APIs define their own FooCondition struct but they are all almost identical. The strong typing of the string doesn't protect against illegal values (Go does not have enums) anyway.
@kubernetes/api-reviewers
@bgrant0607 @smarterclayton
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ACK. However in this step I would rather keep vpa consistent with hpa and perhaps consider having both (and others) changed together in a larger rewrite?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ACK
pkg/apis/autoscaling/types.go
Outdated
) | ||
|
||
// PodUpdatePolicy describes the rules on how changes are applied to the pods. | ||
type PodUpdatePolicy struct { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is a little invasive, but Can we organize these files in Go-introduction order? I find it makes reviews much easier.
E.g.
type VPA {
Spec VPASpec
Status VPAStatus
}
type VPASpec {
Foo FooType
Bar BarType
}
type FooType {
Whatever EnumString
}
Type EnumString
Type BarType
type VPAStatus {
Qux QuxType
}
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done.
pkg/apis/autoscaling/types.go
Outdated
Name string | ||
// Whether autoscaler is enabled for the container. Defaults to "On". | ||
// +optional | ||
Mode ContainerScalingMode |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we name the options in a less "this should be a bool" way ?
e94d35d
to
39c25a1
Compare
pkg/apis/autoscaling/types.go
Outdated
|
||
// Describes the rules on how changes are applied to the pods. | ||
// +optional | ||
UpdatePolicy PodUpdatePolicy |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If this is optional, what is the default? Document.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done.
pkg/apis/autoscaling/types.go
Outdated
|
||
// Controls how the autoscaler computes recommended resources. | ||
// +optional | ||
ResourcePolicy PodResourcePolicy |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Document default behavior.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done.
pkg/apis/autoscaling/types.go
Outdated
type PodUpdatePolicy struct { | ||
// Controls when autoscaler applies changes to the pod resoures. | ||
// +optional | ||
UpdateMode UpdateMode |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why is this optional, when the struct itself is optional above? Put another way, is there a plausible future where the user specifies VP.spec.updatePolicy
and provides a struct that does not include mode?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Also, all truly optional fields should be nil-able, (pointers or slices or maps)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
PodUpdatePolicy may in future consist of multiple optional settings. It might be possible to specify PodUpdatePolicy with some other settings but leaving the mode unset (i.e. use the default).
Reg. nil-able fields: I've changed all optional fields to pointers, although I excluded "enum" string fields, because they support a natural empty value (an empty string) and I think this approach is widely used across other APIs elsewhere. WDYT?
pkg/apis/autoscaling/types.go
Outdated
|
||
// Specification of the behavior of the autoscaler. | ||
// More info: https://git.k8s.io/community/contributors/devel/api-conventions.md#spec-and-status. | ||
// +optional |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
is spec really optional? What is the behavior if it is not provided? Document.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Removed optional.
Not sure why other APIs make it optional.
pkg/apis/autoscaling/types.go
Outdated
// named container and optionally a single wildcard entry with Name = "*", which | ||
// handles all containers that don't have individual policies. | ||
type PodResourcePolicy struct { | ||
// Per-container resource policies. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
default behavior if omitted?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Added a comment to ResourcePolicy PodResourcePolicy. In general this policy allows to provide constraints on how recommendations are computed for each container. If some container is not mentioned, there are no constraints on it.
|
||
var ( | ||
// RecommendationProvided indicates whether the VPA recommender was able to calculate a recommendation. | ||
RecommendationProvided VerticalPodAutoscalerConditionType = "RecommendationProvided" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Shorten to "Recommended" ? Maybe..
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not sure about this one.. I would prefer it to be as self-descriptive as possible.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ACK
|
||
// VerticalPodAutoscalerCondition describes the state of | ||
// a VerticalPodAutoscaler at a certain point. | ||
type VerticalPodAutoscalerCondition struct { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ACK
func validateVerticalPodAutoscalerSpec(spec *autoscaling.VerticalPodAutoscalerSpec, fldPath *field.Path) field.ErrorList { | ||
allErrs := field.ErrorList{} | ||
if spec != nil { | ||
if spec.Selector != nil { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
usually I see this flipped. Required test then valid test. Minor.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done.
type VerticalPodAutoscalerSpec struct { | ||
// A label query that determines the set of pods controlled by the Autoscaler. | ||
// More info: https://kubernetes.io/docs/concepts/overview/working-with-objects/labels/#label-selectors | ||
Selector *metav1.LabelSelector `json:"selector,omitempty" protobuf:"bytes,1,opt,name=selector"` |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this optional or not? Doesn't say +optional, but is a pointer and proto says 'opt' ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Fixed the annotation. Technically a nil selector is legal (matches nothing), however we disallowed it during validation. As far as I could see this approach was used elsewhere.
type PodResourcePolicy struct { | ||
// Per-container resource policies. | ||
// +optional | ||
ContainerPolicies []ContainerResourcePolicy `json:"containerPolicies,omitempty" protobuf:"bytes,1,rep,name=containerPolicies"` |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we need patchStrategy
and patchMergeKey
on all lists.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done.
Adding approved-for-milestone as a sig-lead. |
[MILESTONENOTIFIER] Milestone Pull Request: Up-to-date for process Pull Request Labels
|
/retest |
/hold Taking discussion to Slack, because I'm concerned about the state of this PR. |
If @DirectXMan12 's objections were resolved on slack or elsewhere, please let me know in a comment and remove the hold. Thanks! Also, the release team needs to know: (a) what is the risk associated with this patch? (b) are there docs for this? |
@jberkus @DirectXMan12 is on vacation, however his objections seem to be resolved. @DirectXMan12 was one of the reviewer of the original proposal for VPA kubernetes/community#338. The risk associated with this PR is minimal. It is the api that is handled by an opt-in, independent component. Documentation for the feature is provided currently here: We will expand the main K8S doc soon. |
/remove-hold Removing hold per discussion with SIG. |
sigh. /hold cancel |
/test all [submit-queue is verifying that this PR is safe to merge] |
Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions here. |
Automatic merge from submit-queue (batch tested with PRs 64836, 64890). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Revert: Add Vertical Pod Autoscaler to autoscaling/v2beta1 Reverts #63797 per [discussion](https://kubernetes.slack.com/archives/C09R1LV8S/p1528400528000615) with @jberkus and @mwielgus The scope of the follow-ups required in #64286 was not well understood when the API PR was merged, and we are now past code freeze for 1.11 This can be reopened against 1.12 ```release-note NONE ```
Automatic merge from submit-queue (batch tested with PRs 64836, 64890). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Revert: Add Vertical Pod Autoscaler to autoscaling/v2beta1 Reverts kubernetes/kubernetes#63797 per [discussion](https://kubernetes.slack.com/archives/C09R1LV8S/p1528400528000615) with @jberkus and @mwielgus The scope of the follow-ups required in kubernetes/kubernetes#64286 was not well understood when the API PR was merged, and we are now past code freeze for 1.11 This can be reopened against 1.12 ```release-note NONE ``` Kubernetes-commit: 169df7434155cbbc22f1532cba8e0a9588e29ad8
Automatic merge from submit-queue (batch tested with PRs 64836, 64890). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>. Revert: Add Vertical Pod Autoscaler to autoscaling/v2beta1 Reverts kubernetes/kubernetes#63797 per [discussion](https://kubernetes.slack.com/archives/C09R1LV8S/p1528400528000615) with @jberkus and @mwielgus The scope of the follow-ups required in kubernetes/kubernetes#64286 was not well understood when the API PR was merged, and we are now past code freeze for 1.11 This can be reopened against 1.12 ```release-note NONE ``` Kubernetes-commit: 169df7434155cbbc22f1532cba8e0a9588e29ad8
What this PR does / why we need it:
Adds Vertical Pod Autoscaler (https://github.com/kubernetes/community/blob/master/contributors/design-proposals/autoscaling/vertical-pod-autoscaler.md) to the autoscaling API (which currently has the Horizontal Pod Autoscaler).
This is needed for the Vertical Pod Autoscaler beta.
Special notes for your reviewer:
/cc @thockin @mwielgus @DirectXMan12
FYI. changes that add pkg/registry/autoscaling/verticalpodautoscaler/... will follow.
Release note: