Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Proposal: HPA using external metrics #1801

Merged
merged 1 commit into from
Feb 22, 2018

Conversation

MaciekPytel
Copy link
Contributor

@MaciekPytel MaciekPytel commented Feb 14, 2018

This proposal describes an extension to the HPA v2 API that would allow autoscaling based on metrics coming from outside kubernetes cluster. This is not possible using existing API.

Common use-cases are autoscaling pods using a hosted service (ex. hosted message queue) or autoscaling in systems that have (so far) only partly moved to Kubernetes.

#1802 describes External Metrics API used internally by HPA to retrieve metrics.

/sig-autoscaling

@k8s-ci-robot k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. labels Feb 14, 2018
@k8s-github-robot k8s-github-robot added kind/design Categorizes issue or PR as related to design. sig/autoscaling Categorizes an issue or PR as relevant to SIG Autoscaling. labels Feb 14, 2018
@MaciekPytel
Copy link
Contributor Author

@kubernetes/sig-autoscaling-proposals
@kubernetes/api-reviewers

@DirectXMan12 - we discussed this during sig-autoscaling, can you take a look?

Copy link
Contributor

@DirectXMan12 DirectXMan12 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

couple of nits inline.

You need to discuss more concretely how this maps to the external metrics API proposed in #1802.

I like the targetAverageValue change. No matter what happens with the rest of this PR, I've become convinced that we should land that, because it helps with certain aspects of queue-based scaling (it's insufficient on its own, IMO, but together with a "traffic intensity" metric it should work).

That being said: this needs to have a more serious discussion of alternatives, because there are some. I'm not saying that this isn't the solution (it very well might be the easiest to work with), but we need documentation on what alternatives we considered, and why they were rejected.

I'm going to play the Devil's Advocate here for a moment.

What, fundamentally, prevents you from associating your metrics with some cluster-level object, or the namespace object itself? Couldn't we add the metricSelector field to the pods and object metrics (to allow further drilling down of a particular metric), and then say "if you have external services that produce metrics, use relabeling to attach metadata to those services so that they're associate with some object, such as the "default" namespace?

What are the downsides to that approach? What does this approach buy/lose us in terms of reducing user complexity and confusion? What does this approach buy/lose us in terms of operator complexity and confusion?

        ResourceMetricSourceType MetricSourceType = "Resource"
        // ExternalMetricSourceType is a global metric coming from
        // outside of Kubernetes. It allows autoscaling based on information
        // coming from components running outside of cluster
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this is a bit of a documentation issue, but I think "coming from outside the cluster" is the wrong term here. I think it's better to stress that this is for the case when the metrics aren't immediately associatable with objects inside the cluster.

For instance, I could run a DBaaS that runs outside the cluster, but still provide metrics to track usage per namespace, and thus not have to use the external type.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good point. Updated all comments as per your suggestion.

MetricSelector metav1.LabelSelector `json:"metricSelector" protobuf:"bytes,2,name=metricSelector"`

        // CurrentValue is the current value of the metric (as a quantity)
        CurrentValue resource.Quantity `json:"currentValue" protobuf:"bytes,3,name=currentValue"`
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This also needs currentAverageValue *resource.Quantity as an optional field.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added.

@k8s-ci-robot k8s-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Feb 16, 2018
@DirectXMan12
Copy link
Contributor

holding due to needing some further some further consideration of the alternatives

@MaciekPytel MaciekPytel force-pushed the hpa_external branch 2 times, most recently from 9bbb929 to 4e93010 Compare February 20, 2018 16:28
@MaciekPytel
Copy link
Contributor Author

@DirectXMan12 I've added a section discussing considered alternatives. Can you take a look?

This proposal details an extension to the HPA that would
allow autoscaling based on metrics coming from outside
kubernetes cluster.
@DirectXMan12
Copy link
Contributor

DirectXMan12 commented Feb 21, 2018

So, after a bit discussion with @MaciekPytel offline, I'm inclined to accept this proposal, with a caveat. I'm still hesitant about potential confusion on the part of end-users (when do I use external vs object), and not entirely convinced that the better solution isn't association in the metrics adapters, so I think that we'll need to make super certain to collect user feedback and re-evaluate next release how we want to move forward with these changes.

I believe we still need an @kubernetes/api-reviewers on the sibling proposal, so we should hold off merge until that gets the ok.

@eghobo
Copy link

eghobo commented Feb 22, 2018

@MaciekPytel sorry for stupid question (I honestly read full proposal ;)). But how this different from existing API (e.g. in our use-case we wrote api-adapter for external system and use ObjectMetricSourceType in HPA)?

@mwielgus
Copy link
Contributor

@eghobo With ObjectMetricSource users have to bind their metrics to some kubernetes object (thus possibly modify their definition in the monitoring solution). With external metrics we allow to use unmodified metrics (like pubsub queue lenghth, external load balancer qps, etc) in HPA.

@mwielgus
Copy link
Contributor

/lgtm

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Feb 22, 2018
@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: MaciekPytel, mwielgus

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Feb 22, 2018
@MaciekPytel
Copy link
Contributor Author

@eghobo
Short answer: If you can identify a metric you want to use using Object, than use it. Otherwise you need External.

The goal is to allow autoscaling based on metrics that have no inherent relationship to any k8s object. For example if your application use a hosted service and you want to autoscale based on its metrics, which contain no reference to your kubernetes object.

The design of Object metric (even the name :) ) is suited to use metrics inherently associated with some k8s object. Target field in ObjectMetricSource is useless for identifying metrics that have no such association. As discussed in Using Object metric source for external metrics section there are some workarounds that allow using Object for autoscaling such metrics. However, regardless which workaround you choose, at the end of the day you will need to explicitly define the metric you want to use (ie. specify the same set of information you give to External metric). You may write adapter so that using a specific Target gives you some arbitrary metric of course, but that's stretching the Object metric API. Also that means everyone will also need to somehow do that in their adapter. Instead the idea is to just create a standard way of accessing such metrics.

@DirectXMan12
Copy link
Contributor

/hold

for api review

@mwielgus
Copy link
Contributor

#60096 is approved by API reviewer (@thockin).

@DirectXMan12 Can we unhold?

@DirectXMan12
Copy link
Contributor

/unhold

yep

@k8s-ci-robot k8s-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Feb 22, 2018
@DirectXMan12
Copy link
Contributor

(although I kind-of wanted approval on the external metrics API, too)

@k8s-ci-robot k8s-ci-robot merged commit a20dbc3 into kubernetes:master Feb 22, 2018
@eghobo
Copy link

eghobo commented Feb 23, 2018

@MaciekPytel thx for details, we are using adapter to access external metrics now. Sorry, but I am very skeptical about standard way of accessing such metrics. E.g. we have three monitoring systems Graphite/JMX/Prometheus in place now and there is no way to find standard way for them without writing adapter ):

k8s-github-robot pushed a commit to kubernetes/kubernetes that referenced this pull request Feb 23, 2018
Automatic merge from submit-queue (batch tested with PRs 60208, 60084, 60183, 59713, 60096). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Add external metric type to HPA API

**What this PR does / why we need it**:
Add external metric type to HPA API proposed in xref kubernetes/community#1801

**Release note**:
```release-note
Allows HorizontalPodAutoscaler to use global metrics not associated with any Kubernetes object (for example metrics from a hoster service running outside of Kubernetes cluster).
```
k8s-publishing-bot added a commit to kubernetes/api that referenced this pull request Feb 27, 2018
Automatic merge from submit-queue (batch tested with PRs 60208, 60084, 60183, 59713, 60096). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Add external metric type to HPA API

**What this PR does / why we need it**:
Add external metric type to HPA API proposed in xref kubernetes/community#1801

**Release note**:
```release-note
Allows HorizontalPodAutoscaler to use global metrics not associated with any Kubernetes object (for example metrics from a hoster service running outside of Kubernetes cluster).
```

Kubernetes-commit: b22b7853459db50a2c0619faa398f9fa1e98228e
k8s-publishing-bot added a commit to kubernetes/api that referenced this pull request Feb 27, 2018
Automatic merge from submit-queue (batch tested with PRs 60208, 60084, 60183, 59713, 60096). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Add external metric type to HPA API

**What this PR does / why we need it**:
Add external metric type to HPA API proposed in xref kubernetes/community#1801

**Release note**:
```release-note
Allows HorizontalPodAutoscaler to use global metrics not associated with any Kubernetes object (for example metrics from a hoster service running outside of Kubernetes cluster).
```

Kubernetes-commit: b22b7853459db50a2c0619faa398f9fa1e98228e
sttts pushed a commit to sttts/api that referenced this pull request Mar 1, 2018
Automatic merge from submit-queue (batch tested with PRs 60208, 60084, 60183, 59713, 60096). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Add external metric type to HPA API

**What this PR does / why we need it**:
Add external metric type to HPA API proposed in xref kubernetes/community#1801

**Release note**:
```release-note
Allows HorizontalPodAutoscaler to use global metrics not associated with any Kubernetes object (for example metrics from a hoster service running outside of Kubernetes cluster).
```

Kubernetes-commit: b22b7853459db50a2c0619faa398f9fa1e98228e
sttts pushed a commit to sttts/api that referenced this pull request Mar 1, 2018
Automatic merge from submit-queue (batch tested with PRs 60208, 60084, 60183, 59713, 60096). If you want to cherry-pick this change to another branch, please follow the instructions <a href="https://github.com/kubernetes/community/blob/master/contributors/devel/cherry-picks.md">here</a>.

Add external metric type to HPA API

**What this PR does / why we need it**:
Add external metric type to HPA API proposed in xref kubernetes/community#1801

**Release note**:
```release-note
Allows HorizontalPodAutoscaler to use global metrics not associated with any Kubernetes object (for example metrics from a hoster service running outside of Kubernetes cluster).
```

Kubernetes-commit: b22b7853459db50a2c0619faa398f9fa1e98228e
MadhavJivrajani pushed a commit to MadhavJivrajani/community that referenced this pull request Nov 30, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. kind/design Categorizes issue or PR as related to design. lgtm "Looks good to me", indicates that a PR is ready to be merged. sig/autoscaling Categorizes an issue or PR as relevant to SIG Autoscaling. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants