Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[RuntimeClassScheduling] test(runtimeclass): add e2e tests for runtimeclass scheduling - Part3 #81915

Conversation

draveness
Copy link
Contributor

/kind cleanup
/sig node
/priority important-soon
/assign @tallclair @yastij

What this PR does / why we need it:

Add e2e tests for runtimeclass scheduling

Which issue(s) this PR fixes:

ref #81016

Special notes for your reviewer:

Does this PR introduce a user-facing change?:

NONE

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:


@k8s-ci-robot k8s-ci-robot added release-note-none Denotes a PR that doesn't merit a release note. kind/cleanup Categorizes issue or PR as related to cleaning up code, process, or technical debt. size/M Denotes a PR that changes 30-99 lines, ignoring generated files. sig/node Categorizes an issue or PR as relevant to SIG Node. priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. area/test sig/testing Categorizes an issue or PR as relevant to SIG Testing. labels Aug 26, 2019
@draveness draveness force-pushed the feature/add-e2e-tests-for-runtime-class-scheduling branch from bee1483 to e6da6bc Compare August 26, 2019 00:23
@draveness draveness changed the title test(runtimeclass): add e2e tests for runtimeclass scheduling [RuntimeClassScheduling] test(runtimeclass): add e2e tests for runtimeclass scheduling - Part3 Aug 26, 2019
@draveness
Copy link
Contributor Author

/retest

@tallclair
Copy link
Member

Can you cherry-pick #81016 into this PR so that the tests can be run?

@draveness
Copy link
Contributor Author

draveness commented Aug 26, 2019 via email

Copy link
Member

@tallclair tallclair left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks!

@@ -97,21 +98,100 @@ var _ = ginkgo.Describe("[sig-node] RuntimeClass", func() {
pod := createRuntimeClassPod(f, rcName)
expectSandboxFailureEvent(f, pod, fmt.Sprintf("\"%s\" not found", rcName))
})

ginkgo.It("should reject a Pod requesting a RuntimeClass with conflict node selector", func() {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

needs [NodeFeature:RuntimeHandler]

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actually, since we expect this to fail in scheduling, the handler doesn't matter, so you can leave it off.

handler := PreconfiguredRuntimeHandler
if framework.TestContext.ContainerRuntime == "docker" {
handler = DockerRuntimeHandler
}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

extract this to a helper function in the framework directory, so it can be shared with the non-common tests.

"foo": "bar",
}
f.PodClient().Create(pod)
expectSandboxFailureEvent(f, pod, "conflict: runtimeClass.scheduling.nodeSelector[foo] = bar; pod.spec.nodeselector[foo] = conflict")
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This should be an admission failure, not a sandbox failure. I'd expect the Create call to fail. Since the PodClient always asserts success, you'll need to call the client directly:

_, err := f.ClientSet.CoreV1().Pods(f.Namespace.Name).Create(...)

expectSandboxFailureEvent(f, pod, "conflict: runtimeClass.scheduling.nodeSelector[foo] = bar; pod.spec.nodeselector[foo] = conflict")
})

ginkgo.It("should run a Pod requesting a RuntimeClass with scheduling", func() {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Here you need the NodeFeature

f.PodClient().Create(pod)

expectPodSuccess(f, pod)
pod, err = f.PodClient().Get(pod.Name, metav1.GetOptions{})
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: just use the pod returned by PodClient().Create

}
f.PodClient().Create(pod)

expectPodSuccess(f, pod)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think you ever labeled the nodes? In that case the pod won't actually be scheduled. See https://github.com/kubernetes/kubernetes/blob/master/test/e2e/scheduling/predicates.go#L372
Also, taint the node.

Once the node is labeled, also verify the pod is assigned to the correct node.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why are we tainting the node? this is disruptive and seems unnecessary?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

tainting might break parallel tests - so either Disruptive or Serial seems appropriate.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I see now that runtimeclass scheduling includes taints.

filed #83647 for tagging the test, looking for feedback...

@@ -97,21 +98,100 @@ var _ = ginkgo.Describe("[sig-node] RuntimeClass", func() {
pod := createRuntimeClassPod(f, rcName)
expectSandboxFailureEvent(f, pod, fmt.Sprintf("\"%s\" not found", rcName))
})

ginkgo.It("should reject a Pod requesting a RuntimeClass with conflict node selector", func() {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The test/e2e/common directory is for tests that should be run as part of the regular cluster E2E suite, and also the node E2E suite. We don't actually run admission controllers or a scheduler in the node E2E suite, so these tests shouldn't be part of that suite.

Move these tests to test/e2e/node/runtimeclass.go, and add a comment explaining why these are note in common

@draveness draveness force-pushed the feature/add-e2e-tests-for-runtime-class-scheduling branch from e6da6bc to 5d3a8e2 Compare August 27, 2019 06:32
@k8s-ci-robot k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. area/e2e-test-framework Issues or PRs related to refactoring the kubernetes e2e test framework and removed size/M Denotes a PR that changes 30-99 lines, ignoring generated files. labels Aug 27, 2019
@draveness draveness force-pushed the feature/add-e2e-tests-for-runtime-class-scheduling branch from 5d3a8e2 to 8f93edb Compare August 27, 2019 06:34
@k8s-ci-robot k8s-ci-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Aug 27, 2019
@draveness draveness force-pushed the feature/add-e2e-tests-for-runtime-class-scheduling branch from 8f93edb to 78da0f2 Compare August 27, 2019 06:35
@k8s-ci-robot k8s-ci-robot added area/apiserver sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. labels Aug 27, 2019
@draveness draveness force-pushed the feature/add-e2e-tests-for-runtime-class-scheduling branch from e56161a to 170a227 Compare August 27, 2019 06:58
@draveness
Copy link
Contributor Author

/assign @oomichi @neolit123

Could you help to take a look at the change in the test/e2e/framework package?

@draveness draveness force-pushed the feature/add-e2e-tests-for-runtime-class-scheduling branch from 7ba7421 to 6132bef Compare August 30, 2019 03:09
@k8s-ci-robot k8s-ci-robot added the sig/cluster-lifecycle Categorizes an issue or PR as relevant to SIG Cluster Lifecycle. label Aug 30, 2019
@draveness draveness force-pushed the feature/add-e2e-tests-for-runtime-class-scheduling branch 4 times, most recently from cada957 to 39ef57f Compare August 30, 2019 03:51
)

// PreconfiguredRuntimeClassHandler returns configured runtime handler.
func PreconfiguredRuntimeClassHandler() string {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i think we need a better location for this function instead of framework.go.
possibly /test/e2e/framework/node/runtime.go, @oomichi WDYT?

/hold

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Either of test/e2e/framework/{node,pod}/runtime.go works for me.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it would also be really nice if we documented this somewhere, other than kube-up, clusters don't tend to have "test-handler" registered, so they're only passing whenw e run docker and ~ skip them ... (?)

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I understand why it's like this, but it's a bit annoying to track this down and modify cluster deployments to support the test. I imagine most just don't have this setup and aren't aware of it.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I understand why it's like this, but it's a bit annoying to track this down and modify cluster deployments to support the test. I imagine most just don't have this setup and aren't aware of it.

Moving it to the test/e2e/framework/node/runtime.go would make the e2e node package depends on the framework. I checked out several other e2e packages which do not have this dependency.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I mean documenting somewhere that the cluster must have this magical test-handler configured

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Makes sense. Since this PR is just moving the code that's already there & needs to get in to 1.16, can we follow up on that?

I think the better approach would be to make it a variable that get's filled in by the test setup, rather than branching on the runtime, just make the handler parameter explicit (and skip the test if it's missing). But again, I'd like to get this in to 1.16 as is, and follow up with that change.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

SGTM 👍

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I created an issue to track this #82213

@k8s-ci-robot k8s-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Aug 30, 2019
@draveness draveness force-pushed the feature/add-e2e-tests-for-runtime-class-scheduling branch from 39ef57f to d2cb6a7 Compare August 30, 2019 18:20
@draveness
Copy link
Contributor Author

draveness commented Aug 30, 2019 via email


framework.ExpectNoError(e2epod.WaitForPodSuccessInNamespace(f.ClientSet, pod.Name, f.Namespace.Name))
framework.ExpectEqual(nodeSelector, pod.Spec.NodeSelector)
framework.ExpectEqual(tolerations, pod.Spec.Tolerations)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

tests fail due to this

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Try:

gomega.Expect(pod.Spec.Tolerations).To(gomega.ContainElement(tolerations[0]))

@draveness draveness force-pushed the feature/add-e2e-tests-for-runtime-class-scheduling branch 2 times, most recently from a296ed3 to ddc2e29 Compare September 1, 2019 06:54
@draveness draveness force-pushed the feature/add-e2e-tests-for-runtime-class-scheduling branch from ddc2e29 to 702646d Compare September 1, 2019 08:42
Copy link
Member

@yastij yastij left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm

cc @neolit123

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Sep 2, 2019
Copy link
Member

@neolit123 neolit123 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks for the update.
/lgtm
/approve

@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: draveness, neolit123, tallclair

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Sep 2, 2019
@draveness
Copy link
Contributor Author

/hold cancel

@k8s-ci-robot k8s-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Sep 2, 2019
@k8s-ci-robot k8s-ci-robot merged commit d114f33 into kubernetes:master Sep 2, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. area/apiserver area/e2e-test-framework Issues or PRs related to refactoring the kubernetes e2e test framework area/test cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. kind/cleanup Categorizes issue or PR as related to cleaning up code, process, or technical debt. lgtm "Looks good to me", indicates that a PR is ready to be merged. priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. release-note-none Denotes a PR that doesn't merit a release note. sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. sig/cluster-lifecycle Categorizes an issue or PR as relevant to SIG Cluster Lifecycle. sig/node Categorizes an issue or PR as relevant to SIG Node. sig/testing Categorizes an issue or PR as relevant to SIG Testing. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

8 participants