[Bug Fix] Garbage Collect Node e2e Failing #42661
Conversation
/release-note-none

Flake PR: #38050

/lgtm
[APPROVALNOTIFIER] This PR is APPROVED

The following people have approved this PR: dashpole, dchen1107

Needs approval from an approver in each of these OWNERS files. We suggest the following people:

@k8s-bot non-cri e2e test this
```diff
 for _, pod := range test.testPods {
 	By(fmt.Sprintf("Deleting Pod %v", pod.podName))
-	f.PodClient().DeleteSync(pod.podName, &metav1.DeleteOptions{}, defaultRuntimeRequestTimeoutDuration)
+	f.PodClient().DeleteSync(pod.podName, &metav1.DeleteOptions{}, podDisappearTimeout)
```
nit: The name is odd. Could it be podDeletionTimeout?
The description is equally odd:

```go
// podDisappearTimeout is the timeout to wait node disappear.
podDisappearTimeout = time.Minute * 2
```
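For illustration only, the suggested rename with a corrected description might read like this (a sketch, not code from this PR):

```go
// podDeletionTimeout is how long to wait for a pod to disappear
// after it has been deleted.
podDeletionTimeout = 2 * time.Minute
```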
Might be worth doing a quick pass over the e2e node suite to rename these constants and move them to a single file.
There is also podWaitTimeout in lifecycle_hook_test.go, which is set to 3 minutes, and is used as a timeout for deletion.
Yeah. I feel we could add a default timeout to DeleteSync() itself, derived from the framework; that means fewer constants to deal with in such a workflow.
If this PR is urgent, don't block on these comments. A rough sketch of the idea follows.
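A minimal sketch of that suggestion, assuming a hypothetical shared default constant (the names here are illustrative, not the framework's actual API):

```go
// DeleteSync deletes the pod and waits for it to disappear. A zero timeout
// falls back to a shared default, so individual tests no longer need to
// define their own deletion-timeout constants.
func (c *PodClient) DeleteSync(name string, options *metav1.DeleteOptions, timeout time.Duration) {
	if timeout == 0 {
		timeout = DefaultPodDeletionTimeout // hypothetical shared default
	}
	// Delete the pod, tolerating the case where it is already gone.
	if err := c.Delete(name, options); err != nil && !apierrs.IsNotFound(err) {
		Failf("Failed to delete pod %q: %v", name, err)
	}
	// Poll until the pod no longer exists, up to the resolved timeout.
	gomega.Expect(WaitForPodToDisappear(c.f.ClientSet, c.f.Namespace.Name, name,
		labels.Everything(), 2*time.Second, timeout)).To(gomega.Succeed(),
		"wait for pod %q to disappear", name)
}
```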
#42734 creates a default and addresses these comments
@k8s-bot gce etcd3 e2e test this
@dashpole: The following test(s) failed:

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.
Automatic merge from submit-queue
Automatic merge from submit-queue (batch tested with PRs 42734, 42745, 42758, 42814, 42694)

Create DefaultPodDeletionTimeout for e2e tests

In our e2e and e2e_node tests, we had a number of different timeouts for deletion. Recent changes to the way deletion works (kubernetes#41644, kubernetes#41456) have resulted in some timeouts in e2e tests; kubernetes#42661 was the most recent fix for this. Most of these tests are not meant to test pod deletion latency, but rather just to clean up pods after a test is finished. For this reason, we should change all these tests to use a standard, fairly high timeout for deletion.

cc @vishh @Random-Liu
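As a sketch of what such a standard default could look like (the 3-minute value follows the description above; treat the name and location as illustrative rather than the exact merged code):

```go
// DefaultPodDeletionTimeout is one shared, deliberately generous timeout for
// tests that delete pods only to clean up after themselves, not to measure
// deletion latency.
const DefaultPodDeletionTimeout = 3 * time.Minute
```

Cleanup call sites would then all pass the shared default to DeleteSync instead of defining per-file constants.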
This node e2e test uses its own deletion timeout (1 minute) instead of the default (3 minutes).
#41644 likely increased the time deletion takes; see that PR for the analysis.
There may be other problems with this test, but they are hard to distinguish from failures caused by hitting this low timeout.
This PR changes the Garbage Collector test to use the default timeout. This should allow us to discern if there are any actual bugs to fix.
cc @kubernetes/sig-node-bugs @calebamiles @derekwaynecarr