Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TestContainerPTY occasionally fails on windows due to expected \x00 in output #8422

Open
Iceber opened this issue Apr 20, 2023 · 5 comments
Open

Comments

@Iceber
Copy link
Member

Iceber commented Apr 20, 2023

Description

TestContainerPTY occasionally fails on windows due to expected \x00 in output.

out := buf.String()
if !strings.ContainsAny(fmt.Sprintf("%#q", out), `\x00`) {
t.Fatal(`expected \x00 in output`)
}

https://github.com/containerd/containerd/actions/runs/4701717798/jobs/8338201800?pr=8393#step:16:49
https://github.com/containerd/containerd/actions/runs/4750205702/jobs/8438224967?pr=8418#step:16:50

Occurrences

Date Job PR
2024-11-15 Windows Integration (windows-2022, cgroupfs) #11004
2024-11-25 Windows Integration (windows-2019, cgroupfs) #11053
@Iceber
Copy link
Member Author

Iceber commented Apr 25, 2023

@gabriel-samfira Hi,I don't have a windows environment to reproduce it, do you have time to help look at it. Thanks

@gabriel-samfira
Copy link
Contributor

Hi @Iceber

Yup. I'll have a look!

@gabriel-samfira
Copy link
Contributor

I have not forgotten about this. While reproducing this, it seems that the container exits normally, but there's nothing in the stdout buffer. Will debug further and see why that is.

@gabriel-samfira
Copy link
Contributor

@aznashwan would you mind taking this one? I've been sidetracked a bit and this deserves some attention. Thanks!

@aznashwan
Copy link
Contributor

While I am unsure of the cause yet, I've run the workflow with only the flaky test on repeat a dozen times or so to see how it behaves.

TL;DR:

  • more likely on 2022 than 2019, but affects both at less than 2% from my testing.
  • does not seem to be a race condition or conflict with other tests. (I ran TestContainerPTY by itself)
  • seems to occur per go test command (i.e. running one go test -count 1000 command will either see it failing instantly or working all 1k runs), so might be a test setup issue?
  • sandboxed 2019 seems immune but may be just "good luck" on it not flaking.

The runs on my fork, done over this containerd commit, can be broken down as follows:

  1. single-command runs with go test -run TestContainerPTY -count 1000 (run numbers 60-65)
  • all tests happened to run fine (a couple were killed for exceeding the time limit on GH actions though)
  1. re-executing go test -run TestContainerPTY every time (run numbers 66-75) yielded these results:
Host OS Failed test# average first fail
2019 non-sandboxed 449, ✅, 342, ✅, ✅, ✅, ✅, ✅, ✅, ✅ 395
2019 sandboxed all worked -
2022 non-sandboxed 6, 39, 95, 277, 40, 48, 142, ✅, 3, 40 68
2022 sandboxed 57, 4, 255, 127, 61, 77, 58, 44, 4, ✅ 76

At this stage I suspect it may be related to either how the tests are set up, or it really is a random I/O failure in HCS, but will be investigating further...

@dosubot dosubot bot added the Stale label Aug 2, 2024
@dmcgowan dmcgowan changed the title Integration: TestContainerPTY occasionally fails on windows due to expected \x00 in output TestContainerPTY occasionally fails on windows due to expected \x00 in output Nov 22, 2024
@dmcgowan dmcgowan added this to the 2.1 milestone Jan 21, 2025
@dmcgowan dmcgowan moved this to Todo in Issue Management Jan 21, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: Todo
Development

No branches or pull requests

4 participants