-
Notifications
You must be signed in to change notification settings - Fork 26.3k
Closed
Labels
high prioritymodule: flaky-testsProblem is a flaky test in CIProblem is a flaky test in CIoncall: distributedAdd this issue/PR to distributed oncall triage queueAdd this issue/PR to distributed oncall triage queuetriage reviewtriagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate moduleThis issue has been looked at a team member, and triaged and prioritized into an appropriate module
Description
May 07 08:33:15 ======================================================================
May 07 08:33:15 FAIL [0.206s]: test_barrier_group_cuda (__main__.TestDistBackend)
May 07 08:33:15 ----------------------------------------------------------------------
May 07 08:33:15 Traceback (most recent call last):
May 07 08:33:15 File "/opt/conda/lib/python3.6/site-packages/torch/testing/_internal/common_distributed.py", line 173, in wrapper
May 07 08:33:15 self._join_processes(fn)
May 07 08:33:15 File "/opt/conda/lib/python3.6/site-packages/torch/testing/_internal/common_distributed.py", line 275, in _join_processes
May 07 08:33:15 self._check_return_codes(elapsed_time)
May 07 08:33:15 File "/opt/conda/lib/python3.6/site-packages/torch/testing/_internal/common_distributed.py", line 327, in _check_return_codes
May 07 08:33:15 "Expected zero exit code but got {}".format(first_process.exitcode)
May 07 08:33:15 File "/opt/conda/lib/python3.6/site-packages/torch/testing/_internal/common_utils.py", line 973, in assertEqual
May 07 08:33:15 super().assertLessEqual(abs(x - y), atol, message)
May 07 08:33:15 AssertionError: 77 not less than or equal to 1e-05 : Expected zero exit code but got 77
cc @ezyang @gchanan @zou3519 @pietern @mrshenli @pritamdamania87 @zhaojuanmao @satgera @rohan-varma @gqchen @aazzolini @xush6528 @osalpekar
Metadata
Metadata
Assignees
Labels
high prioritymodule: flaky-testsProblem is a flaky test in CIProblem is a flaky test in CIoncall: distributedAdd this issue/PR to distributed oncall triage queueAdd this issue/PR to distributed oncall triage queuetriage reviewtriagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate moduleThis issue has been looked at a team member, and triaged and prioritized into an appropriate module