[forward-fix] Fix multigpu varying tensor optim tests #106887

janeyx99 · 2023-08-09T18:24:22Z

Forward fixes #106615 by increasing tolerance in the test.

The capturable foreach implementation produces slightly different results because it updates params with a different order of operations. I also tried comparing against fp64, but that introduced more disparity in the other optimizer configs. The fp64 comparison is worth trying at a later point, but let's get the test passing first.
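For readers unfamiliar with why a different order of operations calls for a looser tolerance, here is a minimal sketch in plain Python. It is not the code from `test_optim.py`; the helper names `sum_left` and `sum_right` and the chosen `rel_tol` are illustrative assumptions. It only shows that floating-point addition is not associative, so two mathematically equivalent update orders can disagree in the last bits and should be compared with a tolerance rather than exact equality.

```python
import math


def sum_left(a, b, c):
    # Hypothetical helper: accumulate left to right, (a + b) + c.
    return (a + b) + c


def sum_right(a, b, c):
    # Hypothetical helper: same math, different grouping, a + (b + c).
    return a + (b + c)


left = sum_left(0.1, 0.2, 0.3)
right = sum_right(0.1, 0.2, 0.3)

# Exact comparison fails because rounding happens at different points.
exact_match = left == right

# A tolerance-based comparison accepts the tiny rounding difference.
# rel_tol=1e-9 is an arbitrary illustrative value, not the test's tolerance.
close_match = math.isclose(left, right, rel_tol=1e-9)

print(exact_match, close_match)
```

The same idea applies to tensor comparisons in the test: when two implementations reorder arithmetic, the comparison needs a tolerance wide enough to absorb the rounding drift.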

Stack from ghstack (oldest at bottom):

-> [forward-fix] Fix multigpu varying tensor optim tests #106887

[ghstack-poisoned]

pytorch-bot · 2023-08-09T18:24:25Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/106887

❗ 1 Active SEVs

There is 1 currently active SEV. If your PR is affected, please view it below:

ROCm CI upgrade in progress

✅ 3 Unrelated Failures

As of commit 2040fd6:

UNSTABLE - The following jobs failed but were likely due to flakiness present on trunk and has been marked as unstable:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: 6a2a2a3 Pull Request resolved: #106887

janeyx99 · 2023-08-09T18:29:12Z

test/optim/test_optim.py

-                        and isinstance(actual, torch.Tensor)
-                        and actual.ndim == 1
-                    ):
-                        actual = actual[0]


This check is not needed after the change of step to a 0D tensor.

janeyx99 · 2023-08-10T14:21:47Z

@pytorchbot merge

pytorchmergebot · 2023-08-10T14:23:44Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

[forward-fix] Fix multigpu varying tensor optim tests

2040fd6

[ghstack-poisoned]

pytorch-bot bot added the release notes: foreach_frontend release notes category label Aug 9, 2023

janeyx99 added a commit that referenced this pull request Aug 9, 2023

[forward-fix] Fix multigpu varying tensor optim tests

d2735ed

ghstack-source-id: 6a2a2a3 Pull Request resolved: #106887

janeyx99 added the ciflow/periodic Trigger jobs ran periodically on master (periodic.yml) on the PR label Aug 9, 2023

janeyx99 added the topic: not user facing topic category label Aug 9, 2023

janeyx99 requested review from albanD and izaitsevfb August 9, 2023 19:17

janeyx99 added the ciflow/trunk Trigger trunk jobs on your pull request label Aug 9, 2023

izaitsevfb approved these changes Aug 9, 2023

pytorchmergebot added the merging label Aug 10, 2023

pytorchmergebot added Merged and removed merging labels Aug 10, 2023

pytorchmergebot closed this in c0f80c6 Aug 10, 2023

facebook-github-bot deleted the gh/janeyx99/81/head branch August 14, 2023 14:17
