-
Notifications
You must be signed in to change notification settings - Fork 26.3k
[DCP] Add fsspec.transaction context when writing checkpoint to storage #112191
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/112191
Note: Links to docs will display an error until the docs builds have been completed. ❌ 1 New Failure, 1 Unrelated FailureAs of commit 167491b with merge base 5b71834 ( NEW FAILURE - The following job has failed:
BROKEN TRUNK - The following job failed but were present on the merge base:👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
|
|
This pull request was exported from Phabricator. Differential Revision: D50701929 |
497be62 to
00ded68
Compare
|
This pull request was exported from Phabricator. Differential Revision: D50701929 |
1 similar comment
|
This pull request was exported from Phabricator. Differential Revision: D50701929 |
…ge (pytorch#112191) Summary: Adding fsspec.transaction to safeguard checkpointing writing. With the context, it should only commit if there was no exception and discard otherwise. Test Plan: ``` command: buck test @//mode/dev-nosan //caffe2/test/distributed/checkpoint/fb:test_fsspec_filesystem -- --print-passing-details Buck UI: https://www.internalfb.com/buck2/7eff3718-9c08-4397-86ae-31ba0bdee65c Test UI: https://www.internalfb.com/intern/testinfra/testrun/4785074810069469 Network: Up: 0B Down: 0B Jobs completed: 10. Time elapsed: 1:25.0s. Tests finished: Pass 3. Fail 0. Fatal 0. Skip 0. Build failure 0 ``` Reviewed By: rohan-varma Differential Revision: D50701929
00ded68 to
89c17a1
Compare
|
This pull request was exported from Phabricator. Differential Revision: D50701929 |
…ge (pytorch#112191) Summary: Adding fsspec.transaction to safeguard checkpointing writing. With the context, it should only commit if there was no exception and discard otherwise. Test Plan: ``` command: buck test @//mode/dev-nosan //caffe2/test/distributed/checkpoint/fb:test_fsspec_filesystem -- --print-passing-details Buck UI: https://www.internalfb.com/buck2/7eff3718-9c08-4397-86ae-31ba0bdee65c Test UI: https://www.internalfb.com/intern/testinfra/testrun/4785074810069469 Network: Up: 0B Down: 0B Jobs completed: 10. Time elapsed: 1:25.0s. Tests finished: Pass 3. Fail 0. Fatal 0. Skip 0. Build failure 0 ``` Reviewed By: rohan-varma Differential Revision: D50701929
89c17a1 to
167491b
Compare
|
This pull request was exported from Phabricator. Differential Revision: D50701929 |
|
@pytorchbot merge (Initiating merge automatically since Phabricator Diff has merged) |
Merge startedYour change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
…ge (pytorch#112191) Summary: Adding fsspec.transaction to safeguard checkpointing writing. With the context, it should only commit if there was no exception and discard otherwise. Test Plan: ``` command: buck test @//mode/dev-nosan //caffe2/test/distributed/checkpoint/fb:test_fsspec_filesystem -- --print-passing-details ``` Reviewed By: rohan-varma Differential Revision: D50701929 Pull Request resolved: pytorch#112191 Approved by: https://github.com/rohan-varma
…ge (pytorch#112191) Summary: Adding fsspec.transaction to safeguard checkpointing writing. With the context, it should only commit if there was no exception and discard otherwise. Test Plan: ``` command: buck test @//mode/dev-nosan //caffe2/test/distributed/checkpoint/fb:test_fsspec_filesystem -- --print-passing-details ``` Reviewed By: rohan-varma Differential Revision: D50701929 Pull Request resolved: pytorch#112191 Approved by: https://github.com/rohan-varma
Summary: Adding fsspec.transaction to safeguard checkpointing writing. With the context, it should only commit if there was no exception and discard otherwise.
Test Plan:
Reviewed By: rohan-varma
Differential Revision: D50701929