Skip to content

[Revert] FSDP+Dtensor refactor related changes#46246

Merged
vasqu merged 8 commits into
mainfrom
revert-dtensors-refactor
May 28, 2026
Merged

[Revert] FSDP+Dtensor refactor related changes#46246
vasqu merged 8 commits into
mainfrom
revert-dtensors-refactor

Conversation

@vasqu
Copy link
Copy Markdown
Contributor

@vasqu vasqu commented May 27, 2026

As per title 👀

Discussed internally

@github-actions
Copy link
Copy Markdown
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: afmoe, apertus, arcee, aria, audioflamingo3, bamba, bitnet, cohere, cohere2, cohere2_moe, csm, cwm, data2vec, dbrx, deepseek_v2, deepseek_v3

@HuggingFaceDocBuilderDev
Copy link
Copy Markdown

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@vasqu vasqu marked this pull request as ready for review May 27, 2026 17:24
@3outeille
Copy link
Copy Markdown
Member

everything is here

Copy link
Copy Markdown
Member

@Cyrilvallez Cyrilvallez left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Core files such as modeling_utils and core_model_loading lgtm, I think you got it all!

@vasqu vasqu added this pull request to the merge queue May 28, 2026
@vasqu vasqu removed this pull request from the merge queue due to a manual request May 28, 2026
@vasqu vasqu merged commit 295cee3 into main May 28, 2026
31 checks passed
@vasqu vasqu deleted the revert-dtensors-refactor branch May 28, 2026 06:20
@3outeille 3outeille restored the revert-dtensors-refactor branch May 28, 2026 17:19
3outeille added a commit that referenced this pull request May 28, 2026
@3outeille 3outeille deleted the revert-dtensors-refactor branch May 28, 2026 17:33
yuchenxie4645 pushed a commit to yuchenxie4645/transformers that referenced this pull request May 28, 2026
* Revert "init FSDP through from_pretrained (huggingface#46102)"

This reverts commit 0588858.

* Revert "Fix FSDP2 and distributed checkpointing imports for older PyTorch versions (huggingface#46141)"

This reverts commit 634500b.

* Revert "Update cohere2_moe tp_plan (huggingface#46189)"

This reverts commit e65c3a2.

* Revert "FSDP + TP & native save/load distributed (huggingface#45028)"

This reverts commit 9ba8e85.

* fix

* they should have been deleted I think

* these are actually needed changes

* oops
kashif pushed a commit to kashif/transformers that referenced this pull request Jun 1, 2026
* Revert "init FSDP through from_pretrained (huggingface#46102)"

This reverts commit 0588858.

* Revert "Fix FSDP2 and distributed checkpointing imports for older PyTorch versions (huggingface#46141)"

This reverts commit 634500b.

* Revert "Update cohere2_moe tp_plan (huggingface#46189)"

This reverts commit e65c3a2.

* Revert "FSDP + TP & native save/load distributed (huggingface#45028)"

This reverts commit 9ba8e85.

* fix

* they should have been deleted I think

* these are actually needed changes

* oops
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants