stas00 (Collaborator) commented on Oct 25, 2025

It looks like save_checkpoint expects the get_model_parallel_* API on the mpu object, so this PR adds it to the slim Ulysses mpu variant.

This fixes the following failure in the HF Trainer:

```
[rank1]:   File "/code/users/stas/github/transformers-alst-integration/src/transformers/trainer.py", line 3248, in _save_optimizer_and_scheduler
[rank1]:     self.model_wrapped.save_checkpoint(output_dir)
[rank1]:   File "/code/users/stas/github/DeepSpeed/deepspeed/runtime/engine.py", line 3497, in save_checkpoint
[rank1]:     self._save_checkpoint(save_dir,
[rank1]:   File "/code/users/stas/github/DeepSpeed/deepspeed/runtime/engine.py", line 3709, in _save_checkpoint
[rank1]:     save_path = self._get_ckpt_name(save_dir, tag)
[rank1]:                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank1]:   File "/code/users/stas/github/DeepSpeed/deepspeed/runtime/engine.py", line 3039, in _get_ckpt_name
[rank1]:     mp_rank = 0 if self.mpu is None else self.mpu.get_model_parallel_rank()
[rank1]:                                          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank1]: AttributeError: module 'deepspeed.runtime.sequence_parallel.parallel_state_sp' has no attribute 'get_model_parallel_rank'. Did you mean: 'get_sequence_parallel_rank'?
```
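
For context, here is a minimal sketch of the kind of shim this adds to the slim Ulysses mpu module. It is an illustration, not the exact diff: Ulysses shards along the sequence dimension rather than the model weights, so the model-parallel view is degenerate, and the function bodies (and the inclusion of get_model_parallel_world_size) are assumptions.

```python
# Sketch only: hypothetical additions to a slim Ulysses mpu module such as
# deepspeed.runtime.sequence_parallel.parallel_state_sp. Ulysses parallelizes
# the sequence dimension, not the model weights, so model parallelism is
# degenerate here (assumption).

def get_model_parallel_rank() -> int:
    # The engine calls mpu.get_model_parallel_rank() in _get_ckpt_name();
    # with no model parallelism, every rank is rank 0 of its own group.
    return 0

def get_model_parallel_world_size() -> int:
    # Assumption: each model-parallel group is a singleton.
    return 1
```

With get_model_parallel_rank() returning 0, the _get_ckpt_name line from the traceback (mp_rank = 0 if self.mpu is None else self.mpu.get_model_parallel_rank()) resolves mp_rank to 0, matching the non-model-parallel checkpoint layout.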

Signed-off-by: Stas Bekman <[email protected]>
stas00 merged commit 433e3c7 into master on Oct 28, 2025 (12 checks passed).
stas00 deleted the stas/ulysses-mpu branch on Oct 28, 2025 at 03:43.
stas00 (Collaborator, Author) commented on Oct 28, 2025

Thank you, Tunji!
