Skip to content

Conversation

@tohtana
Copy link
Collaborator

@tohtana tohtana commented Sep 9, 2025

The initialization of DeepCompile+Z1/2 now fails due to the change introduced in #7509.

This PR resolves the issue by:

  • Adding an argument to optimizer.get_flat_partition
  • Skipping the entire allreduce function in the engine

@tohtana tohtana enabled auto-merge (squash) September 10, 2025 17:59
@tohtana tohtana merged commit 0e859aa into master Sep 10, 2025
12 checks passed
@tohtana tohtana deleted the tohtana/fix_dc_gradbuf_z1 branch September 10, 2025 18:12
mauryaavinash95 pushed a commit to DataStates/DeepSpeed that referenced this pull request Oct 4, 2025
The initialization of DeepCompile+Z1/2 now fails due to the change
introduced in deepspeedai#7509.

This PR resolves the issue by:
- Adding an argument to optimizer.get_flat_partition
- Skipping the entire allreduce function in the engine

---------

Signed-off-by: Masahiro Tanaka <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants