Skip to content

Conversation

@delock
Copy link
Collaborator

@delock delock commented Nov 17, 2025

The AutoTP supported models does not include Qwen2.5, which is already supported. Update the document.
(https://www.deepspeed.ai/tutorials/automatic-tensor-parallelism/#supported-models)

Signed-off-by: Ma, Guokai <[email protected]>
@xylian86
Copy link
Contributor

@delock Qwen3 is the latest generation in Qwen series. Does AutoTP support Qwen3 (https://huggingface.co/Qwen/Qwen3-8B)? If yes, it would be great to include it as well.

@delock
Copy link
Collaborator Author

delock commented Nov 18, 2025

@xylian86 Just checked AutoTP works on Qwen3 as well, will update doc, thanks!

Signed-off-by: Ma, Guokai <[email protected]>
@PKUWZP PKUWZP self-requested a review November 18, 2025 15:01
@sfc-gh-truwase sfc-gh-truwase merged commit a83fd7b into master Nov 18, 2025
2 checks passed
@sfc-gh-truwase sfc-gh-truwase deleted the gma/autotp-models branch November 18, 2025 16:06
rraminen pushed a commit to rraminen/DeepSpeed that referenced this pull request Dec 1, 2025
The AutoTP supported models does not include Qwen2.5, which is already
supported. Update the document.

(https://www.deepspeed.ai/tutorials/automatic-tensor-parallelism/#supported-models)

---------

Signed-off-by: Ma, Guokai <[email protected]>
Signed-off-by: rraminen <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants