Skip to content

Conversation

@wangyan-mms
Copy link
Contributor

@wangyan-mms wangyan-mms commented Oct 21, 2025

This change fixes an overflow issue in TensorListMetadata where the sizes array used int (32-bit signed integer). This caused incorrect behavior (e.g., no parameter updates) when handling tensor sizes exceeding INT_MAX (2^31 - 1).

The change here is identical to NVIDIA/apex PR #1825 for multi_tensor_apply.cuh.

For further details regarding this fix, please refer to issue #7640.

@hwchen2017 hwchen2017 merged commit 7af561c into deepspeedai:master Oct 21, 2025
15 of 16 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants