Fix illegal memory access with multi_tensor_apply size above INT_MAX #7639

wangyan-mms · 2025-10-21T04:17:16Z

This change fixes an overflow in TensorListMetadata, whose sizes array was declared as int (a 32-bit signed integer). For tensors with more than INT_MAX (2^31 - 1) elements, the stored size overflowed, causing incorrect behavior such as parameters silently not being updated.
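To illustrate the failure mode, here is a minimal host-side C++ sketch. It is not the DeepSpeed or apex code: `NarrowMetadata`, `WideMetadata`, `record_size`, `elements_in_chunk`, and `kMaxTensors` are hypothetical names, the use of `int64_t` as the wider type is an assumption, and the chunked-loop shape is only assumed to resemble how a multi-tensor kernel consumes the stored sizes.

```cpp
#include <cstdint>
#include <limits>
#include <type_traits>

// Hypothetical capacity; the real metadata struct has its own limits.
constexpr int kMaxTensors = 4;

// Stand-in for metadata that stores element counts as 32-bit signed ints.
struct NarrowMetadata {
    int32_t sizes[kMaxTensors];
};

// Stand-in for metadata that stores element counts as 64-bit ints (assumed fix).
struct WideMetadata {
    int64_t sizes[kMaxTensors];
};

// Store a tensor's element count in the metadata. For NarrowMetadata the
// value is narrowed to int32_t: modular wrap-around in C++20, and
// implementation-defined (but wrapping on mainstream compilers) before that.
template <typename Meta>
void record_size(Meta& meta, int slot, int64_t numel) {
    using Elem = std::remove_reference_t<decltype(meta.sizes[0])>;
    meta.sizes[slot] = static_cast<Elem>(numel);
}

// Number of elements a worker would process for one chunk, given the
// stored size. A non-positive remainder means the chunk does no work.
template <typename Meta>
int64_t elements_in_chunk(const Meta& meta, int slot,
                          int64_t chunk_idx, int64_t chunk_size) {
    const int64_t remaining =
        static_cast<int64_t>(meta.sizes[slot]) - chunk_idx * chunk_size;
    if (remaining <= 0) return 0;
    return remaining < chunk_size ? remaining : chunk_size;
}
```

With a tensor of INT_MAX + 1 elements, the narrow struct ends up holding a negative size, so every chunk in this sketch processes zero elements, which is consistent with the "no parameter updates" symptom described above. The wide struct keeps the true count and processes full chunks.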

The change here is identical to NVIDIA/apex PR #1825 for multi_tensor_apply.cuh.

For further details regarding this fix, please refer to issue #7640.

Signed-off-by: Wang Yan <[email protected]>

fix illegal memory access with multi_tensor_apply size above INT_MAX

baa8734

Signed-off-by: Wang Yan <[email protected]>

wangyan-mms requested a review from tjruwase as a code owner October 21, 2025 04:17

wangyan-mms marked this pull request as draft October 21, 2025 05:49

wangyan-mms marked this pull request as ready for review October 21, 2025 07:04

wangyan-mms mentioned this pull request Oct 21, 2025

[BUG] Integer overflow in FusedAdam silently prevents weight updates for large tensors (>= 2^31 elements) #7640

Closed

hwchen2017 approved these changes Oct 21, 2025

hwchen2017 merged commit 7af561c into deepspeedai:master Oct 21, 2025
15 of 16 checks passed
