stas00 commented Aug 9, 2025

Adds inference support for `TiledFusedLogitsLoss` by skipping the `backward` call inside `forward` when the incoming tensor doesn't require grad.

xref: snowflakedb/ArcticTraining#259
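
For readers unfamiliar with the pattern, here is a minimal sketch of the idea, not the DeepSpeed implementation. It assumes a sum-reduced loss and a single input tensor; `TiledLossSketch`, `sum_squared_error`, and the `shards` argument are hypothetical names chosen for illustration. When the input requires grad, each shard's gradient is computed inside `forward` and stashed for `backward`; otherwise only the forward computation runs.

```python
import torch


class TiledLossSketch(torch.autograd.Function):
    """Hypothetical illustration only; not the DeepSpeed TiledFusedLogitsLoss code."""

    @staticmethod
    def forward(ctx, loss_fn, x, y, shards):
        # Decide once whether gradients are needed (False during inference/eval).
        compute_grad = x.requires_grad
        x_shards = x.detach().chunk(shards, dim=0)
        y_shards = y.chunk(shards, dim=0)

        total = x.new_zeros(())
        grads = []
        for x_shard, y_shard in zip(x_shards, y_shards):
            if compute_grad:
                # Training path: run forward and backward per shard so only one
                # shard's graph is alive at a time.
                x_shard = x_shard.detach().requires_grad_(True)
                with torch.enable_grad():
                    shard_loss = loss_fn(x_shard, y_shard)
                grads.append(torch.autograd.grad(shard_loss, x_shard)[0])
                total += shard_loss.detach()
            else:
                # Inference path: forward only, no backward inside forward.
                total += loss_fn(x_shard, y_shard)

        if compute_grad:
            ctx.save_for_backward(torch.cat(grads, dim=0))
        return total

    @staticmethod
    def backward(ctx, grad_output):
        (x_grad,) = ctx.saved_tensors
        # One return value per forward argument: loss_fn, x, y, shards.
        return None, x_grad * grad_output, None, None


def sum_squared_error(pred, target):
    # Sum reduction, so per-shard losses add up to the full loss.
    return ((pred - target) ** 2).sum()


x = torch.randn(8, 4, requires_grad=True)
y = torch.randn(8, 4)

# Training: gradients are precomputed inside forward and returned in backward.
train_loss = TiledLossSketch.apply(sum_squared_error, x, y, 4)
train_loss.backward()

# Inference: the input does not require grad, so the backward step is skipped.
with torch.no_grad():
    eval_loss = TiledLossSketch.apply(sum_squared_error, x.detach(), y, 4)
```

Both paths produce the same loss value; the inference path just avoids the per-shard autograd work and the gradient buffer.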

sfc-gh-truwase merged commit a54c394 into master on Aug 11, 2025
12 checks passed
sfc-gh-truwase deleted the stas/TiledFusedLogitsLoss_eval branch on August 11, 2025 21:44
LYMDLUT pushed a commit to LYMDLUT/DeepSpeed that referenced this pull request Aug 20, 2025
Adding inference support for `TiledFusedLogitsLoss` by skipping
`backward` inside `forward` if the incoming tensor doesn't require grad.

xref: snowflakedb/ArcticTraining#259

---------

Signed-off-by: Stas Bekman <[email protected]>
Co-authored-by: Rui Yan <[email protected]>
Co-authored-by: Olatunji Ruwase <[email protected]>
Signed-off-by: lym <[email protected]>
mauryaavinash95 pushed a commit to DataStates/DeepSpeed that referenced this pull request Oct 4, 2025