Conversation

@stas00
Collaborator
@stas00 stas00 commented Jul 18, 2025

This PR adds `TiledFusedLogitsLoss` for an efficient fused logits+loss computation. This version pre-calculates the gradients in `forward`, avoiding recomputation in `backward` (similar to the Liger-Kernel implementation).
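For context, here is a minimal, self-contained sketch of the grads-in-forward idea. All names below are illustrative assumptions, not the merged `TiledFusedLogitsLoss` API: each logits tile is run under `enable_grad`, the per-tile gradient w.r.t. the logits is computed immediately and stashed, and `backward` only rescales the stored gradients instead of recomputing logits+loss.

```python
import torch
import torch.nn.functional as F

def sum_cross_entropy(logits, labels):
    # Sum-reduced so per-shard losses and gradients add up exactly.
    return F.cross_entropy(logits, labels, reduction="sum")

class FusedLogitsLossSketch(torch.autograd.Function):
    """Grads-in-forward pattern (illustrative only): compute the loss tile by
    tile under enable_grad, stash dloss/dlogits, and make backward a rescale."""

    @staticmethod
    def forward(ctx, loss_fn, logits, labels, shards):
        grads = torch.zeros_like(logits)
        total_loss = logits.new_zeros(())
        offset = 0
        for x_s, y_s in zip(torch.chunk(logits, shards), torch.chunk(labels, shards)):
            x_s = x_s.detach().requires_grad_(True)
            with torch.enable_grad():
                loss = loss_fn(x_s, y_s)
            # Compute the per-tile gradient right away and store it.
            (g,) = torch.autograd.grad(loss, (x_s,))
            grads[offset:offset + x_s.shape[0]] = g
            total_loss += loss.detach()
            offset += x_s.shape[0]
        ctx.save_for_backward(grads)
        return total_loss / offset  # mean loss over all rows

    @staticmethod
    def backward(ctx, grad_output):
        (grads,) = ctx.saved_tensors
        # No recomputation here: gradients were already produced in forward.
        return None, grads * (grad_output / grads.shape[0]), None, None

# Usage sketch:
# logits = torch.randn(128, 32000, requires_grad=True)
# labels = torch.randint(0, 32000, (128,))
# loss = FusedLogitsLossSketch.apply(sum_cross_entropy, logits, labels, 4)
# loss.backward()
```

Because the full `[rows, vocab]` gradient never has to be rematerialized in `backward`, peak memory stays bounded by one tile's activations plus the stored gradient buffer.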

Signed-off-by: Stas Bekman <[email protected]>
@stas00 stas00 marked this pull request as ready for review July 29, 2025 17:08
@stas00 stas00 enabled auto-merge (squash) July 29, 2025 18:20
@stas00
Collaborator Author

stas00 commented Jul 30, 2025

@tjruwase, could you please review?

It should be easy to review since it's a fork of the existing `SequenceTiledCompute` autograd class, just specialized for logits+loss, which makes it easier to understand and invoke; i.e. nothing new.

@stas00 stas00 merged commit 3292e07 into master Jul 30, 2025
9 checks passed
@stas00 stas00 deleted the stas/TiledFusedLogitsLoss branch July 30, 2025 18:15
```python
with torch.enable_grad():
    args = (self, x_shard, y_shard)
    if mask is not None:
        args.append(mask_shards[i])
```
Aurick Qiao
Contributor
@stas00 this needs to be `args = args + (mask_shards[i],)`, since `args` is a tuple and tuples don't have `append`.
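With that fix applied, the hunk would read (sketch):

```python
with torch.enable_grad():
    args = (self, x_shard, y_shard)
    if mask is not None:
        # tuples are immutable, so rebuild the tuple instead of calling append
        args = args + (mask_shards[i],)
```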

@stas00
Collaborator Author
Thank you, Aurick - applying it here #7459

qimcis pushed a commit to qimcis/DeepSpeed that referenced this pull request Jul 31, 2025
LYMDLUT pushed a commit to LYMDLUT/DeepSpeed that referenced this pull request Aug 20, 2025
mauryaavinash95 pushed a commit to DataStates/DeepSpeed that referenced this pull request Oct 4, 2025