[associative_scan] Autograd separated #139939
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/139939
Note: Links to docs will display an error until the docs builds have been completed. ❌ 1 Cancelled Job, 4 Pending as of commit 393a3bd with merge base ada43ed. CANCELLED JOB - The following job was cancelled. Please retry.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@pytorchbot label "topic: not user facing"
Any review on this?
Thanks for your implementation! I have a question regarding the shape check:

```python
assert x.shape == shape, "All xs tensors must have the same shape"
```

Why does it require the tensors to have exactly the same shape? In the JAX implementation, only the size along the scan axis is required to match:

```python
num_elems = int(elems_flat[0].shape[axis])
if not all(int(elem.shape[axis]) == num_elems for elem in elems_flat[1:]):
```
Hi @WeihanLikk, thank you for looking into this. I was slow in working on this over the holidays but will pick up steam again. You are right, I don't think that this is necessarily required; only the scanned dimension needs to be identical across all inputs.
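A relaxed check in the spirit of the JAX snippet above might look like the following. This is a hypothetical sketch, not code from this PR; the names `check_scan_dim`, `xs`, and `dim` are assumptions for illustration.

```python
def check_scan_dim(xs, dim):
    """Verify that all scan inputs agree along the scanned dimension only.

    Unlike requiring identical full shapes, this mirrors the JAX-style
    check: tensors may differ in the other dimensions as long as they
    have the same number of elements along `dim`.
    """
    num_elems = xs[0].shape[dim]
    for x in xs[1:]:
        if x.shape[dim] != num_elems:
            raise ValueError(
                f"All xs tensors must have the same size along dim {dim}: "
                f"expected {num_elems}, got {x.shape[dim]}"
            )
    return num_elems
```

The check only inspects `.shape`, so it works for any tensor-like object and leaves the remaining dimensions unconstrained.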
> y_T = f(y_{T-1}, x_T)
> The gradients of y_T with respect to the vector x are computed as:
> dy_T / dx = dy_T/dx_1 + dy_T/dx_2 + ... + dy_T/dx_T
I'm not understanding this expression:
dy_T / dx = dy_T/dx_1 + dy_T/dx_2 + ... + dy_T/dx_T
Is there some typo in here?
Well, I guess this should be the gradient of the element y_T with respect to the vector of inputs, i.e., with respect to every element x_1, x_2, ..., x_T of that vector. This gives the individual partials [dy_T/dx_1, dy_T/dx_2, ..., dy_T/dx_T], and to get the final gradient these elements are summed.
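As a numeric sanity check of the summation discussed above, here is a small example (hypothetical, not part of the PR) for a multiplicative scan y_t = y_{t-1} * x_t. It compares finite-difference estimates of the partials dy_T/dx_t against the analytic rule that each partial is the product of the other elements, then sums them into the total dy_T/dx.

```python
def scan_last(xs):
    """Forward pass: return y_T for the scan y_t = y_{t-1} * x_t with y_0 = 1."""
    y = 1.0
    for x in xs:
        y *= x
    return y

def numeric_partials(xs, eps=1e-6):
    """Finite-difference estimates of [dy_T/dx_1, ..., dy_T/dx_T]."""
    base = scan_last(xs)
    grads = []
    for t in range(len(xs)):
        bumped = list(xs)
        bumped[t] += eps
        grads.append((scan_last(bumped) - base) / eps)
    return grads

xs = [2.0, 3.0, 4.0]
# Analytically, dy_T/dx_t is the product of all the other elements:
analytic = [3.0 * 4.0, 2.0 * 4.0, 2.0 * 3.0]  # [12.0, 8.0, 6.0]
numeric = numeric_partials(xs)
# Summing the per-element partials gives the combined quantity from the docstring.
total = sum(numeric)
```

Because y_T is multilinear in the inputs, the finite-difference partials match the analytic ones to high precision here.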
@garrett361 I've implemented a first version of the backward approach we discussed offline. The algorithm itself works, but there are still some things to sort out, in particular lifted arguments and partial gradient support. EDIT: Of course there is still further room to clean up the code and to adjust the documentation.
What does lifted argument mean? Partial gradient = when only some inputs require gradients?
In some cases, variables and other properties from the surrounding scope are closed over by the combine function and get lifted into explicit arguments of the traced subgraph.
Yes, that is correct. The same applies here as well. I know how to handle it, but I wanted to wait for the autograd rework.
Currently using associative scan for a big research project related to linear attention (or perhaps I should say, logarithmic attention 😉). Is there an expected timeline for autograd support to be available? Really excited about this PR!!
This PR was reopened (likely due to being reverted), so your approval was removed. Please request another review.
@huydhn I wouldn't know of this failure, but it could of course be unrelated.
@pytorchbot merge -f 'Incorrect revert, this change is not related to the trunk failure'

Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Merge failed. Reason: PR #139939 has not been reviewed yet.
@pytorchbot merge -f 'Incorrect revert, this change is not related to the trunk failure'

Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
This PR implements the Autograd feature of the associative_scan. Pull Request resolved: pytorch#139939 Approved by: https://github.com/ydwu4
This reverts commit 103f725. Reverted pytorch#139939 on behalf of https://github.com/huydhn due to Sorry for reverting your change but I am seeing a weird failure after this lands in trunk ([comment](pytorch#139939 (comment)))
This PR implements the Autograd feature of the associative_scan. Pull Request resolved: pytorch#139939 Approved by: https://github.com/huydhn
This PR implements the Autograd feature of the associative_scan.
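For readers new to the op: an associative scan computes all prefix combinations y_t = x_1 ⊕ x_2 ⊕ ... ⊕ x_t for an associative operator ⊕, and its divide-and-conquer evaluation has O(log T) parallel depth (hence the "logarithmic attention" quip above). A minimal pure-Python sketch of the forward semantics follows; this is a hypothetical reference for intuition, not this PR's implementation, and `associative_scan_ref` is an assumed name.

```python
def associative_scan_ref(combine_fn, xs):
    """Inclusive scan: returns [x1, x1⊕x2, ..., x1⊕...⊕xT].

    Adjacent pairs are combined and the half-length sequence is scanned
    recursively, so the recursion depth is O(log T). Correctness of this
    regrouping is exactly why `combine_fn` must be associative.
    """
    n = len(xs)
    if n == 1:
        return list(xs)
    # Combine adjacent pairs (a trailing odd element is handled below).
    pairs = [combine_fn(xs[i], xs[i + 1]) for i in range(0, n - 1, 2)]
    scanned_pairs = associative_scan_ref(combine_fn, pairs)
    # Interleave: odd positions come straight from the pair scan,
    # even positions need one extra combine with the raw element.
    out = [xs[0]]
    for i in range(1, n):
        if i % 2 == 1:
            out.append(scanned_pairs[i // 2])
        else:
            out.append(combine_fn(scanned_pairs[i // 2 - 1], xs[i]))
    return out
```

In the PR itself the scan operates on batched tensors along a chosen dimension with a traced `combine_fn`, but the prefix-combination semantics are the same.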
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben @Lucaskabela @yf225 @ColinPeppler @desertfire @ydwu4