
Conversation

@laithsakka
Contributor

@laithsakka laithsakka commented Oct 23, 2024

Stack from ghstack (oldest at bottom):

Tested internally here: https://www.internalfb.com/diff/D64057744
This is a reland after previous internal failures.
The main change is:

```
if min is None and max is None:
    torch._check_is_size(size)
    return
```
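For context, here is a minimal runnable sketch of the control flow this change introduces. The helper names below (`check_is_size`, `guard_size_in_range`) are illustrative stand-ins, not the actual `torch._check_*` implementations:

```python
def check_is_size(size):
    # Stand-in for torch._check_is_size: a size must be a non-negative int.
    if not (isinstance(size, int) and size >= 0):
        raise ValueError(f"expected a size, got {size!r}")

def guard_size_in_range(size, min=None, max=None):
    # The relanded special case: when no bounds are requested, only assert
    # that `size` is a valid size and return early, instead of emitting
    # range guards for the degenerate (None, None) bounds.
    if min is None and max is None:
        check_is_size(size)
        return
    lo = 0 if min is None else min
    if size < lo or (max is not None and size > max):
        raise ValueError(f"size {size} out of range [{lo}, {max}]")
```

With both bounds omitted, only the is-size check runs, so no redundant range guards are generated.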

Partially addresses #128150

When you have big sums of values, we end up computing long chains of
binary addition in our FX graph representation. Not only is this ugly,
it also is quadratic, as the sympy.Add constructor is O(N) in number
of arguments. Instead, ensure that we maintain the summation as a
single FX node so we can do the entire addition all in one go.
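To make the quadratic behavior concrete, here is a small self-contained illustration (plain Python, not PyTorch or sympy code): folding N terms through a binary constructor that is O(number-of-args) per call does O(N²) total work, while a single n-ary sum is one O(N) pass.

```python
def chained_add(terms):
    # Fold with binary '+' nodes; each step rebuilds the argument list,
    # mimicking a constructor (like sympy.Add) that is O(len(args)).
    work = 0
    acc = []
    for t in terms:
        acc = acc + [t]      # copies len(acc) elements
        work += len(acc)
    return sum(acc), work

def nary_add(terms):
    # Build a single n-ary sum node in one pass.
    terms = list(terms)
    return sum(terms), len(terms)

value1, work1 = chained_add(range(1000))
value2, work2 = nary_add(range(1000))
assert value1 == value2       # same result...
assert work1 > 100 * work2    # ...but quadratic vs. linear work
```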

Signed-off-by: Edward Z. Yang [email protected]

cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10 @ezyang @SherlockNoMad @EikanWang @wenzhe-nrv @voznesenskym @penguinwu @Guobing-Chen @zhuhaozhe @blzheng @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang @aakhundov @rec

@pytorch-bot

pytorch-bot bot commented Oct 23, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/138660

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEV

There is 1 currently active SEV. If your PR is affected, please view it below:

✅ You can merge normally! (1 Unrelated Failure)

As of commit c84b2af with merge base 72dde6e:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added the ciflow/inductor, module: cpu, module: dynamo, module: inductor, and release notes: fx labels Oct 23, 2024
@laithsakka
Contributor Author

@pytorchbot merge -i

@pytorch-bot pytorch-bot bot added the ciflow/trunk label Oct 23, 2024
@pytorchmergebot
Collaborator

Merge started

Your change will be merged while ignoring the following 18 checks: pull / linux-focal-py3.12-clang10-experimental-split-build / test (default, 1, 3, linux.4xlarge), Lint / lintrunner-noclang / linux-job, inductor / linux-jammy-cpu-py3.9-gcc11-inductor / test (cpu_inductor_torchbench, 2, 2, linux.12xlarge), inductor / linux-jammy-cpu-py3.9-gcc11-inductor / test (cpu_inductor_freezing_torchbench, 2, 2, linux.12xlarge), inductor / linux-jammy-cpu-py3.9-gcc11-inductor / test (cpu_inductor_amp_freezing_torchbench, 2, 2, linux.16xlarge.spr), inductor / linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_torchbench, 2, 2, linux.12xlarge), inductor / linux-jammy-cpu-py3.9-gcc11-inductor / test (cpu_aot_inductor_freezing_torchbench, 2, 2, linux.12xlarge), inductor / linux-jammy-cpu-py3.9-gcc11-inductor / test (cpu_aot_inductor_amp_freezing_torchbench, 2, 2, linux.12xlarge), inductor / linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_aot_inductor_freezing_torchbench, 2, 2, linux.12xlarge), inductor / linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_aot_inductor_amp_freezing_torchbench, 2, 2, linux.12xlarge), inductor / linux-jammy-cpu-py3.9-gcc11-inductor / test (cpu_inductor_freezing_avx2_torchbench, 2, 2, linux.10xlarge.avx2), inductor / cuda12.1-py3.10-gcc9-sm86 / test (inductor_torchbench, 2, 2, linux.g5.4xlarge.nvidia.gpu), inductor / cuda12.1-py3.10-gcc9-sm86 / test (dynamic_inductor_torchbench, 2, 2, linux.g5.4xlarge.nvidia.gpu), inductor / cuda12.1-py3.10-gcc9-sm86 / test (aot_inductor_torchbench, 2, 2, linux.g5.4xlarge.nvidia.gpu), inductor-periodic / cuda12.1-py3.10-gcc9-sm86-periodic-dynamo-benchmarks / test (dynamo_eager_torchbench, 2, 2, linux.g5.4xlarge.nvidia.gpu), inductor-periodic / cuda12.1-py3.10-gcc9-sm86-periodic-dynamo-benchmarks / test (aot_eager_torchbench, 2, 2, linux.g5.4xlarge.nvidia.gpu), inductor-periodic / cuda12.1-py3.10-gcc9-sm86-periodic-dynamo-benchmarks / test (dynamic_aot_eager_torchbench, 2, 2, linux.g5.4xlarge.nvidia.gpu), inductor-periodic / cuda12.1-py3.10-gcc9-sm80 / test (inductor_torchbench_smoketest_perf, 1, 1, linux.gcp.a100)

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here

laithsakka added a commit that referenced this pull request Oct 23, 2024
ghstack-source-id: a170651
Pull Request resolved: #138660
@laithsakka
Contributor Author

rebase

Comment on lines +1016 to +1046
```
# Special case for sum on tuple/list of ints
if (
    self.fn is builtins.sum
    and len(args) == 1
    and not kwargs
    and isinstance(args[0], (variables.ListVariable, variables.TupleVariable))
    and all(
        (isinstance(x, variables.ConstantVariable) and isinstance(x.value, int))
        or (isinstance(x, variables.SymNodeVariable) and x.python_type() is int)
        for x in args[0].items
    )
):
    return variables.SymNodeVariable.create(
        tx,
        tx.output.create_proxy(
            "call_function",
            torch.sym_sum,
            (tuple(a.as_proxy() for a in args[0].items),),
            {},
        ),
        sym_num=torch.sym_sum(
            [
                (
                    x.value
                    if isinstance(x, variables.ConstantVariable)
                    else x.sym_num
                )
                for x in args[0].items
            ]
        ),
    )
```
Collaborator


I'd like to create a dispatch registry to make the code easier to maintain.

```
check_fn, dispatch_fn = self.shortcuts.get(self.fn, (None, None))
if check_fn is not None and check_fn(args, kwargs):
    return dispatch_fn(self, tx, args, kwargs)
```
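For illustration, a runnable sketch of that registry pattern (the names `SHORTCUTS` and `call_function` below are hypothetical stand-ins, not the actual Dynamo API):

```python
import builtins

def _is_int_seq_sum(args, kwargs):
    # check_fn: exactly one positional list/tuple argument of plain ints
    return (
        len(args) == 1
        and not kwargs
        and isinstance(args[0], (list, tuple))
        and all(isinstance(x, int) for x in args[0])
    )

def _dispatch_sum(args, kwargs):
    # dispatch_fn: handle the recognized special case directly
    return sum(args[0])

# fn -> (check_fn, dispatch_fn); adding a new special case is one entry
SHORTCUTS = {builtins.sum: (_is_int_seq_sum, _dispatch_sum)}

def call_function(fn, args, kwargs):
    check_fn, dispatch_fn = SHORTCUTS.get(fn, (None, None))
    if check_fn is not None and check_fn(args, kwargs):
        return dispatch_fn(args, kwargs)
    return fn(*args, **kwargs)  # generic fallback path
```

The benefit of the registry is that each special case is data (one dict entry) rather than another `if` branch inline in the call-dispatch logic.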

@laithsakka
Contributor Author

@pytorchbot merge

@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here

SamGinzburg pushed a commit that referenced this pull request Oct 28, 2024
Pull Request resolved: #138660
Approved by: https://github.com/ezyang, https://github.com/bobrenjc93
@github-actions github-actions bot deleted the gh/laithsakka/89/head branch November 23, 2024 02:06

Labels

ciflow/inductor, ciflow/trunk, fx, Merged, module: cpu, module: dynamo, module: inductor, release notes: fx

7 participants