Conversation

anijain2305 (Contributor) commented Oct 25, 2024

Stack from ghstack (oldest at bottom):

This is another unsound guard eval optimization. It's rare in practice to
compile a function with two different parameters as inputs, and then
later call the function with a single parameter aliased to both inputs.
This further reduces guard overhead from 280 us to 240 us
for the model in #138386

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @chenyang78 @kadeng @chauhang @amjames @rec
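
A minimal sketch of the calling pattern this optimization assumes away (illustrative only; `f`, `p1`, and `p2` are hypothetical, not code from this PR):

```python
import torch

@torch.compile
def f(a, b):
    return a + b

p1 = torch.nn.Parameter(torch.randn(4))
p2 = torch.nn.Parameter(torch.randn(4))

f(p1, p2)  # compiled with two distinct parameters as inputs
f(p1, p1)  # later call aliases one parameter to both inputs; skipping the
           # aliasing guard means this unsoundly reuses the first graph
```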

pytorch-bot bot commented Oct 25, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/138954

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 1 Unrelated Failure

As of commit b399150 with merge base f9ae3fa:

NEW FAILURE - The following job has failed:

BROKEN TRUNK - The following job failed but was present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

…ers"

This is another unsound guard eval optimization. Its rare in practice to
compile a function with two different parameters as inputs, and then
later call the function with the same input as two different inputs
(aliasing). This further reduces guard overhead from 280 us to 240 us
for the model in #138386

cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames rec

[ghstack-poisoned]
…ers"

This is another unsound guard eval optimization. Its rare in practice to
compile a function with two different parameters as inputs, and then
later call the function with the same input as two different inputs
(aliasing). This further reduces guard overhead from 280 us to 240 us
for the model in #138386

cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames rec

[ghstack-poisoned]
ezyang (Contributor) commented Oct 28, 2024

I am sympathetic to this but it doesn't seem like it should be taking 40us to test this in the first place

else:
    self.tensor_check_examples.append(value)
    self.tensor_check_names.append(tensor_name)
    self.tensor_check_guards.append(guard)
Reviewer (Contributor) commented:
There's a refactor going on here that makes it hard to see the substantive change that happened here

anijain2305 (Author) replied:
I can pull the refactor in a separate PR.

    add_code_part(code, gcl.guard, True)
    seen.add(code)

tensor_check_names = builder.tensor_check_names
Reviewer (Contributor) commented:
Does the code in torch/csrc/dynamo/guards.cpp referencing tensor_check_names have to be adjusted?

Also, since this happened before, can you explicitly make sure our verbose reporting (e.g., in logs and tlparse) did not regress with these changes? (The switch to cpp guard manager caused us to lose all tlparse guard output)
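
For context, one minimal way to spot-check that guard logging still appears after a change like this (a sketch assuming the standard `torch._logging` artifacts; not a procedure from this thread):

```python
import torch
import torch._logging

# Enable the "guards" logging artifact so installed guards are printed.
torch._logging.set_logs(guards=True)

@torch.compile
def f(x):
    return x * 2

f(torch.randn(4))  # the emitted guard list should still include the
                   # per-tensor checks; tlparse gives the structured view
```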

anijain2305 (Author) commented Oct 28, 2024

> I am sympathetic to this but it doesn't seem like it should be taking 40us to test this in the first place

The quantized model in question has a very large number of parameters, so we end up calling this guard for every parameter. There is not much going on in the guard itself; it's just the sheer quantity of them.
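
A toy stand-in for that shape of workload (assumed setup; `ManyParams` is hypothetical, not the quantized model from #138386):

```python
import torch

class ManyParams(torch.nn.Module):
    def __init__(self, n=512):
        super().__init__()
        # One small parameter per "layer": each gets its own tensor guard.
        self.params = torch.nn.ParameterList(
            torch.nn.Parameter(torch.randn(8)) for _ in range(n)
        )

    def forward(self, x):
        for p in self.params:
            x = x + p
        return x

m = torch.compile(ManyParams())
m(torch.randn(8))  # first call compiles and installs ~n tensor guards
m(torch.randn(8))  # every later call evaluates all of them, so even a
                   # cheap per-guard check scales linearly with n
```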

…ers"


This is another unsound guard eval optimization. Its rare in practice to
compile a function with two different parameters as inputs, and then
later call the function with one parameter input as two different inputs
(aliasing). This further reduces guard overhead from 280 us to 240 us
for the model in #138386

cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames rec

[ghstack-poisoned]
…ers"


This is another unsound guard eval optimization. Its rare in practice to
compile a function with two different parameters as inputs, and then
later call the function with one parameter input as two different inputs
(aliasing). This further reduces guard overhead from 280 us to 240 us
for the model in #138386

cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames rec

[ghstack-poisoned]
anijain2305 requested a review from ezyang on October 28, 2024 17:35
pytorchmergebot pushed a commit that referenced this pull request Oct 29, 2024
This brings some unsoundness in guards. Earlier we were skipping the empty nn module hooks dict guard only on inbuilt nn modules, but as seen in #138386, there could still be significant guard overhead. With this PR, we reduce the guard eval latency from 420 us to 280 us (1.5x reduction).

Pull Request resolved: #138942
Approved by: https://github.com/ezyang, https://github.com/jansel
ghstack dependencies: #139040, #138954
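
For background on the hooks guard mentioned in that commit: every `nn.Module` carries hook dicts that can be populated after compilation, which is why skipping the emptiness guard is unsound (an illustrative sketch, not code from #138942):

```python
import torch

m = torch.nn.Linear(4, 4)
print(len(m._forward_hooks))  # 0: the hooks dict is empty by default

# Registering a hook later changes m's behavior, which the emptiness
# guard on the hooks dict is meant to catch.
m.register_forward_hook(lambda mod, inp, out: out * 2)
print(len(m._forward_hooks))  # 1: the module's output is now doubled
```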
rahulsingh-intel pushed a commit to rahulsingh-intel/pytorch that referenced this pull request Oct 29, 2024
rahulsingh-intel pushed a commit to rahulsingh-intel/pytorch that referenced this pull request Oct 29, 2024
rahulsingh-intel pushed a commit to rahulsingh-intel/pytorch that referenced this pull request Nov 5, 2024
rahulsingh-intel pushed a commit to rahulsingh-intel/pytorch that referenced this pull request Nov 5, 2024
github-actions bot deleted the gh/anijain2305/562/head branch November 29, 2024 02:12