[dynamo] Add BACKEND_MATCH guard to detect and recompile when backend changes #107337
Conversation
🔗 Helpful links: see artifacts and rendered test results at hud.pytorch.org/pr/107337. Note: links to docs will display an error until the doc builds have completed.

✅ You can merge normally! (1 unrelated failure.) As of commit ce3c4fc with merge base 916183a. UNSTABLE: the following job failed, but was likely due to flakiness present on trunk and has been marked as unstable.

This comment was automatically generated by Dr. CI and updates every 15 minutes.
…hen backend changes"

**Motivation:** We want torch.cond to use torch.compile automatically, so that we can error out when there are side effects in the branches and correctly handle closures. Before this PR, the following code produces a warning unless the config raise_on_backend_change is turned on (turning it on gives an error instead):

```python
def foo():
    ...

# Inside torch.cond, we'd like to do something like
torch.compile(foo, backend="eager", fullgraph=True)(...)

# Users may then call torch.compile somewhere else.
# Dynamo will use the cached code of foo for the "eager" backend,
# but we expect dynamo to recompile with the "inductor" backend.
torch.compile(foo, backend="inductor")(...)
```

This PR adds a BACKEND_MATCH guard. Effectively, it implements a per-backend cache: in the example above, the code cached for "eager" fails the guard check under "inductor", so the second torch.compile triggers a recompilation. In the future, it might be useful to have something like a configuration guard that guards against dynamo configuration changes across different compiles (e.g. compile a function with fullgraph=False, then compile it again with fullgraph=True).

**Implementation:** We add a guarded_backend_cache and check the most_recent_backend against the backend associated with the cached code. We also remove the raise_on_backend_change flag.

**Test Plan:** Removed the original tests that raise on a different backend, and added a new test checking that the BACKEND_MATCH guard can guard against a backend change.

cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ipiszy ngimel yf225 chenyang78 kadeng muchulee8 aakhundov anijain2305 [ghstack-poisoned]
torch/_dynamo/eval_frame.py (outdated):

```python
            "calling `torch._dynamo.reset()` to take effect"
        )
    most_recent_backend = compiler_fn
    guarded_backend_cache[id(most_recent_backend)] = most_recent_backend
```
It seems like it would be better to have a thread local which is set when we enter a torch.compile region specifying what the current backend is. At the very least it needs to be thread local, which it doesn't seem like most_recent_backend is. (Yes, I know some of this is preexisting problems, but now you know.)
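The thread-local approach suggested here can be sketched in plain Python. The names below (`set_current_backend`, `backend_matches`, `_tls`) are hypothetical illustrations, not Dynamo's actual internals:

```python
import threading

# Hypothetical sketch: track the currently active backend in thread-local
# storage, set on entry to a torch.compile region, so that concurrent
# threads using different backends don't clobber each other's notion of
# "current backend".

_tls = threading.local()

class set_current_backend:
    """Context manager that records the active backend for this thread only."""
    def __init__(self, backend):
        self.backend = backend
    def __enter__(self):
        self.prev = getattr(_tls, "backend", None)
        _tls.backend = self.backend
    def __exit__(self, *exc):
        _tls.backend = self.prev

def backend_matches(expected):
    # What a BACKEND_MATCH-style guard would check at call time.
    # (The real guard compares the actual backend objects; string names
    # are used here only for illustration.)
    return getattr(_tls, "backend", None) == expected

results = {}
def worker(name, backend):
    with set_current_backend(backend):
        results[name] = backend_matches(backend)

t1 = threading.Thread(target=worker, args=("t1", "eager"))
t2 = threading.Thread(target=worker, args=("t2", "inductor"))
t1.start(); t2.start(); t1.join(); t2.join()
assert results == {"t1": True, "t2": True}
```

Each thread sees only its own `_tls.backend`, so a guard checked inside one thread is unaffected by another thread entering a different backend's compile region.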
I've modified it accordingly and added a test for multiple threads, but I'm not sure the implementation is good enough.
@pytorchbot revert -m 'Sorry for reverting your change but inductor perf smoke test starts to regress after this' -c ignoresignal

Here is an example failure: https://hud.pytorch.org/pytorch/pytorch/commit/1a64ec7dd48408d6839a5c2cceb55b0c4be2243b

@pytorchbot revert -m 'Sorry for reverting your change but inductor perf smoke test starts to regress after this' -c ignoredsignal

@pytorchbot successfully started a revert job. Check the current status here.

@ydwu4 your PR has been successfully reverted.
… backend changes (#107337)" This reverts commit 1a64ec7. Reverted #107337 on behalf of https://github.com/huydhn due to: inductor perf smoke test starts to regress after this.
@pytorchbot merge

Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Stack from ghstack (oldest at bottom):
Motivation:
We want torch.cond to use torch.compile automatically, so that we can error out when there are side effects in the branches and correctly handle closures.
Before this PR, code that compiles the same function with two different backends (as in the snippet earlier in the thread) only produced a warning, unless the config raise_on_backend_change was turned on, in which case it raised an error.
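The code example this refers to appears flattened in the commit messages earlier in the thread; restored here for readability (an illustrative fragment, not runnable as-is, since it elides the function body and call arguments):

```python
def foo():
    ...

# Inside torch.cond, we'd like to do something like
torch.compile(foo, backend="eager", fullgraph=True)(...)

# Users may then call torch.compile somewhere else.
# Dynamo will use the cached code of foo for the "eager" backend,
# but we expect dynamo to recompile with the "inductor" backend.
torch.compile(foo, backend="inductor")(...)
```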
This PR adds a BACKEND_MATCH guard. Effectively, it implements a per-backend cache. In the above example, the cached code for "eager" won't work for "inductor" due to guard check failures and the second torch.compile will do a re-compilation. In the future, it might be useful to have something like a configuration guard that guards against dynamo configuration changes across different compiles (e.g. compile a function with fullgraph=False then compile it again with fullgraph=True).
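The per-backend cache behavior can be sketched in plain Python. The class and names below are hypothetical illustrations of the idea, not Dynamo's actual cache or guard machinery:

```python
# Hypothetical sketch of a per-backend guarded cache in the spirit of
# BACKEND_MATCH: cached code is reused only when the backend recorded at
# compile time matches the backend of the current call.

class PerBackendCache:
    def __init__(self):
        # function name -> list of (backend, "compiled artifact") entries
        self.entries = {}
        self.recompiles = 0

    def lookup(self, fn_name, backend):
        for cached_backend, code in self.entries.get(fn_name, []):
            if cached_backend == backend:  # the BACKEND_MATCH-style check
                return code
        return None  # guard failed for every cached entry

    def compile(self, fn_name, backend):
        code = self.lookup(fn_name, backend)
        if code is None:  # cache miss -> recompile for this backend
            self.recompiles += 1
            code = f"{fn_name}@{backend}"
            self.entries.setdefault(fn_name, []).append((backend, code))
        return code

cache = PerBackendCache()
cache.compile("foo", "eager")     # first compile, for eager
cache.compile("foo", "eager")     # guard passes, cache hit
cache.compile("foo", "inductor")  # guard fails, recompiles
assert cache.recompiles == 2
assert cache.compile("foo", "eager") == "foo@eager"  # eager entry still valid
```

Note that the eager entry is not evicted when inductor compiles: both backends keep their own cached code, and switching back to eager hits the cache again rather than recompiling a third time.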
Implementation:
We add a guarded_backend_cache and check the most_recent_backend against the backend associated with the cached code. We also remove the raise_on_backend_change flag.
Note: the newly added context manager and guard print more lines in the debug log, so we raise the upper limit from 50 to 55.
Test Plan:
Removed the original tests that raise on a different backend, and added a new test checking that the BACKEND_MATCH guard can guard against a backend change.
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @ngimel @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov @anijain2305