
Conversation

@desertfire
Contributor

@desertfire desertfire commented Sep 27, 2024

Stack from ghstack (oldest at bottom):

Summary: In standalone mode, TORCH_CHECK throws std::runtime_error instead of c10::Error. The goal is to cut the dependency on libtorch. Specifically, AOTI generates CPU code that may call ATen vectorization ops, and we need to make sure those ops are self-contained.

Differential Revision: [D63911928](https://our.internmc.facebook.com/intern/diff/D63911928)

cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10 @voznesenskym @penguinwu @EikanWang @Guobing-Chen @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @chauhang
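
To make the intent concrete, here is a minimal sketch of a self-contained check macro along these lines; the macro and helper names below are made up for illustration and are not the actual symbols added by this PR.

```cpp
// Illustrative sketch only: a check macro that throws std::runtime_error and
// depends on nothing beyond the C++ standard library (no c10::Error / libtorch).
// STANDALONE_CHECK expects at least one message argument after the condition.
#include <sstream>
#include <stdexcept>
#include <string>

namespace detail {
template <typename... Args>
std::string concat(const Args&... args) {
  std::ostringstream oss;
  (oss << ... << args);  // C++17 fold expression stringifies all arguments
  return oss.str();
}
}  // namespace detail

#define STANDALONE_CHECK(cond, ...)                                        \
  do {                                                                     \
    if (!(cond)) {                                                         \
      throw std::runtime_error(detail::concat(                             \
          "Check failed: ", #cond, " at ", __FILE__, ":", __LINE__, ". ",  \
          __VA_ARGS__));                                                   \
    }                                                                      \
  } while (0)
```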

Summary: Use TORCH_CHECK_STD_ERROR which throws std::runtime_error, instead of using TORCH_CHECK which throws c10::Error, to cut dependency on libtorch. Specifically, Inductor CPU backend may generate cpp code that calls ATen vectorization ops and we need to make sure those ops don't call TORCH_CHECK.

[ghstack-poisoned]
@pytorch-bot

pytorch-bot bot commented Sep 27, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/136873

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit db42a23 with merge base d1b87e2:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added the module: cpu label Sep 27, 2024
@desertfire desertfire added the topic: not user facing label Sep 27, 2024
Collaborator

@albanD albanD left a comment


I'm a bit surprised by this change. c10::Error gives a lot of niceties like stack trace tracking, internal logging, TORCH_SHOW_CPP_STACKTRACES, etc.
Also, is this going to change the error we give to Python users?

@desertfire
Contributor Author

I'm a bit surprised by this change. c10::Error gives a lot of niceties like stack trace tracking, internal logging, TORCH_SHOW_CPP_STACKTRACES, etc. Also, is this going to change the error we give to Python users?

I think we have to give up some properties if we want to make Vectorized ABI-compatible. The first attempt I made was to make c10::Error header-only, but that is not easy with something like:

// PyTorch-style Error constructor. NB: the implementation of this
// is actually in Logging.cpp
Error(SourceLocation source_location, std::string msg);

I am open to other suggestions if you have a better idea.
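
To illustrate the constraint (a toy sketch, not a proposal): a header-only error type has to define everything, including its constructor, inline in the header, whereas c10::Error's constructor is compiled into Logging.cpp.

```cpp
// Toy illustration of what "header-only" requires: the constructor body lives
// entirely in the header. This is not a suggested replacement for c10::Error.
#include <stdexcept>
#include <string>

struct HeaderOnlyError : std::runtime_error {
  HeaderOnlyError(const char* file, int line, const std::string& msg)
      : std::runtime_error(std::string(file) + ":" + std::to_string(line) +
                           ": " + msg) {}
};
```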

@albanD
Collaborator

albanD commented Oct 2, 2024

Making this particular call go through the C API shim should be relatively simple, though, given the types involved, right?

@desertfire
Contributor Author

Making this particular call go through the C API shim should be relatively simple, though, given the types involved, right?

If we give up on supporting varargs for this one, then yes.

@albanD
Collaborator

albanD commented Oct 2, 2024

Oh, we were actually chatting about this one with Jane earlier. Wouldn't we be able to do the vararg -> string conversion in a header-only C++ layer and make the raw C API take a single raw string?
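
A rough sketch of that shape, for reference; raw_check_failed() is a hypothetical C entry point invented for this example, not the real shim API (which in PyTorch would be the aoti_torch_* C ABI).

```cpp
// Sketch of the suggestion above: do the vararg -> string conversion in a
// header-only C++ helper, then cross the stable C boundary with one string.
#include <cstdio>
#include <cstdlib>
#include <sstream>
#include <string>

// Hypothetical C entry point taking a single raw string.
extern "C" void raw_check_failed(const char* file, int line, const char* msg);

template <typename... Args>
inline void check_or_fail(bool cond, const char* file, int line,
                          const Args&... args) {
  if (cond) {
    return;
  }
  std::ostringstream oss;
  (oss << ... << args);  // header-only message formatting
  const std::string msg = oss.str();
  raw_check_failed(file, line, msg.c_str());  // one raw string across the C API
}

// Stub definition so the example is complete; a real shim would translate this
// into whatever error mechanism the runtime provides.
extern "C" void raw_check_failed(const char* file, int line, const char* msg) {
  std::fprintf(stderr, "%s:%d: check failed: %s\n", file, line, msg);
  std::abort();
}
```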

@desertfire
Contributor Author

AOTI_TORCH_CHECK already does that. I could use that for now, but there will still be a need to make these utility Vectorized implementations self-contained, i.e. libtorch-independent.

What if I define TORCH_CHECK based on a macro, e.g. ABI_COMPATIBLE or LIBTORCH_INDEPENDENT, and only define that macro when Inductor compiles the generated cpp code? That way, the default Python build will still use the current TORCH_CHECK, and AOTI will still see those headers as libtorch-independent.
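
A sketch of that gating idea, assuming a guard macro named LIBTORCH_INDEPENDENT (one of the candidate names floated above); the dispatching macro here is illustrative rather than the code this PR landed.

```cpp
// Illustrative gating only. A normal libtorch build expands to the existing
// TORCH_CHECK; when Inductor compiles generated cpp code with
// LIBTORCH_INDEPENDENT defined, it falls back to plain std::runtime_error.
#ifdef LIBTORCH_INDEPENDENT

#include <sstream>
#include <stdexcept>

#define GENERATED_CODE_CHECK(cond, msg)                    \
  do {                                                     \
    if (!(cond)) {                                         \
      std::ostringstream oss_;                             \
      oss_ << "Check failed: " << #cond << ": " << (msg);  \
      throw std::runtime_error(oss_.str());                \
    }                                                      \
  } while (0)

#else

#include <c10/util/Exception.h>  // provides the regular TORCH_CHECK

#define GENERATED_CODE_CHECK(cond, msg) TORCH_CHECK(cond, msg)

#endif
```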

@desertfire desertfire changed the title from [AOTI] Add TORCH_CHECK_STD_ERROR to [AOTI] Add standalone version of TORCH_CHECK Oct 4, 2024
desertfire added a commit that referenced this pull request Oct 4, 2024
Summary: In the standalone mode, TORCH_CHECK throws std::runtime_error, instead of c10::Error. The goal is to cut dependency on libtorch. Specifically, AOTI generates CPU code which may call ATen vectorization ops and we need to make sure those ops are self-contained.

ghstack-source-id: 5707c8a
Pull Request resolved: #136873
@desertfire desertfire requested a review from albanD October 4, 2024 17:13
Collaborator

@albanD albanD left a comment


Sounds ok as a temporary unblock; let's discuss the broader plan offline, and how to ensure this is preserved.

@desertfire
Contributor Author

@desertfire has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@desertfire desertfire requested a review from chenyang78 October 5, 2024 12:46
@pytorch-bot pytorch-bot bot added the ciflow/trunk label Oct 6, 2024
desertfire added a commit that referenced this pull request Oct 6, 2024
Summary: In the standalone mode, TORCH_CHECK throws std::runtime_error, instead of c10::Error. The goal is to cut dependency on libtorch. Specifically, AOTI generates CPU code which may call ATen vectorization ops and we need to make sure those ops are self-contained.

ghstack-source-id: 91649dd
Pull Request resolved: #136873
@desertfire
Contributor Author

@desertfire has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot
Contributor

@pytorchbot merge -f 'Landed internally'

(Initiating merge automatically since Phabricator Diff has merged, using force because this PR might not pass merge_rules.json but landed internally)

@pytorchmergebot
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here

KnAwnime pushed a commit to KnAwnime/Biblioteka that referenced this pull request Oct 16, 2024
Summary: In the standalone mode, TORCH_CHECK throws std::runtime_error, instead of c10::Error. The goal is to cut dependency on libtorch. Specifically, AOTI generates CPU code which may call ATen vectorization ops and we need to make sure those ops are self-contained.

ghstack-source-id: 6dab1f7
Pull Request resolved: pytorch/pytorch#136873

Labels

ciflow/inductor, ciflow/trunk, Merged, module: cpu, module: inductor, topic: not user facing


6 participants