
test only smaller block_k for mm_plus_mm #96385

Closed
ngimel wants to merge 2 commits into master from ngimel/mm_plus_mm_config

Conversation

@ngimel
Collaborator

@ngimel ngimel commented Mar 9, 2023

@pytorch-bot

pytorch-bot bot commented Mar 9, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/96385

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Merge Blocking SEV

There is 1 active merge blocking SEV. Please view it below:

If you must merge, use @pytorchbot merge -f.

❌ 1 Failures

As of commit fe000f8:

NEW FAILURES - The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Contributor

@bertmaher bertmaher left a comment


Nice, thank you for figuring this out!

Just a few comments inline:

Comment on lines 79 to 80
# Splitting this into two loops causes an internal triton LLVM error
# https://github.com/openai/triton/issues/967
Contributor


This comment is stale now, right?

Collaborator Author


Yep, deleted

Comment on lines 94 to 96
# rematerialize rm and rn to save registers
rm = pid_m * BLOCK_M + tl.arange(0, BLOCK_M)
rn = pid_n * BLOCK_N + tl.arange(0, BLOCK_N)
#rm = pid_m * BLOCK_M + tl.arange(0, BLOCK_M)
#rn = pid_n * BLOCK_N + tl.arange(0, BLOCK_N)
Contributor


Is this rematerialization a bad idea now, or is it temporary? We should probably either delete it or drop in a comment describing why it's temporary.

Collaborator Author


Didn't see any difference with or without it; deleted
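For readers unfamiliar with the pattern under discussion: `rm` and `rn` are the row/column index vectors of the output tile that a given program instance computes, and "rematerializing" them means recomputing them after the K loop instead of keeping them live in registers across it. A minimal pure-Python sketch of what those two lines compute (NumPy stands in for `tl.arange`; the block sizes and pid values here are illustrative, not the kernel's actual configs):

```python
# Pure-Python illustration (not real Triton) of:
#   rm = pid_m * BLOCK_M + tl.arange(0, BLOCK_M)
#   rn = pid_n * BLOCK_N + tl.arange(0, BLOCK_N)
import numpy as np

BLOCK_M, BLOCK_N = 4, 4  # illustrative tile sizes

def tile_indices(pid_m, pid_n):
    # Row indices covered by program (pid_m, pid_n)'s output tile
    rm = pid_m * BLOCK_M + np.arange(BLOCK_M)
    # Column indices covered by the same tile
    rn = pid_n * BLOCK_N + np.arange(BLOCK_N)
    return rm, rn

rm, rn = tile_indices(pid_m=2, pid_n=1)
print(rm.tolist())  # [8, 9, 10, 11]
print(rn.tolist())  # [4, 5, 6, 7]
```

Because `rm`/`rn` are cheap arithmetic on the program id, recomputing them is often free relative to the registers they would otherwise occupy through the K loop; the review outcome above was that it made no measurable difference here.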

(mat1, mat2, mat3, mat4),
layout,
**mm_options(config, k, layout),
if config.kwargs['BLOCK_K'] < k:
Contributor


Maybe add a comment with a pointer to the triton issue so we can revisit someday.

Collaborator Author


Added
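The guard being discussed, `if config.kwargs['BLOCK_K'] < k:`, skips any autotune config whose K-block covers the whole reduction dimension, so only smaller `block_k` values are tried (the point of this PR). A hedged sketch of that filtering step; the `Config` structure here is illustrative and not the actual inductor API:

```python
# Sketch of filtering autotune configs to BLOCK_K < k, per the
# guard in this PR. Config is a stand-in for inductor's config
# objects, which expose tuning parameters via a `kwargs` dict.
from dataclasses import dataclass, field

@dataclass
class Config:
    kwargs: dict = field(default_factory=dict)

def usable_configs(configs, k):
    # Mirrors `if config.kwargs['BLOCK_K'] < k:` -- keep only
    # configs that split K into more than one block.
    return [c for c in configs if c.kwargs["BLOCK_K"] < k]

configs = [Config({"BLOCK_K": 16}), Config({"BLOCK_K": 32}), Config({"BLOCK_K": 64})]
kept = usable_configs(configs, k=64)
print([c.kwargs["BLOCK_K"] for c in kept])  # [16, 32]
```

With `k = 64`, the `BLOCK_K = 64` config is dropped because it would load the entire K dimension in one block, the case that triggered the Triton issue referenced in the review.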

@ngimel
Collaborator Author

ngimel commented Mar 9, 2023

@pytorchbot merge -f "dla102 test flaky"

@pytorchmergebot
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.


4 participants