[Inductor] Represent tiling as a dict #141751

blaine-rister · 2024-11-28T04:41:06Z

Summary

Preparatory refactor for #137243. This makes it easier to generalize to multi-dimensional reductions.

This diff refactors self.numels from a tuple like (8,16) to a dict like {"x": 8, "r": 16}.

Note: this is based off of #141738, which enables tree.is_reduction. That PR should land first.

Test plan

The existing CI provides good coverage.

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang @aakhundov

pytorch-bot · 2024-11-28T04:41:09Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/141751

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 Cancelled Jobs, 3 Unrelated Failures

As of commit 53c08c1 with merge base ed77901 ():

CANCELLED JOBS - The following jobs were cancelled. Please retry:

pull / linux-jammy-py3.10-clang15-asan / test (default, 2, 6, lf.linux.4xlarge) (gh)
##[error]The operation was canceled.
pull / linux-jammy-py3.10-clang15-asan / test (default, 5, 6, lf.linux.4xlarge) (gh)
##[error]The operation was canceled.

BROKEN TRUNK - The following job failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / linux-focal-py3_9-clang9-xla / test (xla, 1, 1, lf.linux.12xlarge) (gh) (trunk failure)
##[error]Process completed with exit code 128.

UNSTABLE - The following jobs failed but were likely due to flakiness present on trunk and has been marked as unstable:

inductor / cuda12.1-py3.10-gcc9-sm86 / test (inductor_timm, 1, 2, linux.g5.4xlarge.nvidia.gpu, unstable) (gh) (#141703)
convnext_base
inductor / cuda12.4-py3.10-gcc9-sm86 / test (inductor_timm, 1, 2, linux.g5.4xlarge.nvidia.gpu, unstable) (gh) (#141498)
convnext_base

This comment was automatically generated by Dr. CI and updates every 15 minutes.

jansel · 2024-11-30T02:46:45Z

torch/_inductor/codegen/triton.py

                tree.grid_dim += 1

-        sem_count, _ = self.numels
+        sem_count, _ = self.numels.values()


This line should be updated to do a dict access to the x dim.

Updated d3146d1

jansel · 2024-11-30T02:47:07Z

torch/_inductor/codegen/triton.py

        value independently.
        """
-        xnumel, rnumel = self.numels
+        xnumel, rnumel = self.numels.values()


This should be updated to do dict access

Updated d3146d1

torch/_inductor/codegen/simd.py

… brister/tiling_dict

blaine-rister · 2024-11-30T20:29:54Z

@pytorchbot merge

pytorchmergebot · 2024-11-30T20:31:36Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2024-11-30T21:35:20Z

Merge failed

Reason: 1 mandatory check(s) failed. The first few are:

pull / linux-focal-cuda12.4-py3.10-gcc9-sm86 / test (default, 3, 5, lf.linux.g5.4xlarge.nvidia.gpu)

Dig deeper by viewing the failures on hud

Details for Dev Infra team

Raised by workflow job

Failing merge rule: Core Maintainers

blaine-rister · 2024-12-01T06:42:32Z

@pytorchbot merge

pytorchmergebot · 2024-12-01T06:44:19Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2024-12-04T19:30:41Z

Rebase failed due to Command git -C /home/runner/work/pytorch/pytorch rebase refs/remotes/origin/viable/strict pull/141751/head returned non-zero exit code 1

Rebasing (1/6)
Auto-merging torch/_inductor/codegen/halide.py
Auto-merging torch/_inductor/codegen/simd.py
CONFLICT (content): Merge conflict in torch/_inductor/codegen/simd.py
Auto-merging torch/_inductor/codegen/triton.py
CONFLICT (content): Merge conflict in torch/_inductor/codegen/triton.py
Auto-merging torch/_inductor/codegen/triton_combo_kernel.py
error: could not apply 0fc5665c7e3... add helper function to tell if a tree or prefix is a reduction
hint: Resolve all conflicts manually, mark them as resolved with
hint: "git add/rm <conflicted_files>", then run "git rebase --continue".
hint: You can instead skip this commit: run "git rebase --skip".
hint: To abort and get back to the state before "git rebase", run "git rebase --abort".
hint: Disable this message with "git config advice.mergeConflict false"
Could not apply 0fc5665c7e3... add helper function to tell if a tree or prefix is a reduction

Raised by https://github.com/pytorch/pytorch/actions/runs/12166894104

facebook-github-bot · 2024-12-04T19:40:32Z

@blaine-rister has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

blaine-rister · 2024-12-05T02:22:32Z

@pytorchbot merge

pytorchmergebot · 2024-12-05T02:24:16Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2024-12-05T02:24:26Z

Merge failed

Reason: 2 mandatory check(s) failed. The first few are:

Dig deeper by viewing the failures on hud

Details for Dev Infra team

Raised by workflow job

Failing merge rule: Core Maintainers

blaine-rister · 2024-12-05T02:26:21Z

@pytorchbot merge -f "landed internally"

pytorchmergebot · 2024-12-05T02:28:04Z

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

blaine-rister · 2024-12-05T02:32:09Z

@atalman I landed the diff internally, but the changes didn't seem to show up in github. There was some mysterious failure preventing GH from landing which seemed like an infra issue. I went ahead and did a "merge -f" to make sure GH and fbcode were synced. Was that the right course of action?

# Summary Preparatory refactor for pytorch#137243. This makes it easier to generalize to multi-dimensional reductions. This diff refactors `self.numels` from a tuple like `(8,16)` to a dict like `{"x": 8, "r": 16}`. Note: this is based off of pytorch#141738, which enables `tree.is_reduction`. That PR should land first. # Test plan The existing CI provides good coverage. Pull Request resolved: pytorch#141751 Approved by: https://github.com/jansel

This reverts commit 5deca07. Reverted pytorch#141751 on behalf of https://github.com/atalman due to Failing internal builds ([comment](pytorch#141751 (comment)))

# Summary Preparatory refactor for pytorch#137243. This makes it easier to generalize to multi-dimensional reductions. This diff refactors `self.numels` from a tuple like `(8,16)` to a dict like `{"x": 8, "r": 16}`. Note: this is based off of pytorch#141738, which enables `tree.is_reduction`. That PR should land first. # Test plan The existing CI provides good coverage. Pull Request resolved: pytorch#141751 Approved by: https://github.com/jansel

Preparatory refactor for #137243. # Feature Follow up to #141751. Since we now represent `numels` as a dict, it's natural to extend this to `size_hints`. The latter are basically just the former rounded up to the nearest power of 2. This simplifies various heuristics such as the coordinate descent tuner. Where we previously needed to determine which index in `size_hints` corresponds to each dimension, now we can just query by prefix. This will be especially important when we enable 2D reductions, as it becomes harder to keep track of these things when we have multiple reduction dimensions. (See the previous PR for some examples.) # Test plan The existing CI provides good coverage. This PR modifies a few tests which explicitly constructed size hints. Pull Request resolved: #142249 Approved by: https://github.com/jansel

Skylion007 · 2024-12-07T16:27:06Z

This also is screaming to me that this should be a TypedDict given that we have a fixed subset of static keys.

This reverts commit 5deca07. Reverted pytorch#141751 on behalf of https://github.com/atalman due to Failing internal builds ([comment](pytorch#141751 (comment)))

# Summary Preparatory refactor for #137243. This makes it easier to generalize to multi-dimensional reductions. This diff refactors `self.numels` from a tuple like `(8,16)` to a dict like `{"x": 8, "r": 16}`. Note: this is based off of #141738, which enables `tree.is_reduction`. That PR should land first. # Test plan The existing CI provides good coverage. Pull Request resolved: #141751 Approved by: https://github.com/jansel

Preparatory refactor for #137243. # Feature Follow up to #141751. Since we now represent `numels` as a dict, it's natural to extend this to `size_hints`. The latter are basically just the former rounded up to the nearest power of 2. This simplifies various heuristics such as the coordinate descent tuner. Where we previously needed to determine which index in `size_hints` corresponds to each dimension, now we can just query by prefix. This will be especially important when we enable 2D reductions, as it becomes harder to keep track of these things when we have multiple reduction dimensions. (See the previous PR for some examples.) # Test plan The existing CI provides good coverage. This PR modifies a few tests which explicitly constructed size hints. Pull Request resolved: #142249 Approved by: https://github.com/jansel

Fixes #134277 and #142317. Sub-PRs containing refactors from this one: - #141733 - #141738 - #141751 (based off the former) - #142249 - #142020 - #143135 These refactor PRs should land before the main one. # Feature *Note: to minimize risk, multi-dimensional reductions are gated by the flag `config.triton.tile_reductions`, which defaults to False.* Instead of having a single reduction dimension called `"r"`, we can now support 2D reductions with `"r0_"` and `"r1_"` dimensions. 2D reductions generate two nested loops, with different block pointer advancements in each loop body. Most of the implementation is generic to ND reductions, but for now the tiling algorithm sets a hard limit at 2D. Here's an example of a 2D persistent reduction kernel: ``` @triton.jit def triton_per_fused_sum_0(in_ptr0, out_ptr0, xnumel, r0_numel, r1_numel, XBLOCK : tl.constexpr): xnumel = 1 r0_numel = 15 R0_BLOCK: tl.constexpr = 16 r1_numel = 15 R1_BLOCK: tl.constexpr = 16 xoffset = tl.program_id(0) * XBLOCK xindex = xoffset + tl.arange(0, XBLOCK)[:, None, None] xmask = tl.full([XBLOCK, R0_BLOCK, R1_BLOCK], True, tl.int1) r0_index = tl.arange(0, R0_BLOCK)[None, :, None] r0_offset = 0 r0_mask = r0_index < r0_numel r1_index = tl.arange(0, R1_BLOCK)[None, None, :] r1_offset = 0 r1_mask = r1_index < r1_numel rnumel = r0_numel * r1_numel RBLOCK: tl.constexpr = R0_BLOCK*R1_BLOCK roffset = r1_offset + (r0_offset*r1_numel) rindex = r1_index + (r0_index*r1_numel) r0_0 = r0_index r1_1 = r1_index tmp0 = tl.load(tl.make_block_ptr(in_ptr0, shape=[15, 15], strides=[30, 1], block_shape=[R0_BLOCK, R1_BLOCK], order=[1, 0], offsets=[r0_offset, r1_offset]), boundary_check=[0, 1], padding_option='zero')[None, :, :] tmp1 = tl.broadcast_to(tmp0, [XBLOCK, R0_BLOCK, R1_BLOCK]) tmp3 = tl.where(r0_mask & r1_mask, tmp1, 0) tmp4 = tl.reshape(tmp3, [XBLOCK, RBLOCK]) tmp5 = tl.sum(tmp4, 1)[:, None, None] tl.store(out_ptr0 + (tl.full([XBLOCK, 1, 1], 0, tl.int32)), tmp5, None) ''', device_str='cuda') ``` There are a few main differences between this kernel and what Inductor would generate without this PR. - Instead of an `r`/`RBLOCK` dimension, we have two reduction dimensions: `r0_`/`R0_BLOCK` and `r1_`/`R1_BLOCK`. - There are special size and indexing variables for reductions, which don't directly correspond to any kernel dimension. (`rindex`, `rnumel`, `RBLOCK`, and `roffset`.) These collapse N-D reduction sizes and indices indices into 1D. This simplifies the codegen for reductions, which sometimes want to access linear indices instead of N-dimensional ones. Doing things this way allows us to generate N-D loads and stores, but access this data as if it were 1D, minimizing the blast radius of this PR. Although this makes the code more verbose, it shouldn't have a perf impact because the triton compiler eliminates dead code. - We generate the line `tmp4 = tl.reshape(tmp3, [XBLOCK, RBLOCK])` before performing the actual reduction. This reshapes N reduction dimensions into 1D. This allows us to reduce over all N dimensions at once, simplifying the codegen and allowing the Triton complier to decide the order of processing under the hood. Here's an example of a looped reduction: ``` @triton.jit def triton_red_fused_sum_0(in_ptr0, out_ptr0, xnumel, r0_numel, r1_numel, XBLOCK : tl.constexpr, R0_BLOCK : tl.constexpr, R1_BLOCK : tl.constexpr): xnumel = 3 r0_numel = 43 r1_numel = 129 xoffset = tl.program_id(0) * XBLOCK xindex = xoffset + tl.arange(0, XBLOCK)[:, None, None] xmask = xindex < xnumel r0_base = tl.arange(0, R0_BLOCK)[None, :, None] r1_base = tl.arange(0, R1_BLOCK)[None, None, :] rnumel = r0_numel * r1_numel RBLOCK: tl.constexpr = R0_BLOCK*R1_BLOCK rbase = r1_base + (r0_base*r1_numel) x0 = xindex block_ptr0 = tl.make_block_ptr(in_ptr0, shape=[3, 43, 129], strides=[11094, 258, 1], block_shape=[XBLOCK, R0_BLOCK, R1_BLOCK], order=[2, 1, 0], offsets=[xoffset, 0, 0]) _tmp2 = tl.full([XBLOCK, R0_BLOCK, R1_BLOCK], 0, tl.float32) for r0_offset in range(0, r0_numel, R0_BLOCK): r0_index = r0_offset + r0_base r0_mask = r0_index < r0_numel for r1_offset in range(0, r1_numel, R1_BLOCK): r1_index = r1_offset + r1_base r1_mask = r1_index < r1_numel roffset = r1_offset + (r0_offset*r1_numel) rindex = r1_index + (r0_index*r1_numel) r0_1 = r0_index r1_2 = r1_index tmp0 = tl.load(block_ptr0, boundary_check=[0, 1, 2], padding_option='zero', eviction_policy='evict_first') tmp1 = tl.broadcast_to(tmp0, [XBLOCK, R0_BLOCK, R1_BLOCK]) tmp3 = _tmp2 + tmp1 _tmp2 = tl.where(r0_mask & r1_mask & xmask, tmp3, _tmp2) block_ptr0 = tl.advance(block_ptr0, [0, 0, R1_BLOCK]) block_ptr0 = tl.advance(block_ptr0, [0, R0_BLOCK, (-1)*R1_BLOCK*((128 + R1_BLOCK) // R1_BLOCK)]) tmp4 = tl.reshape(_tmp2, [XBLOCK, RBLOCK]) tmp2 = tl.sum(tmp4, 1)[:, None, None] tl.store(tl.make_block_ptr(out_ptr0, shape=[3], strides=[1], block_shape=[XBLOCK], order=[0], offsets=[xoffset]), tl.reshape(tmp2, [XBLOCK]).to(tl.float32), boundary_check=[0]) ''', device_str='cuda') ``` In addition to the aforementioned changes to the persistent reduction, multidimensional looped reductions have a few more lines of code: - They calculate indices inside the loop using `r0_base` and `r1_base`. For compatibility with existing codegen, these are collapsed to the 1D variant `rbase`. - Block pointer advancements are more nuanced for multidimensional loops. At the end of each loop body, we emit a `tl.advance` line which not only increments the pointer in its own dimension, but also undoes the cumulative increments of the previous loop level. This is equivalent to the usual practice in nested loops of starting with a fresh iteration variable at each level. Implementing this required refactoring the way we generate pointer advancements into a new `self.pointer_advancements` field of the kernel, which categorizes advancements by dimension. The biggest difficulty in implementing this feature was that we represented tiling with a tuple like `(5,2)`. In the existing codebase, the compiler can infer that the reduction dimension of `(5,2)` is `2`, since reductions are always the last dimension. This became cumbersome now that we have to support multiple reduction dimensions, so I refactored tiling into a dict like `{"x": 5, "r0_": 2, "r1_": 4}`. This required quite a few code changes, but I don't think it makes the underlying logic much more complex. This will also make it easier to eventually support simultaneous pointwise and reduction tiling, like `{"x": 5, "y": 5, "r0_": 2, "r1_": 4}`. (This is not supported today, but we might want to do it eventually.) The existing tiling algorithm generalized naturally to support reductions. For pointwise kernels, we tile the pointwise dimensions (`"x"`, `"y"`) as is. For reduction kernels, we never tile the `"x"` dimension, and only tile the reduction dimensions (`"r0_"`, `"r1_"`). Thus we only ever tile pointwise OR reduction dimensions, but not both. In principle it seems possible to support both, but it would likely require changes to the kernel fusion and autotuning logic. I thought it best to keep this PR as minimal as possible since it already touched a lot of different files. Unfortunately, these changes weren't enough to get block pointers in some seemingly simple test cases. In some tests for `argmax` and `var_mean`, we already collapse reduction dimensions into 1D and generate modular indexing expressions, prior to tiling. So it's not trivial to figure out how to expand the collapsed reduction dimension back to a shape that would simplify the indexing. To address these cases, this PR adds a new feature to the `config.prefer_nd_tiling` option, which analyzes reads and writes in the kernel, using the same mod-div pattern matching logic that generates block pointers later on. By matching this pattern, we can solve for the tiling splits which *would* simplify the indexing expression, and use then use that tiling to eliminate the modular indexing and emit a block pointer. This tiling mode is still off by default, but it's important for certain applications where we need to get as many block pointers as possible. # Test plan This touches pretty much anything that uses the Triton and Halide backends, so the existing CI provides good coverage. However, 2D reductions are gated behind a few feature flags like `config.prefer_nd_tiling` and `config.tile_reductions`, so this really only checks that the PR doesn't break 1D reductions. In addition to existing CI tests, this PR also adds some new tests that specifically stress 2D reductions: - `test_2d_reduction_odd_shapes`: test 2D reductions with a variety of ops and sizes. This covers the typical persistent and looped reductions. - `test_2d_reduce_no_x_dim`: test 2D reductions with no x dimension. - `test_2d_welford_reduction`: test 2D welford reductions with block pointers. - `test_welford_non_block_pointer`: test a 2D welford reduction when block pointer analysis fails. - `test_reduction_multiple_discontiguous_dims`: test reducing over more than one discontiguous dimension. We won't get a block pointer for this case, since that would require 3D tiling, but we're currently limited to 2D. - `test_2d_reduction_multi_kernel`: test multi kernel autotuning on a 2D softmax kernel. - `test_enable_tiled_reductions`: test that `config.triton.tile_reductions` enables/disables this feature. Pull Request resolved: #137243 Approved by: https://github.com/jansel Co-authored-by: Yueming Hao <[email protected]> Co-authored-by: Jason Ansel <[email protected]>

blaine-rister added 2 commits November 27, 2024 16:41

add helper function to tell if a tree or prefix is a reduction

0fc5665

[Inductor] convert tiling to a dict

6d15d13

pytorch-bot bot added ciflow/inductor module: inductor labels Nov 28, 2024

blaine-rister added 2 commits November 27, 2024 20:46

prevent type check

a4c1bda

comment

cc8ed39

blaine-rister added the topic: not user facing topic category label Nov 28, 2024

blaine-rister requested a review from jansel November 28, 2024 04:52

blaine-rister mentioned this pull request Nov 28, 2024

[Inductor] Support tiling reduction dimensions #137243

Closed

blaine-rister changed the title ~~[Inductor] Convert tiling to a dict~~ [Inductor] Represent tiling as a dict Nov 28, 2024

jansel approved these changes Nov 30, 2024

View reviewed changes

blaine-rister commented Nov 30, 2024

View reviewed changes

torch/_inductor/codegen/simd.py Outdated Show resolved Hide resolved

blaine-rister added 5 commits November 30, 2024 00:42

Update torch/_inductor/codegen/simd.py

930711f

Merge branch 'main' into brister/tree_is_reduction

1042a2d

Merge branch 'brister/tree_is_reduction' into brister/tiling_dict

59343aa

update to use dict access

d3146d1

Merge branch 'brister/tiling_dict' of github.com:pytorch/pytorch into…

4b073a1

… brister/tiling_dict

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 30, 2024

pytorchmergebot added the merging label Nov 30, 2024

pytorchmergebot removed the merging label Nov 30, 2024

fix coop reductions

714ba3f

pytorchmergebot added the merging label Dec 1, 2024

pytorchmergebot added the Merged label Dec 1, 2024

Merge branch 'main' into brister/tiling_dict

53c08c1

pytorchmergebot added the merging label Dec 5, 2024

pytorchmergebot removed the merging label Dec 5, 2024

pytorchmergebot added the merging label Dec 5, 2024

pytorchmergebot closed this in f9af86d Dec 5, 2024

pytorchmergebot removed the merging label Dec 5, 2024

This was referenced Dec 6, 2024

[Inductor] Refactor "r" reduction prefix to {"r0_", "r1_"}. #142020

Closed

[Inductor] Represent size_hints as a dict #142249

Closed

github-actions bot deleted the brister/tiling_dict branch January 7, 2025 02:05

[Inductor] Represent tiling as a dict #141751

[Inductor] Represent tiling as a dict #141751

Uh oh!

Conversation

blaine-rister commented Nov 28, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Uh oh!

pytorch-bot bot commented Nov 28, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/141751

❌ 2 Cancelled Jobs, 3 Unrelated Failures

Uh oh!

jansel Nov 30, 2024

Choose a reason for hiding this comment

Uh oh!

blaine-rister Nov 30, 2024

Choose a reason for hiding this comment

Uh oh!

jansel Nov 30, 2024

Choose a reason for hiding this comment

Uh oh!

blaine-rister Nov 30, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

blaine-rister commented Nov 30, 2024

Uh oh!

pytorchmergebot commented Nov 30, 2024

Merge started

Uh oh!

pytorchmergebot commented Nov 30, 2024

Merge failed

Uh oh!

blaine-rister commented Dec 1, 2024

Uh oh!

pytorchmergebot commented Dec 1, 2024

Merge started

Uh oh!

pytorchmergebot commented Dec 4, 2024

Uh oh!

facebook-github-bot commented Dec 4, 2024

Uh oh!

blaine-rister commented Dec 5, 2024

Uh oh!

pytorchmergebot commented Dec 5, 2024

Merge started

Uh oh!

pytorchmergebot commented Dec 5, 2024

Merge failed

Uh oh!

blaine-rister commented Dec 5, 2024

Uh oh!

pytorchmergebot commented Dec 5, 2024

Merge started

Uh oh!

blaine-rister commented Dec 5, 2024

Uh oh!

Skylion007 commented Dec 7, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

blaine-rister commented Nov 28, 2024 •

edited

Loading

pytorch-bot bot commented Nov 28, 2024 •

edited

Loading