
Conversation

@jansel (Contributor) commented Sep 19, 2025

@pytorch-bot bot commented Sep 19, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/163377

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit ef87cf7 with merge base 51152ef:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

jansel added a commit that referenced this pull request Sep 19, 2025 (Fixes #163037, ghstack-source-id: fc244cf, Pull-Request: #163377)

@jansel added the topic: not user facing label Sep 19, 2025

jansel added a commit that referenced this pull request Sep 20, 2025 (Fixes #163037, ghstack-source-id: 0c237b0, Pull-Request: #163377)

jansel added a commit that referenced this pull request Sep 20, 2025 (Fixes #163037, ghstack-source-id: 929c1d6, Pull-Request: #163377)

jansel added a commit that referenced this pull request Sep 20, 2025 (Fixes #163037, ghstack-source-id: 329d227, Pull-Request: #163377)

@jansel requested a review from bdhirsh September 23, 2025 04:40
@eellison requested reviews from ezyang and zou3519 September 23, 2025 17:02
jansel added a commit that referenced this pull request Sep 23, 2025

@ezyang (Contributor) left a comment

Unfortunately, it seems Codex found an existing, incorrect functionalization rule for rng in the prims directory. Here's what a real, full-bodied view functionalization rule looks like:

    at::Tensor view(c10::DispatchKeySet dispatchKeySet, const at::Tensor & self, c10::SymIntArrayRef size) {
      at::Tensor self_;
      if (at::functionalization::impl::isFunctionalTensor(self)) {
        self_ = at::functionalization::impl::from_functional_tensor(self);
      } else {
        self_ = self;
      }
      if (!at::functionalization::impl::isFunctionalTensor(self)) {
        // functionalization is re-entrant, but will no-op if it wasn't passed a FunctionalTensorWrapper.
        at::AutoDispatchSkipFunctionalize guard;
        return at::_ops::view::call(self_, size);
      }
      auto reapply_views = at::functionalization::impl::getFunctionalizationReapplyViewsTLS();
      auto inverse_return_mode = (
          reapply_views ? at::functionalization::InverseReturnMode::ViewOrScatterInverse
            : at::functionalization::InverseReturnMode::NeverView
      );
      auto compute_reference_meta =
        self.key_set().has_backend(c10::BackendComponent::XLABit) ||
        self.key_set().has_backend(c10::BackendComponent::LazyBit);
      at::Tensor reference_tensor_output;
      if (compute_reference_meta && !disable_meta_reference()) {
        auto self_meta = to_meta(self);
        at::AutoDispatchSkipFunctionalize func_guard;
        c10::impl::ExcludeDispatchKeyGuard guard(exclude_keys_for_meta_dispatch);
        reference_tensor_output = at::_ops::view::call(self_meta, size);
      }
      at::Tensor tmp_output;
      {
        at::AutoDispatchSkipFunctionalize guard;
        if (reapply_views) {
          tmp_output = at::_ops::view::call(self_, size);
        } else {
          tmp_output = at::_ops::view_copy::call(self_, size);
        }
      }

      bool has_symbolic_inputs = false;
      has_symbolic_inputs = has_symbolic_inputs | (std::any_of(size.begin(), size.end(), [=](auto& arg) { return arg.is_symbolic(); }));
      at::functionalization::ViewMeta view_meta = at::functionalization::ViewMeta(
        [reapply_views = reapply_views, size = size.vec()](const at::Tensor & base, int64_t mutated_view_idx) -> at::Tensor {
          if (reapply_views) {
            return at::_ops::view::call(base, size);
          } else {
            return at::_ops::view_copy::call(base, size);
          }
        },
        [inverse_return_mode = inverse_return_mode, size = size.vec()](const at::Tensor & base, const at::Tensor & mutated_view, int64_t mutated_view_idx) -> at::Tensor {
          return at::functionalization::FunctionalInverses::view_inverse(base, mutated_view, inverse_return_mode, size);
        },
        /*has_symbolic_inputs=*/has_symbolic_inputs,
        /*is_multi_output=*/false,
        /*is_as_strided=*/false
      );
      auto out = at::functionalization::impl::create_functional_tensor_with_view_meta(tmp_output, self, view_meta);
      // See  Note [Propagating strides in the functionalization pass]
      if (compute_reference_meta && !disable_meta_reference()) {
        at::functionalization::impl::set_sizes_strides_offset(out, reference_tensor_output);
      }
      return out;
    }

For broadcast_in_dim to be implemented correctly, it needs to follow this structure more closely.
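
As a rough illustration, here is a minimal Python sketch of the view chain such a rule has to encode, assuming the prim's (a, shape, broadcast_dimensions) signature (the helper name is hypothetical, not code from this PR): broadcast_in_dim is a chain of unsqueeze views followed by an expand, and the ViewMeta forward/inverse lambdas in a C++ rule shaped like view would need to replay exactly this chain:

    import torch

    def broadcast_in_dim_as_views(a, shape, broadcast_dimensions):
        # Insert a size-1 dim at every output position that is not mapped
        # from an input dim, then expand to the target shape.
        t = a
        orig_idx = 0
        for idx in range(len(shape)):
            if orig_idx < len(broadcast_dimensions) and broadcast_dimensions[orig_idx] == idx:
                orig_idx += 1
            else:
                t = t.unsqueeze(idx)
        # expand is itself a view, so the result aliases `a`.
        return t.expand(shape)

    x = torch.zeros(2, 3)
    y = broadcast_in_dim_as_views(x, (2, 4, 3), (0, 2))
    assert y.shape == (2, 4, 3) and y._base is x  # y is a view of x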

jansel added a commit that referenced this pull request Sep 24, 2025
@jansel marked this pull request as draft September 24, 2025 04:09
jansel added a commit that referenced this pull request Sep 24, 2025

    original_idx = 0
    for idx in range(len(shape)):
        if idx in broadcast_dims_set:
            size = tensor_sizes[original_idx]

A reviewer (Contributor) commented on this snippet:

Can we write this to be unbacked-friendly? The unbacked semantics for broadcasting are: if we can't tell whether it's a broadcast case or not, we assume no broadcasting and do a torch._check. For example, this is the meta version:

https://www.internalfb.com/code/fbsource/[e18e1578407a804d7877bf6be709197b739f6eae]/fbcode/caffe2/torch/_prims/__init__.py?lines=1296
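
A short sketch of that pattern, assuming guard_size_oblivious from torch.fx.experimental.symbolic_shapes (the helper below is hypothetical): treat a dim as broadcasting only when its size is provably 1; otherwise assume no broadcast and record the size equality with torch._check rather than guarding on an unbacked SymInt:

    import torch
    from torch.fx.experimental.symbolic_shapes import guard_size_oblivious

    def pick_broadcast_size(tensor_size, target_size):
        # Broadcast only when the input size is provably 1; for an
        # unbacked SymInt this condition evaluates as False.
        if guard_size_oblivious(tensor_size == 1):
            return target_size
        # Assume no broadcasting and assert the sizes match at runtime
        # instead of introducing a guard.
        torch._check(
            tensor_size == target_size,
            lambda: f"size mismatch: {tensor_size} vs {target_size}",
        )
        return tensor_size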

jansel added a commit that referenced this pull request Sep 24, 2025
@jansel (Contributor, Author) commented Sep 24, 2025

Updated prompt (I reverted the prior version and redid it):

Write a functionalization rule in C++ for broadcast_in_dim to fix failures in repro_broadcast_in_dim.py and the test test_prims_broadcast_in_dim_alias. Try to follow the pattern of other rules like view. Perhaps we could even rewrite broadcast_in_dim to view during functionalization. At the end, please provide a one-paragraph summary of your approach for use in a GitHub comment. When you change C++, rebuild with python setup.py develop; you have a working build environment.

Summary:

Implemented a Functionalize kernel for prims::broadcast_in_dim that rewrites the op into the same unsqueeze+expand view pattern we use in eager, captures the resulting size/stride metadata, and registers a prims-specific rule so functionalization rewrites it to the appropriate as_strided view. With the new rule in place, both the standalone repro and CPUReproTests.test_prims_broadcast_in_dim_alias now pass.
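
For context, here is a hypothetical repro in the spirit of the targeted test (the actual repro_broadcast_in_dim.py is not shown in this thread): the prim returns an alias of its input, so functionalization has to keep reads through that alias consistent with mutations of the base:

    import torch

    def f(x):
        # prims.broadcast_in_dim returns a view of x in eager mode
        y = torch.ops.prims.broadcast_in_dim(x, [3, 2], [0])
        x.mul_(2)     # mutate the base...
        return y + 1  # ...then read through the alias

    x = torch.ones(3)
    eager = f(x.clone())
    # With the functionalization rule in place, the compiled output
    # should match eager.
    compiled = torch.compile(f)(x.clone())
    assert torch.equal(eager, compiled)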

jansel added a commit that referenced this pull request Sep 25, 2025
@jansel marked this pull request as ready for review September 25, 2025 03:25
@jansel requested a review from ezyang September 25, 2025 03:25
jansel added a commit that referenced this pull request Sep 25, 2025
@ngimel (Collaborator) commented Sep 25, 2025

When would this functionalization be needed? This prim should not be used in the usual lowering process, and if someone just wants to call a function with this behavior, they can write it with unsqueeze and expand, and it will be functionalized normally.

@jansel (Contributor, Author) commented Sep 26, 2025

Someone called the op directly.

@ngimel (Collaborator) commented Sep 26, 2025

Well, they shouldn't.

@eellison removed their request for review October 14, 2025 19:29

@github-actions bot (Contributor) commented:

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

@github-actions bot added the Stale label Dec 13, 2025
@jansel closed this Dec 14, 2025
