
Add checks for empty tensor list#155383

Closed
malfet wants to merge 5 commits into gh/malfet/385/base from gh/malfet/385/head

Conversation

@malfet
Contributor

@malfet malfet commented Jun 7, 2025

Stack from ghstack (oldest at bottom):

Vibe-coded with Codex, after collecting a backtrace, see https://chatgpt.com/s/cd_68438be8a1248191adbfa0a5f000e60b

Even though a check for an empty tensor list exists in at::cat, a crash might still happen while resolving a named dimension to a position via dimname_to_position(tensors[0], dim); see the backtrace below

(lldb) up
frame #1: 0x00000001101146dc libtorch_cpu.dylib`at::TensorBase::has_names(this=0x0000000000000000) const at TensorBase.h:559:10
   556 	  bool has_names() const {
   557 	    // If a user is using unnamed tensors, then we can short-circuit right here.
   558 	    // Otherwise, impl::has_names attempts to retrieve names.
-> 559 	    if (!impl_->has_named_tensor_meta()) {
   560 	      return false;
   561 	    }
   562 	    return impl::has_names(unsafeGetTensorImpl());
(lldb) up
frame #2: 0x00000001101144c4 libtorch_cpu.dylib`at::dimname_to_position(tensor=0x0000000000000000, dim=Dimname @ 0x000000016fdfe348) at NamedTensorUtils.cpp:23:3
   20  	int64_t dimname_to_position(const Tensor& tensor, Dimname dim) {
   21  	  TORCH_CHECK(dim.type() != NameType::WILDCARD,
   22  	      "Please look up dimensions by name, got: name = None.");
-> 23  	  TORCH_CHECK(tensor.has_names(),
   24  	      "Name ", dim, " not found in ", toDimnameRepr(tensor), ".");
   25  	  const auto names = tensor.names();
   26  	

TODOs:

  • Maybe move the test from test_tensor_creation.py to OpInfo (not sure which one is more readable)
  • Replace TORCH_CHECK with TORCH_CHECK_VALUE and adjust unit tests

Fixes #155306
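The failure mode above can be sketched in plain Python (no torch): the named-dimension path dereferences tensors[0] before checking that the list is non-empty. All names below are illustrative stand-ins for the C++ functions in the backtrace, not PyTorch's API.

```python
# Plain-Python sketch of the bug this PR guards against.

def dimname_to_position(tensor, dim):
    # Stand-in for at::dimname_to_position; in C++ this calls
    # tensor.has_names(), which dereferences a null TensorImpl when the
    # "tensor" came from indexing an empty list.
    return tensor["names"].index(dim)

def concat_unguarded(tensors, dim):
    # Mirrors the buggy path: tensors[0] is touched unconditionally.
    return dimname_to_position(tensors[0], dim)

def concat_guarded(tensors, dim):
    # The fix: validate the list before resolving the dimension name.
    if not tensors:
        raise ValueError("expected a non-empty list of Tensors")
    return dimname_to_position(tensors[0], dim)
```

In Python the unguarded version merely raises IndexError; in the C++ code the analogous indexing hands a null pointer to dimname_to_position, which is why the process crashes with EXC_BAD_ACCESS instead of throwing.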

[ghstack-poisoned]
@malfet malfet mentioned this pull request Jun 7, 2025
@pytorch-bot

pytorch-bot bot commented Jun 7, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/155383

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (2 Unrelated Failures)

As of commit 6fe8e6a with merge base 7e4c097:

UNSTABLE - The following jobs are marked as unstable, possibly due to flakiness on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@malfet malfet requested review from Skylion007 and ezyang June 7, 2025 00:52
@malfet malfet added release notes: python_frontend python frontend release notes category topic: bug fixes topic category labels Jun 7, 2025
malfet added 3 commits June 6, 2025 18:16
[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
@ezyang
Contributor

ezyang commented Jun 7, 2025

Can you confirm from the transcript that the tests fail before the change?

@ezyang
Contributor

ezyang commented Jun 7, 2025

Also the diff is kind of suspicious: all the places annotated with checks call functions I would also expect to do the non-empty check

@malfet
Contributor Author

malfet commented Jun 7, 2025

Can you confirm from the transcript that the tests fail before the change?

Yes. Run python -c "import torch; torch.concat([], dim='N')" now and it will crash. Add the empty check to concat and it will throw an exception instead.

Also the diff is kind of suspicious: all the places annotated with checks call functions I would also expect to do the non-empty check

It crashes inside dimname_to_position, which is called with tensors[0] as the first argument (see backtrace below). If it's documented somewhere that calling tensors[0] on an empty tensor list is guaranteed to return nullptr, I can move the check there.

Process 88909 stopped
* thread #2, queue = 'com.apple.main-thread', stop reason = EXC_BAD_ACCESS (code=1, address=0x0)
    frame #0: 0x0000000110114940 libtorch_cpu.dylib`c10::intrusive_ptr<c10::TensorImpl, c10::UndefinedTensorImpl>::operator->(this=0x0000000000000000) const at intrusive_ptr.h:423:12
   420 	  }
   421 	
   422 	  TTarget* operator->() const noexcept {
-> 423 	    return target_;
   424 	  }
   425 	
   426 	  operator bool() const noexcept {
Target 0: (Python) stopped.
(lldb) up
frame #1: 0x00000001101146dc libtorch_cpu.dylib`at::TensorBase::has_names(this=0x0000000000000000) const at TensorBase.h:559:10
   556 	  bool has_names() const {
   557 	    // If a user is using unnamed tensors, then we can short-circuit right here.
   558 	    // Otherwise, impl::has_names attempts to retrieve names.
-> 559 	    if (!impl_->has_named_tensor_meta()) {
   560 	      return false;
   561 	    }
   562 	    return impl::has_names(unsafeGetTensorImpl());
(lldb) up
frame #2: 0x00000001101144c4 libtorch_cpu.dylib`at::dimname_to_position(tensor=0x0000000000000000, dim=Dimname @ 0x000000016fdfe348) at NamedTensorUtils.cpp:23:3
   20  	int64_t dimname_to_position(const Tensor& tensor, Dimname dim) {
   21  	  TORCH_CHECK(dim.type() != NameType::WILDCARD,
   22  	      "Please look up dimensions by name, got: name = None.");
-> 23  	  TORCH_CHECK(tensor.has_names(),
   24  	      "Name ", dim, " not found in ", toDimnameRepr(tensor), ".");
   25  	  const auto names = tensor.names();
   26  	
(lldb) bt
* thread #2, queue = 'com.apple.main-thread', stop reason = EXC_BAD_ACCESS (code=1, address=0x0)
  * frame #0: 0x000000014007f588 libtorch_cpu.dylib`at::dimname_to_position(at::Tensor const&, at::Dimname) + 40
    frame #1: 0x00000001407b87c0 libtorch_cpu.dylib`at::native::concat(c10::ArrayRef<at::Tensor>, at::Dimname) + 36
    frame #2: 0x00000001410f0c24 libtorch_cpu.dylib`at::_ops::concat_names::call(c10::ArrayRef<at::Tensor>, at::Dimname) + 468
    frame #3: 0x00000001028cef24 libtorch_python.dylib`torch::autograd::THPVariable_concat(_object*, _object*, _object*) + 944


// torch.concat, alias for torch.cat
Tensor& concat_out(TensorList tensors, Dimname dim, Tensor& result) {
  TORCH_CHECK(!tensors.empty(), "expected a non-empty list of Tensors");
Collaborator


Suggested change
TORCH_CHECK(!tensors.empty(), "expected a non-empty list of Tensors");
TORCH_CHECK_VALUE(!tensors.empty(), "expected a non-empty list of Tensors");

These should all be ValueErrors
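The reviewer's point can be sketched in plain Python. Assumption (not stated in this thread): TORCH_CHECK surfaces to Python callers as RuntimeError, while TORCH_CHECK_VALUE surfaces as ValueError; the helpers below only model that mapping and are not PyTorch code.

```python
# Model of how the two C++ macros surface to Python callers (assumed mapping).

def torch_check(cond, *msg):
    # TORCH_CHECK -> c10::Error, seen as RuntimeError from Python.
    if not cond:
        raise RuntimeError("".join(map(str, msg)))

def torch_check_value(cond, *msg):
    # TORCH_CHECK_VALUE -> seen as ValueError from Python.
    if not cond:
        raise ValueError("".join(map(str, msg)))

def concat_out(tensors):
    # With the VALUE variant, Python callers can catch ValueError,
    # the conventional type for invalid-argument errors.
    torch_check_value(len(tensors) > 0,
                      "expected a non-empty list of Tensors")
```

This is why the suggested change matters for callers who write `except ValueError:` around argument validation.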

Contributor Author


I'll replace all of them in #155460

Contributor

@ezyang ezyang left a comment


I misunderstood the change, current location is good.

@malfet
Contributor Author

malfet commented Jun 8, 2025

@pytorchbot merge -f "Lint is green"

@pytorchmergebot
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here

@github-actions github-actions bot deleted the gh/malfet/385/head branch July 13, 2025 02:22