
Conversation

@nowtryz
Contributor

@nowtryz nowtryz commented Apr 30, 2024

Hi,
I noticed the unfold operator was missing on MaskedTensor.

I tested that my change works when calling unfold and backward on a MaskedTensor but I didn't find the tests for the dispatch of such operation. Where is it?

@pytorch-bot

pytorch-bot bot commented Apr 30, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/125262

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit e919719 with merge base 52c7c89:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@albanD
Collaborator

albanD commented May 1, 2024

Most likely the right fix indeed.
cc @cpuhrsch what is the right way to test these PRs?

@albanD albanD added the triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module) label May 1, 2024
@albanD albanD requested a review from cpuhrsch May 1, 2024 14:17
@cpuhrsch
Contributor

cpuhrsch commented May 1, 2024

@nowtryz
Contributor Author

nowtryz commented May 1, 2024 via email

@cpuhrsch
Contributor

cpuhrsch commented May 1, 2024

@nowtryz - You should be able to run this with pytest test/test_maskedtensor.py. You'll need to extend the tests to also cover fold/unfold. For example, see test_prod:

def test_prod(self):
    d = torch.tensor([[0, 1, 3, 0.0], [float("nan"), 4, 1.0, 5.0]])
    m = torch.tensor([[True, False, False, True], [False, True, False, True]])
    mt = masked_tensor(d, m)
    _compare_mts(masked_tensor(torch.tensor(0.0), torch.tensor(True)), mt.prod())
    _compare_mts(
        masked_tensor(
            torch.tensor([0.0, 4.0, 1.0, 0.0]),
            torch.tensor([True, True, False, True]),
        ),
        mt.prod(dim=0),
    )
Since fold/unfold is neither unary, binary, nor a reduction, I'd add a new test under TestBasics (not that it's a basic op, but because it doesn't fit the other buckets).
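
For example, something along these lines (just a rough sketch, assuming unfold passes through to the underlying data and mask, and reusing the masked_tensor/_compare_mts helpers from the existing tests):

```python
def test_unfold(self):
    # Hypothetical sketch: shapes and values are illustrative only.
    d = torch.rand(3, 5)
    m = torch.rand(3, 5) > 0.5
    mt = masked_tensor(d, m)
    # Expect the op to be applied to the data and the mask independently.
    _compare_mts(
        masked_tensor(d.unfold(1, 2, 2), m.unfold(1, 2, 2)),
        mt.unfold(1, 2, 2),
    )
```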

@nowtryz
Contributor Author

nowtryz commented May 3, 2024

Hi,

Ok perfect, I confused test_masked.py and test_maskedtensor.py.
I added the test and updated the documentation.

From what I see, fold is only implemented through torch.nn.Fold/torch.nn.functional.fold, so I would need to add a passthrough for im2col (and col2im for symmetry). However, I don't really know how to test it from the MaskedTensor perspective: should I add a test in test_maskedtensor.py that uses torch.nn.Fold?
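
For context, here is how plain fold/unfold compose on ordinary tensors (a small sketch with illustrative shapes); the divisor step is what makes the masked case non-obvious to me:

```python
import torch
import torch.nn.functional as F

x = torch.arange(16.0).reshape(1, 1, 4, 4)
cols = F.unfold(x, kernel_size=2, stride=1)   # im2col: shape (1, 4, 9)
back = F.fold(cols, output_size=(4, 4), kernel_size=2, stride=1)
# fold sums overlapping blocks, so interior elements are counted several times;
# the usual way to recover x is to divide by a fold of ones:
divisor = F.fold(F.unfold(torch.ones_like(x), kernel_size=2, stride=1),
                 output_size=(4, 4), kernel_size=2, stride=1)
recovered = back / divisor  # equals x
```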

@cpuhrsch
Contributor

cpuhrsch commented May 8, 2024

@nowtryz - Yes, exactly. A test_fold method for example.

@nowtryz
Contributor Author

nowtryz commented May 29, 2024

Hi,

From what I understand, torch.nn.Fold has specific semantics that require a custom masked implementation: what should happen when I fold a combination of specified and unspecified values? Should I sum only the specified values, or consider the folded cell unspecified? Consequently, I feel that this is out of the scope of this pull request. Feel free to correct me if I'm wrong.

In the meantime, I added support for torch.nn.Unfold/torch.nn.functional.unfold.
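
As a quick illustration (a rough sketch, assuming the passthrough simply forwards the op to both the data and the mask), this enables calls such as:

```python
import torch
import torch.nn.functional as F
from torch.masked import masked_tensor

data = torch.rand(1, 1, 4, 4)
mask = torch.rand(1, 1, 4, 4) > 0.5
mt = masked_tensor(data, mask)

windows = mt.unfold(2, 2, 2)          # Tensor.unfold: sliding windows along dim 2
blocks = F.unfold(mt, kernel_size=2)  # torch.nn.functional.unfold (im2col)
```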

@cpuhrsch
Contributor

@nowtryz - Hm, on reduction semantics, see the details here: https://pytorch.org/tutorials/prototype/maskedtensor_advanced_semantics#reduction-semantics and also pytorch/rfcs#27.

One way to derive the semantics of composites such as Fold (or matmul) is to infer them from the semantics of their components.

Or said differently (and maybe less confusingly): if you were to implement Fold in terms of existing MaskedTensor operations (if that's possible), what would it do?

These kinds of discussions are really important (in my very biased opinion) to separate out all the subtly different use cases: raggedness (tensor[i].size() and tensor[j].size() may not match for i != j), sparsity (storage layouts for Tensors with lots of zeros) and masking (logical elements that have a position, but no value).

So, for example, if you mask out an element it doesn't have a value, but it does still have a position. Should that influence the denominator in variance? If you do sparsity, well, 0 is still a value. So if you do softmax, you want to include that (but often people actually would rather it mean -inf because really they want to mask it out). But then if you do raggedness, well you just want to actually describe an entirely different shape. You want to entirely remove elements (e.g. sentences of different lengths) and not have their individual shape influence operations that are truly batched (e.g. parallel or commute with vmap along dimension 0).
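
To make the softmax example concrete, a small plain-tensor sketch (illustrative values only):

```python
import torch

x = torch.tensor([1.0, 2.0, 3.0])
keep = torch.tensor([True, True, False])

# Sparsity reading: the missing element is a stored zero, and zero is a value,
# so it still receives probability mass.
print(torch.softmax(torch.where(keep, x, torch.zeros_like(x)), dim=0))

# Masking reading: the element has a position but no value, so it is usually
# filled with -inf and gets exactly zero probability.
print(torch.softmax(torch.where(keep, x, torch.full_like(x, float("-inf"))), dim=0))
```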

@nowtryz
Contributor Author

nowtryz commented May 30, 2024

Hi @cpuhrsch,

I'm trying to catch up with the Fold/Unfold operators, as they are different from Tensor.unfold. Looking at the operator's documentation, I understand the masked semantics of this operator as follows:

If we have blocks of size $L$, then a folded cell would have the value $L \times x$, where $x$ is the initial value of an equivalent Unfold operation yielding the given block. In that sense, if we have $N$ specified elements, with $N \le L$, my understanding is that the reduction of the given block should be

$$\frac{L}{N} \sum_{n=1}^{N} x_n $$
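
For instance (illustrative numbers, under this interpretation): with $L = 4$, $N = 2$ and specified values $x_1 = 1$, $x_2 = 3$, the folded cell would be $\frac{4}{2}(1 + 3) = 8$, i.e. the sum of the specified values scaled up as if the unspecified ones behaved like their mean.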

If I am correct, then I am not sure how to implement it efficiently. My first idea (to reuse the existing ATen operator without further modifications) is the following:

  1. the new mask is computed by passing the original one through the operator, which yields True for each block where at least one value was specified;
  2. we compute the normal folded version of the data after filling unspecified values with 0s, let's call it "folded data";
  3. the original mask is converted to an integer type and passed through the operator, which yields the $N$ values mentioned earlier, let's call it "divisors";
  4. the result would be res = input.size(-1) * folded_data / divisors.

I don't know whether it is more efficient, but I could also combine (1) and (3) by performing (3) and converting the result back to boolean to obtain the new mask.
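
To make the idea concrete, a rough and untested sketch of steps (1)-(4), combining (1) and (3) as just mentioned (the helper name is hypothetical, and $L$ is taken to be the block size from the formula above):

```python
import math
import torch
import torch.nn.functional as F

def masked_fold_sketch(data, mask, output_size, kernel_size):
    # Hypothetical helper, untested; illustrates the proposal above.
    k = (kernel_size, kernel_size) if isinstance(kernel_size, int) else kernel_size
    block_size = math.prod(k)  # L in the formula above
    # (2) "folded data": fold after filling unspecified values with 0
    folded_data = F.fold(data.masked_fill(~mask, 0.0), output_size, kernel_size)
    # (3) "divisors": folding the integer mask counts the N specified elements per cell
    divisors = F.fold(mask.to(data.dtype), output_size, kernel_size)
    # (1), combined with (3): a cell is specified if at least one contribution was
    new_mask = divisors > 0
    # (4) rescale the sum of specified values by L / N
    result = block_size * folded_data / divisors.clamp(min=1)
    return result, new_mask
```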

What do you think?

@nowtryz
Contributor Author

nowtryz commented Jun 12, 2024

Hi @cpuhrsch @mruberry,

Could we merge this PR? I will open a new one for Fold when I have more time.

Many thanks

@nowtryz nowtryz changed the title from "Add unfold support for MaskedTensor" to "Add MaskedTensor passthrough: unfold, F.Unfold, F.Fold, stack" on Jun 13, 2024
@nowtryz nowtryz force-pushed the patch branch 2 times, most recently from b202d11 to c2b41b0, on June 17, 2024 19:29
@cpuhrsch
Contributor

@pytorchbot rebase

@pytorchmergebot
Collaborator

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

@pytorchmergebot
Collaborator

Successfully rebased patch onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout patch && git pull --rebase)

@cpuhrsch
Contributor

@nowtryz I had to unexpectedly go on leave due to family reasons, but let's revisit this. Are there tests for im2col or col2im that you could add the additional dtype to? Otherwise this looks fine, thank you for sending this :)

@nowtryz
Contributor Author

nowtryz commented Jul 31, 2024

Hi @cpuhrsch,

No problem! If I understand correctly, updating torch/testing/_internal/common_methods_invocations.py should have generated the correct tests for im2col and col2im. However, the tests fail for these functions on CUDA with the dtype I added. I have to check the CUDA kernel, but the fix should be easy (although I do not know CUDA).

@cpuhrsch
Contributor

cpuhrsch commented Aug 5, 2024

@nowtryz - Thanks for double checking, please ping again if you need more pointers.

@nowtryz
Contributor Author

nowtryz commented Sep 5, 2024

@pytorchbot rebase

@pytorchmergebot
Collaborator

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

@pytorchmergebot
Collaborator

Successfully rebased patch onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout patch && git pull --rebase)

@nowtryz
Contributor Author

nowtryz commented Sep 5, 2024

Hi @cpuhrsch!
The last failure didn't seem to be related to this PR's code, and the test passed locally.

@cpuhrsch
Contributor

cpuhrsch commented Sep 6, 2024

@pytorchbot merge

@pytorch-bot

pytorch-bot bot commented Sep 6, 2024

This PR needs to be approved by an authorized maintainer before merge.

@cpuhrsch
Contributor

cpuhrsch commented Sep 6, 2024

@pytorchbot merge

@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here

@clee2000
Contributor

I believe this caused a memory leak in test_maskedtensor.py::TestBasicsCUDA::test_stack_cuda (GH job link, HUD commit link). @nowtryz, could you take a look?

@cpuhrsch
Contributor

@clee2000 - I think we'll need to revert this again then.

@nowtryz
Contributor Author

nowtryz commented Sep 11, 2024

@clee2000 That's strange: the test just does a passthrough to the stack function, and there is no change to the CUDA kernels as part of this test. I will need to take a deeper look.

Chao1Han pushed a commit to Chao1Han/pytorch that referenced this pull request Sep 20, 2024
…h#125262)

Hi,
I noticed the `unfold` operator was missing on MaskedTensor.

I tested that my change works when calling unfold and backward on a `MaskedTensor` but I didn't find the tests for the dispatch of such operation. Where is it?
Pull Request resolved: pytorch#125262
Approved by: https://github.com/cpuhrsch
Chao1Han pushed a commit to Chao1Han/pytorch that referenced this pull request Sep 20, 2024
pytorchmergebot pushed a commit that referenced this pull request Oct 12, 2024
This test is currently failing in trunk when memory leak check is enabled, for example https://github.com/pytorch/pytorch/actions/runs/11296206361/job/31422348823#step:22:1970. When testing locally, calling `backward` on a masked tensor always causes a memory leak until I clean up the data and the mask manually. This is probably related to this warning from masked tensor: `UserWarning: It is not recommended to create a MaskedTensor with a tensor that requires_grad. To avoid this, you can use data.clone().detach()`, but I don't know much about the internal details here to go further. So, let's just fix the test first.

### Testing

```
PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_maskedtensor.py TestBasicsCUDA.test_stack_cuda
```

passes and doesn't warn about memory leak anymore.

The test itself came from #125262 (comment)
Pull Request resolved: #137815
Approved by: https://github.com/kit1980
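
For reference, the pattern the warning in the commit message above suggests looks roughly like this (sketch only; the requires_grad keyword on the masked_tensor factory is assumed here):

```python
import torch
from torch.masked import masked_tensor

data = torch.rand(3, 4, requires_grad=True)
mask = torch.rand(3, 4) > 0.5

# Triggers the UserWarning quoted above: data already requires grad.
mt_warned = masked_tensor(data, mask)

# Suggested alternative: detach the data first and let the MaskedTensor
# track gradients itself (requires_grad kwarg assumed).
mt_clean = masked_tensor(data.clone().detach(), mask, requires_grad=True)
```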

Labels

ciflow/trunk, Merged, open source, release notes: sparse, Reverted, topic: new features, triaged
