Change reorder_dimensions behavior to favor output writing sequence #28615
reorder_dimensions() currently iterates over all operands when determining the dimension order in the TensorIterator. It moves a dimension to the front if any operand has another dimension whose stride is larger than this dimension's stride.
reorder_dimensions() does respect the case where a stride is zero, but I did not see a reason why it needs to keep probing every operand in the regular case.
This PR changes the behavior slightly.
Since operands is ordered with the output tensors first, followed by the input tensors, I would favor making the writes to the outputs as sequential as possible. This can make copies between tensors with different memory formats faster.
Please correct me if this change is wrong, thanks.
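To illustrate why the output's layout matters here, this sketch compares the strides that the TensorIterator sees for the two memory formats used in the benchmark below. In the contiguous (NCHW) tensor the dimensions are already in decreasing-stride order, while in the channels_last (NHWC) tensor the channel dimension has stride 1, so ordering by the output's strides makes the writes sequential:

```python
import torch

# Same shape as the benchmark below: N=64, C=2048, H=7, W=7.
x = torch.randn(64, 2048, 7, 7)                      # NCHW, contiguous
y = x.contiguous(memory_format=torch.channels_last)  # NHWC physical layout

print(x.stride())  # (100352, 49, 7, 1)     — strides strictly decreasing
print(y.stride())  # (100352, 1, 14336, 2048) — channel dim has stride 1
```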
Fixes #26812
Benchmark on CPU
x = torch.randn(64, 2048, 7, 7).contiguous(memory_format=torch.contiguous_format)
%timeit x.contiguous(memory_format=torch.channels_last)
x = torch.randn(64, 2048, 7, 7).contiguous(memory_format=torch.channels_last)
%timeit x.contiguous(memory_format=torch.contiguous_format)
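For anyone reproducing this outside IPython (where the %timeit magic is unavailable), a plain-timeit version of the CPU benchmark could look like the following; `bench` is a hypothetical helper name, not part of this PR:

```python
import timeit
import torch

def bench(src_fmt, dst_fmt, number=10):
    """Time one src_fmt -> dst_fmt .contiguous() conversion, averaged."""
    x = torch.randn(64, 2048, 7, 7).contiguous(memory_format=src_fmt)
    total = timeit.timeit(lambda: x.contiguous(memory_format=dst_fmt),
                          number=number)
    return total / number  # seconds per conversion

print(bench(torch.contiguous_format, torch.channels_last))
print(bench(torch.channels_last, torch.contiguous_format))
```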
BEFORE:
20.7 ms ± 1.87 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)
12.5 ms ± 49.4 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
AFTER:
9.26 ms ± 454 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
12.6 ms ± 53.9 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
Benchmark on GPU
x = torch.randn(64, 2048, 7, 7).contiguous(memory_format=torch.contiguous_format).cuda()
%timeit x.contiguous(memory_format=torch.channels_last); torch.cuda.synchronize()
x = torch.randn(64, 2048, 7, 7).contiguous(memory_format=torch.channels_last).cuda()
%timeit x.contiguous(memory_format=torch.contiguous_format); torch.cuda.synchronize()
BEFORE:
622 µs ± 268 ns per loop (mean ± std. dev. of 7 runs, 1000 loops each)
5.2 µs ± 77.6 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)
AFTER:
379 µs ± 316 ns per loop (mean ± std. dev. of 7 runs, 1000 loops each)
5.25 µs ± 76.3 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)