Eliminate direct access to size/strides of THTensor; replace them with std::vector #9561

ezyang · 2018-07-18T20:41:09Z

THTensor now stores sizes_ and strides_ which is a std::vector<int64_t>
Anywhere a "public" API function made use of a int64_t* of sizes, I opted to just finagle it out of the tensor using THTensor_getSizePtr rather than try to rewrite all of these sites to use ArrayRef. They should use ArrayRef eventually, but not yet.
There are new utility functions for resizing sizes/strides in one go (THTensor_resizeDim), or replacing sizes and strides with completely new values (THTensor_setSizesAndStrides)
Anywhere you said t->size[n] = 0, we now say THTensor_setSizeAt(t, n, 0), ditto for strides
Anywhere you said t->size[n], we now say t->size(n) (coming soon: ditto for strides)

Previous review of just the std::vector change in #9518, but I'm planning to merge this all in one go.

Note for @gchanan: review from commit "ci" and after

facebook-github-bot

@ezyang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot

@ezyang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

ezyang · 2018-07-18T21:11:29Z

@pytorchbot retest this please

ezyang · 2018-07-18T21:34:51Z

@pytorchbot retest this please

ezyang · 2018-07-19T00:18:42Z

@pytorchbot retest this please

facebook-github-bot

@ezyang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot

@ezyang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

cpuhrsch · 2018-07-19T14:59:07Z

Not sure if std::vector is a good longterm choice since size and stride is so short. It also can't be compiled into very efficient code. Ideally we'd template on that size and branch somewhere, at least that led to the best code within CPUApplyUtils.h. Otherwise SmartVector could be a good choice for now.

ezyang · 2018-07-19T15:03:25Z

@cpuhrsch Yeah, some sort of inline size/stride is in the roadmap, but it's not blocking for the merge so I haven't done it. When we add a change like this some benchmarking will be needed.

gchanan

I assume you are just trying to figure out where you need the to put the no-strict-overflow? Also, why do you need it?

aten/src/ATen/templates/TensorDense.cpp

aten/src/TH/THTensor.hpp

aten/src/TH/generic/THTensor.cpp

cmake/public/utils.cmake

setup.py

torch/CMakeLists.txt

CMakeLists.txt

facebook-github-bot

@ezyang is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

This patch was very carefully constructed to avoid having to modify too many files; there are some obvious follow ups which I will be hitting later. - I didn't do stride. But the change for stride should look very similar. - I did NOT rename the field in question, so that direct accesses of the form foo->size[n] keep working. I intend to do a codemod to fix all of these shortly. - Anywhere a "public" API function made use of a int64_t* of sizes, I opted to just finagle it out of the tensor using THTensor_getSizePtr rather than try to rewrite all of these sites to use ArrayRef. They should use ArrayRef eventually, but not yet. - _THSizeDesc got an overload that understands ArrayRef (which a vector size is convertible to). Eventually we should get rid of all of these functions (because ArrayRef is printable via the AT_ERROR macros), but not today. - I ran into something very subtle in the implementation of sizes() for TensorDerived: I MUST use the dim as per Tensor::dim() (which correctly is zero for scalars), otherwise I'll give a nonsense sizes(). We can fix this eventually once Scalar is turned on internally. - I added two new functions THTensor_resizeSize and THTensor_setSize. Maybe these are eventually worth deifying as methods in the Tensor class, but for now I'm keeping them out-of-line just in case. Signed-off-by: Edward Z. Yang <[email protected]>

Signed-off-by: Edward Z. Yang <[email protected]>

…h std::vector (#9561) Summary: * THTensor now stores `sizes_` and `strides_` which is a `std::vector<int64_t>` * Anywhere a "public" API function made use of a int64_t* of sizes, I opted to just finagle it out of the tensor using THTensor_getSizePtr rather than try to rewrite all of these sites to use ArrayRef. They should use ArrayRef eventually, but not yet. * There are new utility functions for resizing sizes/strides in one go (THTensor_resizeDim), or replacing sizes and strides with completely new values (THTensor_setSizesAndStrides) * Anywhere you said `t->size[n] = 0`, we now say `THTensor_setSizeAt(t, n, 0)`, ditto for strides * Anywhere you said `t->size[n]`, we now say `t->size(n)` (coming soon: ditto for strides) Previous review of just the `std::vector` change in #9518, but I'm planning to merge this all in one go. Note for gchanan: review from commit "ci" and after Pull Request resolved: pytorch/pytorch#9561 Reviewed By: cpuhrsch Differential Revision: D8901926 Pulled By: ezyang fbshipit-source-id: 483cf275060ab0a13845cba1ece39dd127142510

Summary: This pops off `refcount_`, `storage_`, `storage_offset_`; there are now no more direct accesses to these fields and we can make them private (with appropriate friending). Stacked on pytorch#9561 Pull Request resolved: pytorch#9591 Reviewed By: gchanan Differential Revision: D8922246 Pulled By: gchanan fbshipit-source-id: 63dd3eb9787c460373c3cc507886d3b0a30e7542

Summary: Pull Request resolved: #9683 This pops off `refcount_`, `storage_`, `storage_offset_`; there are now no more direct accesses to these fields and we can make them private (with appropriate friending). Stacked on #9561 Pull Request resolved: #9591 Reviewed By: SsnL Differential Revision: D8922246 Pulled By: ezyang fbshipit-source-id: dfae023d790e29ce652e2eab9a1628bbe97b318d

…h std::vector (pytorch#9561) Summary: * THTensor now stores `sizes_` and `strides_` which is a `std::vector<int64_t>` * Anywhere a "public" API function made use of a int64_t* of sizes, I opted to just finagle it out of the tensor using THTensor_getSizePtr rather than try to rewrite all of these sites to use ArrayRef. They should use ArrayRef eventually, but not yet. * There are new utility functions for resizing sizes/strides in one go (THTensor_resizeDim), or replacing sizes and strides with completely new values (THTensor_setSizesAndStrides) * Anywhere you said `t->size[n] = 0`, we now say `THTensor_setSizeAt(t, n, 0)`, ditto for strides * Anywhere you said `t->size[n]`, we now say `t->size(n)` (coming soon: ditto for strides) Previous review of just the `std::vector` change in pytorch#9518, but I'm planning to merge this all in one go. Note for gchanan: review from commit "ci" and after Pull Request resolved: pytorch#9561 Reviewed By: cpuhrsch Differential Revision: D8901926 Pulled By: ezyang fbshipit-source-id: 483cf275060ab0a13845cba1ece39dd127142510

Summary: Pull Request resolved: pytorch#9683 This pops off `refcount_`, `storage_`, `storage_offset_`; there are now no more direct accesses to these fields and we can make them private (with appropriate friending). Stacked on pytorch#9561 Pull Request resolved: pytorch#9591 Reviewed By: SsnL Differential Revision: D8922246 Pulled By: ezyang fbshipit-source-id: dfae023d790e29ce652e2eab9a1628bbe97b318d

…h std::vector (pytorch#9561) Summary: * THTensor now stores `sizes_` and `strides_` which is a `std::vector<int64_t>` * Anywhere a "public" API function made use of a int64_t* of sizes, I opted to just finagle it out of the tensor using THTensor_getSizePtr rather than try to rewrite all of these sites to use ArrayRef. They should use ArrayRef eventually, but not yet. * There are new utility functions for resizing sizes/strides in one go (THTensor_resizeDim), or replacing sizes and strides with completely new values (THTensor_setSizesAndStrides) * Anywhere you said `t->size[n] = 0`, we now say `THTensor_setSizeAt(t, n, 0)`, ditto for strides * Anywhere you said `t->size[n]`, we now say `t->size(n)` (coming soon: ditto for strides) Previous review of just the `std::vector` change in pytorch#9518, but I'm planning to merge this all in one go. Note for gchanan: review from commit "ci" and after Pull Request resolved: pytorch#9561 Reviewed By: cpuhrsch Differential Revision: D8901926 Pulled By: ezyang fbshipit-source-id: 483cf275060ab0a13845cba1ece39dd127142510

Summary: Pull Request resolved: pytorch#9683 This pops off `refcount_`, `storage_`, `storage_offset_`; there are now no more direct accesses to these fields and we can make them private (with appropriate friending). Stacked on pytorch#9561 Pull Request resolved: pytorch#9591 Reviewed By: SsnL Differential Revision: D8922246 Pulled By: ezyang fbshipit-source-id: dfae023d790e29ce652e2eab9a1628bbe97b318d

ezyang requested review from apaszke, colesbury, gchanan, soumith and zdevito as code owners July 18, 2018 20:41

facebook-github-bot reviewed Jul 18, 2018

View reviewed changes

ezyang mentioned this pull request Jul 18, 2018

Change THTensor::size into a std::vector<int64_t> #9518

Closed

facebook-github-bot reviewed Jul 18, 2018

View reviewed changes

ezyang force-pushed the pr/size-codemod branch from e73250b to 12609bb Compare July 18, 2018 23:59

ezyang force-pushed the pr/size-codemod branch from 12609bb to 95afe2e Compare July 19, 2018 01:44

facebook-github-bot reviewed Jul 19, 2018

View reviewed changes

gchanan approved these changes Jul 19, 2018

View reviewed changes

facebook-github-bot reviewed Jul 19, 2018

View reviewed changes

ezyang force-pushed the pr/size-codemod branch from 385d332 to 15cac87 Compare July 19, 2018 17:22

ezyang mentioned this pull request Jul 19, 2018

Hide all other fields in THTensor #9591

Closed

ezyang added 9 commits July 19, 2018 11:37

Apparently dim doesn't match size ugh

37d0f88

Signed-off-by: Edward Z. Yang <[email protected]>

Convert stride to std::vector

710c785

Signed-off-by: Edward Z. Yang <[email protected]>

MAGMAAAA

afc2314

Signed-off-by: Edward Z. Yang <[email protected]>

Get rid of THSizeDesc overload.

c76d42c

Signed-off-by: Edward Z. Yang <[email protected]>

bugfix

58e4697

Signed-off-by: Edward Z. Yang <[email protected]>

ci

9c65eed

Signed-off-by: Edward Z. Yang <[email protected]>

Grab some more dim_ increment/decrement

a902842

Signed-off-by: Edward Z. Yang <[email protected]>

Replace direct setters on size with setSizeAtDim

58583f7

Signed-off-by: Edward Z. Yang <[email protected]>

ezyang added 11 commits July 19, 2018 11:37

Replace direct setters on stride with setStrideAtDim

d89c558

Signed-off-by: Edward Z. Yang <[email protected]>

Replace direct access of size with method, rename internal member.

85c062f

Signed-off-by: Edward Z. Yang <[email protected]>

Make size() on THTensor wrap dimensions.

0161d0d

Signed-off-by: Edward Z. Yang <[email protected]>

Replace direct access of stride with method, rename internal member.

88a4be3

Signed-off-by: Edward Z. Yang <[email protected]>

Shut up trusty warnings.

23537ea

Signed-off-by: Edward Z. Yang <[email protected]>

Turn off strict overflow harder.

64e161e

Signed-off-by: Edward Z. Yang <[email protected]>

try again

39ca215

Signed-off-by: Edward Z. Yang <[email protected]>

More

de043ef

Signed-off-by: Edward Z. Yang <[email protected]>

Delete asserts

91b38f7

Signed-off-by: Edward Z. Yang <[email protected]>

quash one signed-unsigned compare

43e27de

Signed-off-by: Edward Z. Yang <[email protected]>

Fix more merge conflict.

93a7e61

Signed-off-by: Edward Z. Yang <[email protected]>

ezyang force-pushed the pr/size-codemod branch from 63d650f to 93a7e61 Compare July 19, 2018 18:38

facebook-github-bot closed this in a08119a Jul 19, 2018

ezyang mentioned this pull request Jul 22, 2018

Hide all other fields in THTensor (#9591) #9683

Closed

ezyang added the merged label Jun 26, 2019

Eliminate direct access to size/strides of THTensor; replace them with std::vector #9561

Eliminate direct access to size/strides of THTensor; replace them with std::vector #9561

Uh oh!

Conversation

ezyang commented Jul 18, 2018

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

ezyang commented Jul 18, 2018

Uh oh!

ezyang commented Jul 18, 2018

Uh oh!

ezyang commented Jul 19, 2018

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

cpuhrsch commented Jul 19, 2018

Uh oh!

ezyang commented Jul 19, 2018

Uh oh!

gchanan left a comment

Choose a reason for hiding this comment

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants