[fixing reduction kernel launch] #22827
Conversation
1. Fix an out-of-range memory access in reductions over all dimensions of a non-packed tensor. 2. Enable the launch config that maps block width to a reduction along the fastest-striding dimension. This mapping was previously active only when reducing along the fastest-striding dimension of a packed tensor, a restriction that is not necessary.
@jjsjann123 feel free to request review.
Looks like a bunch of tests failed here.
facebook-github-bot
left a comment
@zdevito has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
zdevito
left a comment
Looks good. I'll land when the tests pass.
I don't have a local repro of the CircleCI failure, and the build failure looks strange as well. I don't think I have messed up the history :/
@zdevito Could you land this PR? Thanks.
Summary: 1. Fix an out-of-range memory access in reductions over all dimensions of a non-packed tensor. 2. Enable the launch config that maps block width to a reduction along the fastest-striding dimension. This mapping was previously active only when reducing along the fastest-striding dimension of a packed tensor, a restriction that is not necessary. Pull Request resolved: pytorch/pytorch#22827 Differential Revision: D16271897 Pulled By: zdevito fbshipit-source-id: 20763f6cf9a58e44ffc0e7ec27724dfec8fe2c5d