-
Notifications
You must be signed in to change notification settings - Fork 26.3k
initialize with ident value in global reduction #15653
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
if possible, add a test covering the case. |
test/test_cuda.py
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please also add a comment explaining why do we need to test this. Is it that we use different kernels for large vs small inputs? Those things are super helpful when refactoring the tests.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There is now a single kernel for all reductions, with the different options. I'm not sure all the possible options are thoroughly tested, it would be a good idea to test them, but I don't know exactly in which cases which options are triggered.
facebook-github-bot
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@soumith is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
|
fixed via #15647 which is landing |
Summary: Fixes #15647. cc colesbury. Pull Request resolved: pytorch/pytorch#15653 Differential Revision: D13571132 Pulled By: soumith fbshipit-source-id: 8f25943c974b3b931f4528e0e0a370bc095dab51
Fixes #15647. cc @colesbury.