Parallelize eye() on CPU. #21077
Are you sure a grain_size of 1 is the best choice here? I'd expect the overhead of parallelizing to dominate for small tensors.
I changed this to internal::GRAIN_SIZE. Hope it is sufficient.
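For readers following the grain-size discussion, here is a minimal, self-contained sketch of the idea. It is not the code from this PR. `parallel_for_sketch` and `eye_sketch` are hypothetical names, `std::thread` stands in for ATen's thread pool, and `kGrainSize` is an assumed value playing the role of `internal::GRAIN_SIZE`. The point it illustrates: when the diagonal is shorter than the grain size, the loop runs serially and no threading overhead is paid.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <thread>
#include <vector>

// Assumed value for illustration; it plays the role of internal::GRAIN_SIZE.
constexpr int64_t kGrainSize = 32768;

// Hypothetical stand-in for a parallel_for(begin, end, grain_size, f) helper.
// Runs f(begin, end) serially when the range is shorter than grain_size;
// otherwise splits the range into chunks of about grain_size or more elements.
template <typename F>
void parallel_for_sketch(int64_t begin, int64_t end, int64_t grain_size, const F& f) {
  if (begin >= end) return;
  grain_size = std::max<int64_t>(grain_size, 1);  // guard against division by zero
  const int64_t n = end - begin;
  if (n < grain_size) {
    f(begin, end);  // small range: no threads are spawned
    return;
  }
  // hardware_concurrency() may return 0, so clamp to at least one thread.
  const int64_t hw = std::max<int64_t>(1, std::thread::hardware_concurrency());
  const int64_t num_chunks = std::min<int64_t>(hw, n / grain_size);
  const int64_t chunk = (n + num_chunks - 1) / num_chunks;  // ceiling division
  std::vector<std::thread> workers;
  for (int64_t i = 0; i < num_chunks; ++i) {
    const int64_t lo = begin + i * chunk;
    const int64_t hi = std::min(end, lo + chunk);
    if (lo < hi) workers.emplace_back([&f, lo, hi] { f(lo, hi); });
  }
  for (auto& t : workers) t.join();
}

// Builds a rows x cols row-major identity-like matrix; only the diagonal
// writes go through the parallel helper.
std::vector<float> eye_sketch(int64_t rows, int64_t cols, int64_t grain_size = kGrainSize) {
  std::vector<float> out(static_cast<std::size_t>(rows * cols), 0.0f);
  const int64_t diag = std::min(rows, cols);
  float* data = out.data();
  parallel_for_sketch(0, diag, grain_size, [data, cols](int64_t lo, int64_t hi) {
    for (int64_t i = lo; i < hi; ++i) data[i * cols + i] = 1.0f;
  });
  return out;
}
```

With the default grain size, `eye_sketch(3, 4)` stays on the calling thread, while `eye_sketch(64, 64, 1)` mimics the original `grain_size` of 1 and may spawn a thread per chunk even for this tiny input, which is the overhead the review comment is worried about.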
Any chance to get this merged :)
@ifedan please merge them
facebook-github-bot left a comment
@ezyang is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Summary: Pull Request resolved: pytorch/pytorch#21077
Differential Revision: D15695329
Pulled By: ezyang
fbshipit-source-id: 9841777238dac7c08cde2db3cd9401853f633af3
No description provided.