Port prod kernel to structured kernels.
#62024
Tracking issue: #55070

In this PR, I also broke down the meta functions of other reduction kernels (e.g. `all`, `argmax`, `sum`) into the composition of common patterns.
💊 CI failures summary and remediations

As of commit 4dc4316 (more details on the Dr. CI page):

🕵️ 2 new failures recognized by patterns

The following CI failures do not appear to be due to upstream breakages:

| Job | Step | Action |
|---|---|---|
| | Set Up CI Environment After attach_workspace | 🔁 rerun |

❄️ 1 failure tentatively classified as flaky, but reruns have not yet been triggered to confirm:

linux-bionic-py3.8-gcc9-coverage / test (default, 2, 2, linux.2xlarge) (1/1)
Step: "Test PyTorch"
2021-08-11T20:43:51.0343824Z ======================================================================
2021-08-11T20:43:51.0344784Z ERROR [100.094s]: test_return_local_rrefs (__main__.TensorPipeRpcTestWithSpawn)
2021-08-11T20:43:51.0346563Z ----------------------------------------------------------------------
2021-08-11T20:43:51.0347187Z Traceback (most recent call last):
2021-08-11T20:43:51.0348445Z File "/opt/conda/lib/python3.8/site-packages/torch/testing/_internal/common_distributed.py", line 418, in wrapper
2021-08-11T20:43:51.0349362Z self._join_processes(fn)
2021-08-11T20:43:51.0350224Z File "/opt/conda/lib/python3.8/site-packages/torch/testing/_internal/common_distributed.py", line 645, in _join_processes
2021-08-11T20:43:51.0350946Z self._check_return_codes(elapsed_time)
2021-08-11T20:43:51.0351849Z File "/opt/conda/lib/python3.8/site-packages/torch/testing/_internal/common_distributed.py", line 695, in _check_return_codes
2021-08-11T20:43:51.0352528Z raise RuntimeError(
2021-08-11T20:43:51.0353067Z RuntimeError: Process 0 terminated or timed out after 100.07439947128296 seconds
2021-08-11T20:43:51.0353450Z
2021-08-11T20:43:51.0353943Z ----------------------------------------------------------------------
2021-08-11T20:43:51.0354376Z Ran 369 tests in 757.239s
2021-08-11T20:43:51.0381459Z
2021-08-11T20:43:51.0382078Z FAILED (errors=1, skipped=9)
2021-08-11T20:43:51.0382402Z
2021-08-11T20:43:51.0383009Z Generating XML reports...
2021-08-11T20:43:51.0384906Z Generated XML report: test-reports/python-unittest/distributed.rpc.test_tensorpipe_agent/TEST-TensorPipeDdpComparisonTestWithSpawn-20210811203113.xml
2021-08-11T20:43:51.0387734Z Generated XML report: test-reports/python-unittest/distributed.rpc.test_tensorpipe_agent/TEST-TensorPipeDdpUnderDistAutogradTestWithSpawn-20210811203113.xml
2021-08-11T20:43:51.0520714Z Generated XML report: test-reports/python-unittest/distributed.rpc.test_tensorpipe_agent/TEST-TensorPipeDistAutogradTestWithSpawn-20210811203113.xml
This comment was automatically generated by Dr. CI.
@ezyang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Rebasing this stack; @ysiraichi you'll need to `ghstack checkout` after this.
Tracking issue: #55070

In this PR, I also broke down the meta functions of other reduction kernels (e.g. `all`, `argmax`, `sum`) into the composition of common patterns.

Differential Revision: [D29847122](https://our.internmc.facebook.com/intern/diff/D29847122)
Rebased.
    auto shape = meta::get_reduction_shape(self, {}, false);
    Tensor result = at::empty(shape, self.options().dtype(dtype));
    impl_func_prod(self, {}, false, dtype, result);
    return result;
This looks suspicious. How come we have to define prod here?
The problem is that we can't call `prod.dim_int`, since it expects exactly one dimension to reduce, whereas `prod` reduces all of them. This differs from `sum.dim_IntList`, which accepts a list of dimensions (`int[1]`), so the `sum` kernel can simply call `sum.dim_IntList`.
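To make the distinction concrete, here is a minimal, self-contained C++ sketch that models only the output-shape logic, not the actual ATen code. The names `reduction_shape` and `single_dim_shape` are hypothetical. Treating an empty dimension list as "reduce over everything" is an assumption based on the `get_reduction_shape(self, {}, false)` call in the snippet under review.

```cpp
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <vector>

// Hypothetical stand-in for a shape helper: computes the output sizes of a
// reduction over `dims`.
// Assumption: an empty `dims` means "reduce over every dimension".
std::vector<int64_t> reduction_shape(const std::vector<int64_t>& sizes,
                                     const std::vector<int64_t>& dims,
                                     bool keepdim) {
  const int64_t ndim = static_cast<int64_t>(sizes.size());
  // Mark every dimension as reduced when no dimension is given.
  std::vector<bool> reduced(sizes.size(), dims.empty());
  for (int64_t d : dims) {
    const int64_t wrapped = d < 0 ? d + ndim : d;  // allow negative indices
    if (wrapped < 0 || wrapped >= ndim) {
      throw std::out_of_range("reduction dimension out of range");
    }
    reduced[static_cast<std::size_t>(wrapped)] = true;
  }
  std::vector<int64_t> out;
  for (std::size_t i = 0; i < sizes.size(); ++i) {
    if (!reduced[i]) {
      out.push_back(sizes[i]);  // untouched dimension keeps its size
    } else if (keepdim) {
      out.push_back(1);         // reduced dimension kept with size 1
    }
  }
  return out;
}

// Hypothetical model of a single-dimension overload (in the spirit of
// prod.dim_int): it can only ever forward exactly one dimension, so it
// cannot express "reduce everything" for inputs with more than one dimension.
std::vector<int64_t> single_dim_shape(const std::vector<int64_t>& sizes,
                                      int64_t dim,
                                      bool keepdim) {
  return reduction_shape(sizes, {dim}, keepdim);
}
```

In this model, a list-based overload can serve the full reduction by passing an empty list, while the single-dimension form has no argument that yields a zero-dimensional result for a 3-D input. That is why, under these assumptions, the full-reduction entry point computes its own shape and calls the implementation directly.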
Summary:
Pull Request resolved: #62024

Tracking issue: #55070

In this PR, I also broke down the meta functions of other reduction kernels (e.g. `all`, `argmax`, `sum`) into the composition of common patterns.

Test Plan: Imported from OSS

Reviewed By: ejguan

Differential Revision: D29847122

Pulled By: ezyang

fbshipit-source-id: a6680a6cf6e59bb46b8ffe7bf2a3a611d6e0fd14
Stack from ghstack:
- #62711 Port `norm` kernel to structured kernels.
- #62024 Port `prod` kernel to structured kernels.
- #61643 Port `mean` kernel to structured kernels.

Tracking issue: #55070

In this PR, I also broke down the meta functions of other reduction kernels (e.g. `all`, `argmax`, `sum`) into the composition of common patterns.

Differential Revision: D29847122