kaniblu commented Jun 7, 2017

No description provided.

apaszke (Contributor) commented Jun 7, 2017

These functions don't support in-place mode, and there's no good way to add it: an in-place version would overwrite the input, but the input is needed to compute the backward pass.
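
To illustrate the point, here is a minimal pure-Python sketch. The thread does not say which functions the PR touches, so `square_forward` and `square_backward` are hypothetical stand-ins for any op whose gradient depends on its input (here y = x**2, so dy/dx = 2x). This is not PyTorch's autograd code, only a toy showing how overwriting the input corrupts the gradient.

```python
def square_forward(buf, inplace=False):
    """Compute y = x**2 element-wise over a list of floats.

    Returns (output, saved), where `saved` is what backward will read.
    Hypothetical helper for illustration only.
    """
    saved = buf  # a reference to the input buffer, not a copy
    if inplace:
        # Overwrite the input buffer with the result.
        for i, x in enumerate(buf):
            buf[i] = x * x
        out = buf
    else:
        # Allocate a fresh output and leave the input untouched.
        out = [x * x for x in buf]
    return out, saved


def square_backward(saved, grad_out):
    """Gradient of y = x**2: dL/dx = 2 * x * dL/dy. Needs the original x."""
    return [2.0 * x * g for x, g in zip(saved, grad_out)]


ones = [1.0, 1.0, 1.0]

# Out-of-place: the saved buffer still holds x, so the gradient is 2x.
x = [1.0, 2.0, 3.0]
_, saved = square_forward(x)
good = square_backward(saved, ones)

# In-place: the saved buffer now holds x**2, so backward computes 2 * x**2.
x2 = [1.0, 2.0, 3.0]
_, saved2 = square_forward(x2, inplace=True)
bad = square_backward(saved2, ones)

print(good)  # [2.0, 4.0, 6.0]
print(bad)   # [2.0, 8.0, 18.0]
```

Copying the input before overwriting it would give the right gradient, but it would also cancel the memory saving that in-place mode is meant to provide.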

@apaszke apaszke closed this Jun 7, 2017
houseroad added a commit to houseroad/pytorch that referenced this pull request Jan 19, 2019
…8bdbe7

Summary:
Previous import was fd60104394fa353e1762f44ecad1b2166e33deef

Included changes:
- **[c553fb3](onnx/onnx@c553fb3)**: Handle negative axis in scan shape inference (pytorch#1748) <G. Ramalingam>
- **[51b6ecc](onnx/onnx@51b6ecc)**: external_data: Store large tensor values in separate files (pytorch#678) <Michał Karzyński>
- **[ba05f26](onnx/onnx@ba05f26)**: Scan output axes (pytorch#1737) <G. Ramalingam>
- **[90920c0](onnx/onnx@90920c0)**: Add NonZero op. (pytorch#1714) <Sergii Dymchenko>
- **[c4cf112](onnx/onnx@c4cf112)**: fix the test cases for constantofshape (pytorch#1746) <Lu Fang>
- **[d902349](onnx/onnx@d902349)**: Add sample implementation support (pytorch#1712) <Lu Fang>

Differential Revision: D13745693

fbshipit-source-id: 057d827652e85ad19be8f0243d874e036bf69898
facebook-github-bot pushed a commit that referenced this pull request Jan 21, 2019
…8bdbe7 (#16190)

Summary:
Pull Request resolved: #16190

Previous import was fd60104394fa353e1762f44ecad1b2166e33deef

Included changes:
- **[c553fb3](onnx/onnx@c553fb3)**: Handle negative axis in scan shape inference (#1748) <G. Ramalingam>
- **[51b6ecc](onnx/onnx@51b6ecc)**: external_data: Store large tensor values in separate files (#678) <Michał Karzyński>
- **[ba05f26](onnx/onnx@ba05f26)**: Scan output axes (#1737) <G. Ramalingam>
- **[90920c0](onnx/onnx@90920c0)**: Add NonZero op. (#1714) <Sergii Dymchenko>
- **[c4cf112](onnx/onnx@c4cf112)**: fix the test cases for constantofshape (#1746) <Lu Fang>
- **[d902349](onnx/onnx@d902349)**: Add sample implementation support (#1712) <Lu Fang>

Differential Revision: D13745693

fbshipit-source-id: 05e2cce9ae1dfa2865db83840df64673d55cea57
jjsjann123 pushed a commit to jjsjann123/pytorch that referenced this pull request Jun 7, 2022
jjsjann123 added a commit to jjsjann123/pytorch that referenced this pull request Jun 8, 2022
…sarofeen/pytorch/

Squashed commits to work around the GitHub API

Commits that are actually in this PR from the devel branch:

4c60e7d Add examples infrastructure for using nvFuser in a standalone program (pytorch#1725)
02a05d9 Fix issue pytorch#1751 (pytorch#1753)
8a69aa3 Refactor NvFuser transpose API to match eager mode behavior (pytorch#1746)
ffdf6b7 Remove BroadcastWithoutStride. (pytorch#1738)
02bab16 Fix flipping of a boolean flag (pytorch#1745)
465d668 cleanup (pytorch#1744)
26d354e fixing noncontig broadcast (pytorch#1742)
856b6b2 Add IterDomainBuilder (pytorch#1736)
1fd974f fixing warning for gcc7 (pytorch#1732)
de2740a disabling complex in python tests for pytorch#1730 (pytorch#1733)
fbbbe0a fixing MSVC build (pytorch#1728)
b5feee5 Fix the fused reduction runtime kernel (pytorch#1729)
5247682 Re-entrant GroupedGridReduction (pytorch#1727)
pytorchmergebot pushed a commit that referenced this pull request Jun 10, 2022
Syncing nvfuser devel branch to upstream master. https://github.com/csarofeen/pytorch/

Bug fixes and minor refactor

Squashed commits to work around the GitHub API
Commits that are actually in this PR from the devel branch:

```
4c60e7d Add examples infrastructure for using nvFuser in a standalone program (#1725)
02a05d9 Fix issue #1751 (#1753)
8a69aa3 Refactor NvFuser transpose API to match eager mode behavior (#1746)
ffdf6b7 Remove BroadcastWithoutStride. (#1738)
02bab16 Fix flipping of a boolean flag (#1745)
465d668 cleanup (#1744)
26d354e fixing noncontig broadcast (#1742)
856b6b2 Add IterDomainBuilder (#1736)
1fd974f fixing warning for gcc7 (#1732)
de2740a disabling complex in python tests for #1730 (#1733)
fbbbe0a fixing MSVC build (#1728)
b5feee5 Fix the fused reduction runtime kernel (#1729)
5247682 Re-entrant GroupedGridReduction (#1727)
```

RUN_TORCHBENCH: nvfuser
Pull Request resolved: #79147
Approved by: https://github.com/davidberard98
facebook-github-bot pushed a commit that referenced this pull request Jun 10, 2022
Summary:
Syncing nvfuser devel branch to upstream master. https://github.com/csarofeen/pytorch/

Bug fixes and minor refactor

Squashed commits to work around the GitHub API
Commits that are actually in this PR from the devel branch:

```
4c60e7d Add examples infrastructure for using nvFuser in a standalone program (#1725)
02a05d9 Fix issue #1751 (#1753)
8a69aa3 Refactor NvFuser transpose API to match eager mode behavior (#1746)
ffdf6b7 Remove BroadcastWithoutStride. (#1738)
02bab16 Fix flipping of a boolean flag (#1745)
465d668 cleanup (#1744)
26d354e fixing noncontig broadcast (#1742)
856b6b2 Add IterDomainBuilder (#1736)
1fd974f fixing warning for gcc7 (#1732)
de2740a disabling complex in python tests for #1730 (#1733)
fbbbe0a fixing MSVC build (#1728)
b5feee5 Fix the fused reduction runtime kernel (#1729)
5247682 Re-entrant GroupedGridReduction (#1727)
```

Pull Request resolved: #79147

Reviewed By: zou3519

Differential Revision: D37038192

Pulled By: davidberard98

fbshipit-source-id: 330f46ee21eef5c2de7e178290f2962aa030393e
pytorchmergebot pushed a commit that referenced this pull request Jun 16, 2022
Landing reverted PR #79147.

Syncing nvfuser devel branch to upstream master. https://github.com/csarofeen/pytorch/

Bug fixes and minor refactor

Squashed commits to work around the GitHub API
Commits that are actually in this PR from the devel branch:

```
4c60e7d Add examples infrastructure for using nvFuser in a standalone program (#1725)
02a05d9 Fix issue #1751 (#1753)
8a69aa3 Refactor NvFuser transpose API to match eager mode behavior (#1746)
ffdf6b7 Remove BroadcastWithoutStride. (#1738)
02bab16 Fix flipping of a boolean flag (#1745)
465d668 cleanup (#1744)
26d354e fixing noncontig broadcast (#1742)
856b6b2 Add IterDomainBuilder (#1736)
1fd974f fixing warning for gcc7 (#1732)
de2740a disabling complex in python tests for #1730 (#1733)
fbbbe0a fixing MSVC build (#1728)
b5feee5 Fix the fused reduction runtime kernel (#1729)
5247682 Re-entrant GroupedGridReduction (#1727)
```

RUN_TORCHBENCH: nvfuser
Pull Request resolved: #79406
Approved by: https://github.com/davidberard98
facebook-github-bot pushed a commit that referenced this pull request Jun 16, 2022
Summary:
Landing reverted PR #79147.

Syncing nvfuser devel branch to upstream master. https://github.com/csarofeen/pytorch/

Bug fixes and minor refactor

Squashed commits to work around the GitHub API
Commits that are actually in this PR from the devel branch:

```
4c60e7d Add examples infrastructure for using nvFuser in a standalone program (#1725)
02a05d9 Fix issue #1751 (#1753)
8a69aa3 Refactor NvFuser transpose API to match eager mode behavior (#1746)
ffdf6b7 Remove BroadcastWithoutStride. (#1738)
02bab16 Fix flipping of a boolean flag (#1745)
465d668 cleanup (#1744)
26d354e fixing noncontig broadcast (#1742)
856b6b2 Add IterDomainBuilder (#1736)
1fd974f fixing warning for gcc7 (#1732)
de2740a disabling complex in python tests for #1730 (#1733)
fbbbe0a fixing MSVC build (#1728)
b5feee5 Fix the fused reduction runtime kernel (#1729)
5247682 Re-entrant GroupedGridReduction (#1727)
```

RUN_TORCHBENCH: nvfuser

Pull Request resolved: #79406

Reviewed By: anjali411

Differential Revision: D37109147

Pulled By: davidberard98

fbshipit-source-id: 14209be028a3338be112cc83ffe77e631f802891
jagadish-amd pushed a commit to jagadish-amd/pytorch that referenced this pull request Jan 14, 2025
…) (pytorch#1746)

MI300 adds HW support for packed bfloat16 and fp16. Enable via existing
fastSpecializedAtomicAdd.

Helps with improving [torch.scatter_add_
performance](https://ontrack-internal.amd.com/browse/SWDEV-497013),
among others.

Pull Request resolved: pytorch#135770

Co-authored-by: Jeff Daily <[email protected]>
akashveramd pushed a commit to akashveramd/pytorch that referenced this pull request Apr 9, 2025