@pytorch-bot
pytorch-bot bot commented Jul 8, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/130227

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit ed9a3e9 with merge base 6875179:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@github-actions github-actions bot requested a review from ezyang July 8, 2024 01:20
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx peterbell10 ipiszy yf225 chenyang78 kadeng muchulee8 ColinPeppler amjames desertfire chauhang

[ghstack-poisoned]
Chillee added a commit that referenced this pull request Jul 8, 2024
@ezyang ezyang removed their request for review July 8, 2024 02:43
@Chillee Chillee requested review from drisspg and yanboliang July 8, 2024 05:40
@Chillee Chillee added the ciflow/trunk Trigger trunk jobs on your pull request label Jul 8, 2024
}

-static C10_UNUSED void checkIndexTensorTypes(IOptTensorListRef indices, bool allow_int=false) {
+static C10_UNUSED void checkIndexTensorTypes(IOptTensorListRef indices) {
Collaborator Author

This function is only used in 3 places, and all of them either already pass `allow_int=true` or should.

@Chillee
Collaborator Author

Chillee commented Jul 8, 2024

@pytorchbot merge

@pytorchmergebot
Collaborator

Merge failed

Reason: This PR needs a `release notes:` label
If your changes are user facing and intended to be a part of release notes, please use a label starting with `release notes:`.

If not, please add the `topic: not user facing` label.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "topic: not user facing"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.


@yanboliang
Contributor

@pytorchbot merge

@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D59498662

pytorchmergebot added a commit that referenced this pull request Jul 9, 2024
…lexAttention numerics to be as accurate as FA2) (#130250)"

This reverts commit 3e48d92.

Reverted #130250 on behalf of https://github.com/izaitsevfb due to depends on #130227 which needs to be reverted ([comment](#130250 (comment)))
@izaitsevfb
Contributor

@pytorchbot revert -m "breaks internal builds, please see D59498662" -c ghfirst

 error: too many arguments to function call, expected single argument 'indices', have 2 arguments
  at::native::checkIndexTensorTypes(indices, /*allow_int*/ true);
  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~                        ^~~~
buck-out/v2/gen/fbcode/795a8ff4e6618da8/caffe2/__aten-headers-cpu__/buck-headers/ATen/native/IndexingUtils.h:51:24: note: 'checkIndexTensorTypes' declared here
static C10_UNUSED void checkIndexTensorTypes(IOptTensorListRef indices) {
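
The error above is the usual outcome when a defaulted parameter is removed from a function declared in a header while some caller outside the PR still passes it explicitly. The following is a minimal sketch of one way to keep such callers compiling during a migration: a forwarding overload that accepts and ignores the old flag. It is not what this PR did. `DType`, `IndexDTypes`, and `checkIndexTypes` are hypothetical stand-ins for the real ATen types and for `checkIndexTensorTypes`, and the accepted dtype set is an assumption made for illustration.

```cpp
#include <cassert>
#include <stdexcept>
#include <vector>

// Hypothetical stand-ins for the dtypes of an index tensor list;
// the real function takes an IOptTensorListRef.
enum class DType { Long, Int, Byte, Bool, Float };
using IndexDTypes = std::vector<DType>;

// New-style signature: integer index dtypes are always accepted,
// so the allow_int flag is gone. (Accepted set is assumed here.)
static void checkIndexTypes(const IndexDTypes& indices) {
  for (DType t : indices) {
    const bool ok = t == DType::Long || t == DType::Int ||
                    t == DType::Byte || t == DType::Bool;
    if (!ok) {
      throw std::invalid_argument("unsupported index dtype");
    }
  }
}

// Hypothetical compatibility overload: callers that still pass the
// old flag keep compiling; the flag is accepted and ignored.
static void checkIndexTypes(const IndexDTypes& indices, bool /*allow_int*/) {
  checkIndexTypes(indices);
}
```

With both overloads present, a call such as `checkIndexTypes(indices, /*allow_int*/ true)` resolves to the forwarding overload instead of failing with "too many arguments". The overload can be dropped once every caller has been updated.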

@pytorchmergebot
Collaborator

@pytorchbot successfully started a revert job. Check the current status here.
Questions? Feedback? Please reach out to the PyTorch DevX Team

@pytorchmergebot
Collaborator

@Chillee your PR has been successfully reverted.

pytorchmergebot added a commit that referenced this pull request Jul 9, 2024
Chillee added 2 commits July 9, 2024 19:27
pytorchmergebot pushed a commit that referenced this pull request Jul 10, 2024
…tion numerics to be as accurate as FA2) (#130250)

After this PR, our numerical error is within 3% of FA2 for forward and gradients. Previously, our numerical error for `dq` was 30% higher. I also added a `PRESCALE_QK` kernel option that increases perf by about 3-4% but incurs about 20-30% more numerical error.

![image](https://github.com/pytorch/pytorch/assets/6355099/7b5ff44e-219b-4a05-8a1b-2a0182c01ab2)

Pull Request resolved: #130250
Approved by: https://github.com/drisspg
ghstack dependencies: #130227
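
The `PRESCALE_QK` trade-off in the commit message above can be illustrated with a small sketch. It assumes the option means folding the softmax scale into the query before the QK product rather than scaling the finished score. The function names are hypothetical, and this is plain scalar C++, not the Triton kernel. The two orderings are mathematically equal; in a real kernel the prescaled query is presumably rounded to the matmul precision before the product, which is one plausible source of the extra error reported above.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Scale applied once to the accumulated dot product.
inline float score_postscaled(const std::vector<float>& q,
                              const std::vector<float>& k,
                              float scale) {
  float acc = 0.0f;
  for (std::size_t i = 0; i < q.size(); ++i) {
    acc += q[i] * k[i];
  }
  return acc * scale;
}

// Scale folded into each query element before the product
// (the ordering assumed here for a prescaled-QK option).
inline float score_prescaled(const std::vector<float>& q,
                             const std::vector<float>& k,
                             float scale) {
  float acc = 0.0f;
  for (std::size_t i = 0; i < q.size(); ++i) {
    acc += (q[i] * scale) * k[i];
  }
  return acc;
}
```

Prescaling moves the multiply from the score matrix to the query, so it runs once per query element instead of once per score. For general inputs the two functions can differ in the last bits, because the rounding happens at different points.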
@github-actions github-actions bot deleted the gh/chillee/318/head branch August 10, 2024 01:58