[cuDNN][conv] Re-enable cuDNN for 3D convolutions (fixed in 9.15+) #166480

eqy · 2025-10-29T00:59:45Z

cc @csarofeen @ptrblck @xwang233 @msaroufim @jerryzh168 @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10

pytorch-bot · 2025-10-29T00:59:48Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/166480

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit ab4e945 with merge base afaaaa3 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Skylion007 · 2025-10-29T01:36:37Z

Shouldn't there be a CUDNN_FRONTEND version guard though instead of deleting the code?

malfet · 2025-10-29T18:20:04Z

@eqy but can you add a test, to make sure we'll not regress again?

I think feedback about cudnn frontend verison guard is valid as well as unit test

eqy · 2025-10-29T20:03:17Z

Test for this case was already checked in here: e2817ac#diff-31c5c90e1292af7427be151ba6c4aca280793122d3a2010698aeb18a4f69a508
Will just update the runtime version guard for now

malfet · 2025-10-29T21:05:10Z

@eqy IMO instead of deleting the check completely, you need to change the cudnn_version here (as with dynamic linking dev can choose to install pytorch with older cudnn (or newer))

aten/src/ATen/native/Convolution.cpp

eqy · 2025-10-30T20:39:29Z

@pytorchmergebot merge

pytorchmergebot · 2025-10-30T20:41:41Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

…166480) Pull Request resolved: #166480 Approved by: https://github.com/Skylion007, https://github.com/malfet

Lucaskabela · 2025-11-03T23:39:29Z

@pytorchbot cherry-pick --onto release/2.9 --fixes "4x performance regressions for 3d convs with AMP" -c regression

…166480) Pull Request resolved: #166480 Approved by: https://github.com/Skylion007, https://github.com/malfet (cherry picked from commit df71b70)

pytorchbot · 2025-11-03T23:45:33Z

Cherry picking #166480

The cherry pick PR is at #166908 and it is linked with issue 4x performance regressions for 3d convs with AMP. The following tracker issues are updated:

[v2.9.1] Release Tracker #166758 (comment)

Details for Dev Infra team

Raised by workflow job

eqy · 2025-11-04T00:01:14Z

@Lucaskabela we cannot cherrypick this without a cuDNN version bump...
Otherwise we will dispatch to broken kernels. This issue should also be reflected in existing CI tests.

Lucaskabela · 2025-11-04T00:06:42Z

Okay I will close this for now - once we have that version bump please submit the cherry pick :)

eqy · 2025-11-04T00:10:41Z

@Lucaskabela we will discuss this in the core team sync meeting tomorrow... don't think we can bump a cuDNN backend version in a patch release.

…ytorch#166480) Pull Request resolved: pytorch#166480 Approved by: https://github.com/Skylion007, https://github.com/malfet

eqy · 2025-11-04T19:12:38Z

@Lucaskabela OK, after discussion in the meeting I think you can proceed with the cherrypick, as the reenablement is guarded based on cuDNN runtime version.
We don't upgrade the cuDNN runtime version in the release matrix but will recommend users who face the performance regression to upgrade their local cuDNN package as a workaround.

…166908) [cuDNN][conv] Re-enable cuDNN for 3D convolutions (fixed in 9.15+) (#166480) Pull Request resolved: #166480 Approved by: https://github.com/Skylion007, https://github.com/malfet (cherry picked from commit df71b70) Co-authored-by: Eddie Yan <[email protected]>

check in

e70aa6d

eqy added module: cudnn Related to torch.backends.cudnn, and CuDNN support module: cuda Related to torch.cuda, and CUDA support in general module: convolution Problems related to convolutions (THNN, THCUNN, CuDNN) open source topic: not user facing topic category labels Oct 29, 2025

pytorch-bot bot added the module: cpu CPU specific problem (e.g., perf, algorithm) label Oct 29, 2025

Skylion007 previously approved these changes Oct 29, 2025

View reviewed changes

use runtime check

ab4e945

malfet approved these changes Oct 29, 2025

View reviewed changes

eqy added the ciflow/trunk Trigger trunk jobs on your pull request label Oct 30, 2025

eqy mentioned this pull request Oct 30, 2025

4x performance regression for 3D convs with AMP on torch 2.9.0 #166122

Open

Skylion007 reviewed Oct 30, 2025

View reviewed changes

aten/src/ATen/native/Convolution.cpp Show resolved Hide resolved

Skylion007 approved these changes Oct 30, 2025

View reviewed changes

pytorchmergebot added the merging label Oct 30, 2025

pytorchmergebot closed this in df71b70 Oct 30, 2025

pytorchmergebot added Merged and removed merging labels Oct 30, 2025

eqy mentioned this pull request Oct 31, 2025

Significant Memory Regression in F.conv3d with bfloat16 Inputs in PyTorch 2.9.0 #166643

Open

ngimel added this to the 2.9.1 milestone Nov 1, 2025

Lucaskabela linked an issue Nov 3, 2025 that may be closed by this pull request

4x performance regression for 3D convs with AMP on torch 2.9.0 #166122

Open

pytorchbot mentioned this pull request Nov 3, 2025

[v2.9.1] Release Tracker #166758

Closed

eqy mentioned this pull request Nov 4, 2025

Out of Memory using Conv3D passing from PyTorch 2.5.1+ cu121 to PyTorch 2.9 + cu128 #166790

Open

atalman mentioned this pull request Nov 10, 2025

Release 2.9.1 validations checklist and cherry-picks #167476

Open

38 tasks

jovan2009 referenced this pull request in comfyanonymous/ComfyUI Nov 14, 2025

Pytorch is stupid. (#10398)

b4f30bd

jovan2009 mentioned this pull request Nov 14, 2025

CUDNN version in nightly pytorch 2.10.0 builds #167242

Open

jovan2009 mentioned this pull request Nov 21, 2025

working around nvidia conv3d memory bug comfyanonymous/ComfyUI#10827

Closed

1 task

saberrroool mentioned this pull request Nov 27, 2025

Regarding this issue, how can I upgrade or replace the cuDNN version built into my current PyTorch installation? #169175

Closed

[cuDNN][conv] Re-enable cuDNN for 3D convolutions (fixed in 9.15+) #166480

[cuDNN][conv] Re-enable cuDNN for 3D convolutions (fixed in 9.15+) #166480

Conversation

eqy commented Oct 29, 2025 • edited by pytorch-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Oct 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/166480

✅ No Failures

Uh oh!

Skylion007 commented Oct 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

malfet commented Oct 29, 2025

Uh oh!

eqy commented Oct 29, 2025

Uh oh!

malfet commented Oct 29, 2025

Uh oh!

Uh oh!

eqy commented Oct 30, 2025

Uh oh!

pytorchmergebot commented Oct 30, 2025

Merge started

Uh oh!

Lucaskabela commented Nov 3, 2025

Uh oh!

pytorchbot commented Nov 3, 2025

Cherry picking #166480

Uh oh!

eqy commented Nov 4, 2025

Uh oh!

Lucaskabela commented Nov 4, 2025

Uh oh!

eqy commented Nov 4, 2025

Uh oh!

eqy commented Nov 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

eqy commented Oct 29, 2025 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Oct 29, 2025 •

edited

Loading

Skylion007 commented Oct 29, 2025 •

edited

Loading