[cuDNN][SDPA][Convolution] Expose cuDNN runtime version in CUDA hooks #167111

eqy · 2025-11-05T19:06:57Z

cuDNN dispatching heuristics rely on versions checks but currently only that compile-time version is exposed, if we want to allow users to resolve #166643 on their end by updating their cuDNN version locally we need to check the runtime version rather than compile-time version.

cc @csarofeen @ptrblck @xwang233 @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10 @jerryzh168 @aditew01

pytorch-bot · 2025-11-05T19:07:01Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/167111

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit caa7a77 with merge base 5c63946 ():

FLAKY - The following job failed but was likely due to flakiness present on trunk:

trunk / linux-jammy-cuda12.8-py3.10-gcc11 / test (default, 5, 5, lf.linux.g6.4xlarge.experimental.nvidia.gpu) (gh) (similar failure)
test_decomp

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Skylion007 · 2025-11-05T19:14:01Z

aten/src/ATen/cuda/detail/CUDAHooks.h

  long versionCUDART() const override;
  long versionCuDNN() const override;
+  long versionRuntimeCuDNN() const override;
+  long versionCuDNNFrontend() const override;


Why does Runtime CUDNN frontend matter? It cannot be changed right? It's a compile time include header?

I sidecar'd this change in as we'll need it in the near future for SDPA issues that require a cuDNN frontend version to be available for gating. In theory sdp_utils.cpp could be able to access this but I'm not sure I want to include that directly.

Can the runtime version be different for cudNNFronteEnd or should it be constexpr?

Skylion007 · 2025-11-05T19:40:55Z

aten/src/ATen/Context.h

  static bool hasCuDNN() {
    return detail::getCUDAHooks().hasCuDNN();
  }
  static long versionCuDNN() {


If this is really compile time? Why no constexpr? Would enable if constexpr logic that would simplify critical code paths in CUDNN dispatch.

yes see

pytorch/aten/src/ATen/cuda/detail/CUDAHooks.cpp

Line 348 in 6c5db82

return CUDNN_VERSION;

other uses of CUDNN_VERSION in the file are macros, etc.

Yeah, if they are macros they should be propogated with constexpr then. :)

Yeah, CUDNN_FRONTNED has it's equivalent function as constexpr

eqy · 2025-11-05T23:55:21Z

@Skylion007 are we building with C++20 only? not sure if virtual functions (as these are CUDAHooks) can be constexpr

eqy · 2025-11-06T18:37:52Z

@pytorchmergebot merge

pytorchmergebot · 2025-11-06T18:40:18Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Skylion007 · 2025-11-06T18:56:40Z

@Skylion007 are we building with C++20 only? not sure if virtual functions (as these are CUDAHooks) can be constexpr

Ah, wasn't aware of that limitation. Not yet, no. :(

pytorchmergebot · 2025-11-06T22:51:45Z

Merge failed

Reason: 1 jobs have failed, first few of them are: trunk / linux-jammy-cuda12.8-py3.10-gcc11 / test (default, 5, 5, lf.linux.g6.4xlarge.experimental.nvidia.gpu)

Details for Dev Infra team

Raised by workflow job

eqy · 2025-11-07T00:27:09Z

@pytorchmergebot merge

pytorchmergebot · 2025-11-07T00:29:13Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

eqy · 2025-11-07T16:39:05Z

@pytorchbot cherry-pick --onto release/2.9 --fixes "cuDNN conv3d performance workaround" -c regression

…#167111) cuDNN dispatching heuristics rely on versions checks but currently only that compile-time version is exposed, if we want to allow users to resolve #166643 on their end by updating their cuDNN version locally we need to check the runtime version rather than compile-time version. Pull Request resolved: #167111 Approved by: https://github.com/Skylion007 (cherry picked from commit e678450)

pytorchbot · 2025-11-07T16:44:23Z

Cherry picking #167111

The cherry pick PR is at #167327 and it is linked with issue cuDNN conv3d performance workaround. The following tracker issues are updated:

[v2.9.1] Release Tracker #166758 (comment)

Details for Dev Infra team

Raised by workflow job

…#167327) [cuDNN][SDPA][Convolution] Expose cuDNN runtime version in CUDA hooks (#167111) cuDNN dispatching heuristics rely on versions checks but currently only that compile-time version is exposed, if we want to allow users to resolve #166643 on their end by updating their cuDNN version locally we need to check the runtime version rather than compile-time version. Pull Request resolved: #167111 Approved by: https://github.com/Skylion007 (cherry picked from commit e678450) Co-authored-by: Eddie Yan <[email protected]>

…pytorch#167111) cuDNN dispatching heuristics rely on versions checks but currently only that compile-time version is exposed, if we want to allow users to resolve pytorch#166643 on their end by updating their cuDNN version locally we need to check the runtime version rather than compile-time version. Pull Request resolved: pytorch#167111 Approved by: https://github.com/Skylion007

eqy added 2 commits November 5, 2025 19:00

check in

fa39341

lint

caa7a77

eqy requested a review from syed-ahmed as a code owner November 5, 2025 19:06

eqy added the module: cudnn Related to torch.backends.cudnn, and CuDNN support label Nov 5, 2025

eqy requested a review from Aidyn-A as a code owner November 5, 2025 19:06

eqy added module: convolution Problems related to convolutions (THNN, THCUNN, CuDNN) open source release notes: cudnn module: sdpa All things related to torch.nn.functional.scaled_dot_product_attentiion labels Nov 5, 2025

pytorch-bot bot added the module: cpu CPU specific problem (e.g., perf, algorithm) label Nov 5, 2025

Skylion007 reviewed Nov 5, 2025

View reviewed changes

Skylion007 approved these changes Nov 5, 2025

View reviewed changes

Skylion007 reviewed Nov 5, 2025

View reviewed changes

eqy mentioned this pull request Nov 5, 2025

4x performance regression for 3D convs with AMP on torch 2.9.0 #166122

Open

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 6, 2025

pytorchmergebot added the merging label Nov 6, 2025

pytorchmergebot removed the merging label Nov 6, 2025

pytorchmergebot added the merging label Nov 7, 2025

pytorchmergebot added the Merged label Nov 7, 2025

pytorchmergebot closed this in e678450 Nov 7, 2025

pytorchmergebot removed the merging label Nov 7, 2025

pytorchbot mentioned this pull request Nov 7, 2025

[v2.9.1] Release Tracker #166758

Closed

jovan2009 referenced this pull request in comfyanonymous/ComfyUI Nov 14, 2025

Pytorch is stupid. (#10398)

b4f30bd

jovan2009 mentioned this pull request Nov 14, 2025

CUDNN version in nightly pytorch 2.10.0 builds #167242

Open

jovan2009 mentioned this pull request Nov 21, 2025

working around nvidia conv3d memory bug comfyanonymous/ComfyUI#10827

Closed

1 task

saberrroool mentioned this pull request Nov 27, 2025

Regarding this issue, how can I upgrade or replace the cuDNN version built into my current PyTorch installation? #169175

Closed

[cuDNN][SDPA][Convolution] Expose cuDNN runtime version in CUDA hooks #167111

[cuDNN][SDPA][Convolution] Expose cuDNN runtime version in CUDA hooks #167111

Uh oh!

Conversation

eqy commented Nov 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Nov 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/167111

✅ You can merge normally! (1 Unrelated Failure)

Uh oh!

Skylion007 Nov 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

eqy Nov 5, 2025

Choose a reason for hiding this comment

Uh oh!

Skylion007 Nov 5, 2025

Choose a reason for hiding this comment

Uh oh!

Skylion007 Nov 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

eqy Nov 5, 2025

Choose a reason for hiding this comment

Uh oh!

Skylion007 Nov 5, 2025

Choose a reason for hiding this comment

Uh oh!

Skylion007 Nov 5, 2025

Choose a reason for hiding this comment

Uh oh!

eqy commented Nov 5, 2025

Uh oh!

eqy commented Nov 6, 2025

Uh oh!

pytorchmergebot commented Nov 6, 2025

Merge started

Uh oh!

Skylion007 commented Nov 6, 2025

Uh oh!

pytorchmergebot commented Nov 6, 2025

Merge failed

Uh oh!

eqy commented Nov 7, 2025

Uh oh!

pytorchmergebot commented Nov 7, 2025

Merge started

Uh oh!

eqy commented Nov 7, 2025

Uh oh!

pytorchbot commented Nov 7, 2025

Cherry picking #167111

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

eqy commented Nov 5, 2025 •

edited

Loading

pytorch-bot bot commented Nov 5, 2025 •

edited

Loading

Skylion007 Nov 5, 2025 •

edited

Loading

Skylion007 Nov 5, 2025 •

edited

Loading