[release-0.19] wsl: report a single "all" device to kubelet #1699

Merged
cdesiniotis merged 2 commits into release-0.19 from backport-1671-to-release-0.19
Apr 17, 2026

@github-actions

🤖 Automated backport of #1671 to release-0.19

✅ Cherry-pick completed successfully with no conflicts.

Original PR: #1671
Original Author: @elezar

Cherry-picked commits (2):

  • b93fb47 refactor: Simplify resource manager construction
  • 1bb3658 wsl: report a single "all" device to kubelet

This backport was automatically created by the backport bot.

elezar and others added 2 commits April 17, 2026 15:03
In order to prepare for the WSL changes, we remove the tegra resource
manager and pull the basic function implementations into the base type.
This means that the base type is essentially a resource manager that
does not support health checking and always uses distributed allocation.

Signed-off-by: Evan Lezar <[email protected]>
(cherry picked from commit b93fb47)

On WSL, all GPUs are accessed through /dev/dxg. Replace the per-GPU
wslDevice (which reported one device per physical GPU with individual
UUIDs) with a stateless wslAllGPUsDevice that always returns UUID "all"
and path "/dev/dxg". This causes the device map to collapse to a single
entry per resource, so kubelet sees exactly one GPU device on WSL.
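The collapse described above can be sketched as follows. Only the name `wslAllGPUsDevice`, the UUID "all", and the path "/dev/dxg" come from the commit message; the `deviceInfo` interface, the method names, and `buildDeviceMap` are hypothetical and chosen purely for illustration.

```go
package main

import "fmt"

// deviceInfo is a hypothetical minimal interface for this sketch.
type deviceInfo interface {
	GetUUID() (string, error)
	GetPaths() ([]string, error)
}

// wslAllGPUsDevice is stateless: every instance reports the same
// UUID and the same device node, regardless of the physical GPU.
type wslAllGPUsDevice struct{}

func (wslAllGPUsDevice) GetUUID() (string, error)    { return "all", nil }
func (wslAllGPUsDevice) GetPaths() ([]string, error) { return []string{"/dev/dxg"}, nil }

// buildDeviceMap keys devices by UUID, so devices that share a UUID
// collapse into a single entry.
func buildDeviceMap(devices []deviceInfo) (map[string][]string, error) {
	m := make(map[string][]string)
	for _, d := range devices {
		uuid, err := d.GetUUID()
		if err != nil {
			return nil, err
		}
		paths, err := d.GetPaths()
		if err != nil {
			return nil, err
		}
		m[uuid] = paths
	}
	return m, nil
}

func main() {
	// Two physical GPUs, both represented by the same stateless device.
	m, err := buildDeviceMap([]deviceInfo{wslAllGPUsDevice{}, wslAllGPUsDevice{}})
	if err != nil {
		panic(err)
	}
	fmt.Println(len(m), m["all"]) // prints "1 [/dev/dxg]"
}
```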

When allocated, this flows naturally through all strategy paths
(envvar, CDI, volume mounts) to set NVIDIA_VISIBLE_DEVICES=all, which
is what nvidia-container-runtime on WSL expects.
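For the envvar strategy only, a minimal sketch of why a single "all" device produces the expected variable: if the allocated device IDs are joined into the value of NVIDIA_VISIBLE_DEVICES, then an allocation of the one "all" device yields exactly `all`. The helper name `visibleDevicesEnv` is hypothetical, and the CDI and volume-mount paths are not modelled here.

```go
package main

import (
	"fmt"
	"strings"
)

// visibleDevicesEnv is a hypothetical helper that turns the allocated
// device IDs into the environment passed to the container.
func visibleDevicesEnv(deviceIDs []string) map[string]string {
	return map[string]string{
		"NVIDIA_VISIBLE_DEVICES": strings.Join(deviceIDs, ","),
	}
}

func main() {
	// On WSL the only device kubelet can allocate has the ID "all".
	env := visibleDevicesEnv([]string{"all"})
	fmt.Println(env["NVIDIA_VISIBLE_DEVICES"]) // prints "all"
}
```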

Co-Authored-By: Claude Sonnet 4.6 <[email protected]>
Signed-off-by: Evan Lezar <[email protected]>
(cherry picked from commit 1bb3658)

@copy-pr-bot

copy-pr-bot Bot commented Apr 17, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@elezar elezar added this to the v0.19.1 milestone Apr 17, 2026
@elezar elezar requested review from cdesiniotis and rahulait April 17, 2026 15:12
@cdesiniotis
Contributor

/ok to test 80a51ff

@cdesiniotis cdesiniotis merged commit f27c813 into release-0.19 Apr 17, 2026
11 checks passed
@cdesiniotis cdesiniotis deleted the backport-1671-to-release-0.19 branch April 17, 2026 21:59
@elezar elezar mentioned this pull request Apr 20, 2026
2 participants