After a random amount of time, the GPUs become unavailable inside all of the running containers, and nvidia-smi returns the following error: “Failed to initialize NVML: Unknown Error”
After doing some research online, I tried the following:
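For example, checking from the host looks something like this (<container-name> is just a placeholder for any of the affected containers):

docker exec -it <container-name> nvidia-smi
Failed to initialize NVML: Unknown Error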
- Uncommenting “no-cgroups = false” in the /etc/nvidia-container-runtime/config.toml file (see the commands after this list)
- Restarting Docker: sudo systemctl restart docker
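In other words, the change was equivalent to something like the following (the sed one-liner is only for illustration; it assumes the line in config.toml was commented out exactly as “#no-cgroups = false”):

# uncomment no-cgroups = false in the toolkit config
sudo sed -i 's/^#no-cgroups = false/no-cgroups = false/' /etc/nvidia-container-runtime/config.toml
# restart Docker so the change is picked up
sudo systemctl restart docker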
But this didn’t seem to solve the issue.
Some forums stated that the output of cat /proc/cmdline should contain “systemd.unified_cgroup_hierarchy=false”, but when I run cat /proc/cmdline I get:
BOOT_IMAGE=/vmlinuz-5.15.0-157-generic root=/dev/mapper/ubuntu--vg-ubuntu--lv ro
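If I understand those forum posts correctly, that parameter would have to be added to the kernel command line via GRUB, roughly like this (I have not applied this yet, and GRUB_CMDLINE_LINUX_DEFAULT may already contain other options that need to be kept):

sudo nano /etc/default/grub
# set: GRUB_CMDLINE_LINUX_DEFAULT="systemd.unified_cgroup_hierarchy=false"
sudo update-grub
sudo reboot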
How can I fix this once and for all so that the GPUs are always available to the containers?