Hi everybody,
I am not used to post on such forums but i’ve been trying to make CUDA work on my machine for days and can’t manage to do it… Well, almost.
I installed cuda successfully 2 weeks ago, but when i wanted to give it another try because i needed some stuff to be done, i wasn’t able to communicate with my GPU anymore. I tried everything, installed hundreds of times following different tutorials but nothing.
I also reinstalled my Linux still nothing. Here’s the procedure I am following and some informations.
I am running an x64 versions of Debian 8.5, I installed CUDA via apt-get install nvidia-cuda-toolkit which worked well. I then rebooted, i can compile sucessfully but cannot run it and got “error 38”. Which is from what i understood a problem communicating with the GPU. (as root)
nvidia-smi returns the following :
“NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.” (as root)
ls /dev/nvidia* returns : /dev/nvidia0 /dev/nvidiactl
I also installed using the .run file made for ubuntu but had other strange error that i can’t remember. Could you guide me through the installation ?
I can provide all it needs to make it work… I don’t get why it’s not working correctly…
I am using the RUN files and not the Debian packages, because these are always a little bit behind.
The problem with the RUN installer is that whenever Debian updates Mesa, XOrg, kernel or some other package indirectly related to the driver then this can remove symlinks, libraries and even the driver’s kernel module. It’s important that you look at which packages are being updated whenever you run apt or any other package manager.
Quickest way to check if your driver is still alive and kicking is to open the Nvidia settings and to click through each page. An empty page for video or OpenGL is often an indicator for a Debian update having undone something.
So you simply rerun the Nvidia installer:
logout from the desktop
switch to the console with Ctrl-Alt-F1
login as root
stop the display manager with “service lightdm stop” or “service gdm stop” …
rerun the Nvidia RUN installer
start the display manager with “service lightdm start” or “service gdm start” …
You should see the window system coming up again.
If you are using the Nvidia driver from the Debian repository then you’ll probably have to reinstall the package. I don’t have any experience with those and I can’t really help you there.
If you’ve been using the RUN installer that came with CUDA then best don’t use it. Just get the latest driver from Nvidia’s diver download page. You’ll get a more recent driver this way.