|
How to extract nested loop features from CUDA kernels for LLM-based optimization?
|
|
0
|
34
|
November 25, 2025
|
|
Global Memory Stores Not Eliminated After Register Promotion
|
|
3
|
54
|
November 6, 2025
|
|
Nvcc segfault on jetson thor R38.2.2
|
|
2
|
55
|
October 17, 2025
|
|
Nvlddmkm issue leading to BSOD during AI inference
|
|
0
|
91
|
October 10, 2025
|
|
Understanding uniform registers
|
|
5
|
195
|
October 9, 2025
|
|
PTX createpolicy instruction compile failure
|
|
2
|
65
|
May 21, 2025
|
|
Using a nvfortran compiled library with nvcc
|
|
2
|
84
|
May 18, 2025
|
|
Understanding Linker Behavior and Symbol Resolution in GPU Device Compilation (CUDA/NVLink)
|
|
0
|
51
|
May 16, 2025
|
|
Error loading dynamic parallelism kernel from fatbin via CUDA driver api
|
|
2
|
59
|
May 9, 2025
|
|
CUDA error: an illegal memory access was encountered (Address 0x0 is out of bounds)
|
|
4
|
154
|
April 3, 2025
|
|
Invalid result when using multiple GPUs with openmp threads
|
|
3
|
82
|
March 26, 2025
|
|
Unable to link nvinfer_lean_static.a and cuda kernels into same binary
|
|
2
|
82
|
March 17, 2025
|
|
OpenMP: unsupported opcode=OMPTARGETDATA
|
|
5
|
907
|
February 7, 2025
|
|
Operator precedence bug on NVC compiler
|
|
2
|
55
|
January 29, 2025
|
|
Is nvidia-persistenced necessary for confidential computing?
|
|
0
|
111
|
December 6, 2024
|
|
Strange usage of local memory
|
|
2
|
64
|
November 10, 2024
|
|
-DCMAKE_PREFIX_PATH=<> can't able to find this path
|
|
0
|
102
|
October 19, 2024
|
|
How to compile .cu files into static lib and use it in C projects
|
|
2
|
133
|
October 11, 2024
|
|
Finding location of Warning: Cannot do atomic on local memory
|
|
1
|
107
|
October 3, 2024
|
|
Reducing branches causes longer duration
|
|
1
|
45
|
August 26, 2024
|
|
'nvlink fatal : Could not open input file' when linking with empty static library
|
|
8
|
4242
|
August 18, 2024
|
|
Nvlink error : Undefined reference to '__cudaCDP2GetLastError
|
|
2
|
177
|
August 15, 2024
|
|
Non-deterministic compilation when anonymous namespaces are used
|
|
2
|
181
|
August 13, 2024
|
|
C++ Smart Pointers and OpenACC
|
|
3
|
382
|
July 31, 2024
|
|
Nvcc c++20 std::variant complie failed
|
|
4
|
909
|
July 24, 2024
|
|
Missing nvcc from arm64-sbsa cross for Ubuntu 20.04 when using deb (local)
|
|
1
|
511
|
July 10, 2024
|
|
Pip install nvidida-pyindex && pip install nvidia-cuda-nvcc does not actually install nvcc
|
|
2
|
2017
|
July 1, 2024
|
|
Driver/library version mismatch Error
|
|
0
|
242
|
June 18, 2024
|
|
NVCC error when trying to compile FFMPEG with --enable-cuda-nvcc flag
|
|
5
|
13378
|
June 8, 2024
|
|
Compiling programs that use dynamic parallelism (in Thrust) with device link time optimization
|
|
1
|
221
|
May 31, 2024
|