| Topic | Replies | Views | Activity |
|---|---|---|---|
| About the GPU-Accelerated Libraries category | 0 | 5551 | February 1, 2020 |
| cublasDx batched gather gemm | 3 | 39 | April 20, 2026 |
| Install Llama 405B on two Sparks with ConnectX-7 | 0 | 8 | April 20, 2026 |
| cublasSgemmGroupedBatched requires host-side synchronization after preceding TRSM on A5000 (device-side ordering insufficient) | 1 | 15 | April 19, 2026 |
| Library not found on Ubuntu 24.04 install | 0 | 9 | April 18, 2026 |
| [BUG] OpenCL #include directive causes CL_OUT_OF_HOST_MEMORY on driver 595.97 | 5 | 43 | April 17, 2026 |
| Shocking cuSolver implicit synchronizations | 0 | 14 | April 17, 2026 |
| Sparse matrix solve fails with cuda-sparse with cuDSS enabled | 0 | 17 | April 15, 2026 |
| Follow-up on NVIDIA Inception Program Application (Applied on March 26, 2026) | 2 | 42 | April 15, 2026 |
| Bf16 is half of fp16 tflops with same instruction __hfma2 on H100 | 1 | 591 | April 13, 2026 |
| Failed to create instance: action not allowed, contact support (7) https://discord.gg/b3mN94m3dr \| code: 10 | 3 | 24 | April 12, 2026 |
| Undefined reference to non-CTX api function | 1 | 98 | April 11, 2026 |
| cuBLAS batched FP32 SGEMM dispatcher picks suboptimal kernel on RTX 5090 (sm_120) | 0 | 29 | April 10, 2026 |
| Can GPU's L2 Cache CPU's Memory on Grace Hopper | 2 | 51 | April 9, 2026 |
| Does anyone have experience using CuDSS for a general, non-symmetric, indefinite, positive matrix? | 1 | 47 | April 9, 2026 |
| Request for more information regarding CURAND to CURANDDx migration and help on staying more aware of changes | 1 | 59 | April 8, 2026 |
| Numerical Analysis of Collatz Conjecture: Identified 90,902-Digit Integrity Deficit using CUDA HPC | 3 | 62 | April 7, 2026 |
| Why does compute-sanitizer give invalid global writes on cufft store callback functions with Toolkit 12.2 but our cufft with callback functions work f | 0 | 16 | March 30, 2026 |
| Cutensor errors on compute capability 6.1 card (Quadro P6000) | 0 | 23 | March 27, 2026 |
| cusparseDgstv2 fortran issue: uninitialized memory access and out of bounds access | 1 | 27 | March 26, 2026 |
| Proposal: SU(7) Vector Resonance Model for High-Performance Multi-Layer Processing | 8 | 41 | March 25, 2026 |
| cuFFTMp scalability issue on A100 machine | 7 | 109 | March 24, 2026 |
| Application Status Check — MatrixSentry (Submitted March 10) | 3 | 46 | March 24, 2026 |
| HDPM: 2x GDDR7 Bandwidth Enhancement — Same Voltage, Same Power | 2 | 21 | March 17, 2026 |
| GPU Memory Leak in nppiConvert_8u32f_C1R_Ctx Function | 6 | 148 | March 16, 2026 |
| DXGI/D3D11 regression on Maxwell GM107 — dual-GPU Optimus-disabled — Intel P630 driver 2140 breaks adapter negotiation | 0 | 38 | March 15, 2026 |
| R with OpenACC on new WSL Ubuntu | 3 | 587 | March 14, 2026 |
| cuFile only running compat mode for RTX 6000 pro & Samsung 9100 Pro SSD on ASUS TUF Gaming B850 | 1 | 38 | March 12, 2026 |
| Utilization metrics across accelerators | 0 | 39 | March 7, 2026 |
| The aConstants parameter does not work when using the planar twist API | 0 | 24 | March 5, 2026 |