| Tutorial: Build llama.cpp from source and run Qwen3 235B | 25 | 960 | December 12, 2025 |
| Announcing new VLLM container & 3.5X increase in Gen AI Performance in just 5 weeks of Jetson AGX Thor Launch | 45 | 2253 | December 12, 2025 |
| "unable to allocate CUDA0 buffer" after Updating Ubuntu Packages | 149 | 5458 | December 12, 2025 |
| Installing llama.cpp | 4 | 11 | December 12, 2025 |
| Very slow mmap on DGX Spark that affects model loading - questions to NVIDIA | 26 | 925 | December 12, 2025 |
| Clarification on NVIDIA embedding/reranker API access and costs | 1 | 12 | December 12, 2025 |
| Building llama.cpp container images for Spark/GB10 | 5 | 171 | December 11, 2025 |
| Architectural insights needed: Why is the MIG 3g.71gb instance consistently the "Efficiency Sweet Spot" on H200? | 1 | 38 | December 11, 2025 |
| Invalid Certificate error | 1 | 24 | December 9, 2025 |
| Build and Deploy a Multi-Agent Chatbot on a Workstation | 1 | 79 | December 8, 2025 |
| Investigating metrics which cause profiler to fail | 0 | 8 | December 8, 2025 |
| When we install an LLM model and start a chat session, the response speed becomes extremely slow | 1 | 90 | December 6, 2025 |
| Llama 3.1 nemotron 70b instruct API access not working correctly | 1 | 16 | December 5, 2025 |
| DGX Spark crashes when running tensorrt-llm | 1 | 48 | December 5, 2025 |
| Having issue running NeMo Guardrail Docker Image | 0 | 18 | December 4, 2025 |
| DGX Spark – Request AI Enterprise / NIM Entitlement Activation | 5 | 129 | December 4, 2025 |
| NIM LLM Containers Fail on DGX Spark (GB10): Triton/vLLM Crash on sm_121 and NGC Permission Errors | 2 | 108 | December 3, 2025 |
| Any tips on running Magistral? | 11 | 211 | December 3, 2025 |
| My DGX Spark keeps freezing and crashing when I try to run this code no matter the LLM | 0 | 34 | December 2, 2025 |
| Cannot enable --enable-auto-tool-choice and --tool-call-parser | 0 | 50 | December 1, 2025 |
| LLAMA 3.2 3B Full finetuning much slower than benchmark | 0 | 53 | November 30, 2025 |
| Agentic/vibe coding configuration? Particularly - context window | 0 | 56 | November 29, 2025 |
| Proposal: AION — Vendor-Agnostic AI-Driven Out-of-Band Operations Node | 0 | 10 | November 27, 2025 |
| Certificate verify failed while installing NIM models | 0 | 35 | November 26, 2025 |
| Continue.dev agentic alternative: roo code | 0 | 60 | November 22, 2025 |
| Second NIM container won't start due to less than desired GPU memory utilization | 10 | 215 | December 3, 2025 |
| How to deploy the model | 6 | 53 | November 19, 2025 |
| Question Regarding Draft Model Support AnythingLLM via NVIDIA NIM | 2 | 66 | November 19, 2025 |
| Function not found using meta/llama-3.2-11b-vision-instruct | 2 | 77 | November 17, 2025 |
| NVIDIA NIM: LLAMA-4 (Maverick) Image for performance Benchmarking on Nvidia H200 GPUs | 2 | 154 | November 17, 2025 |