Topics tagged llama

Topic	Replies	Views	Activity
Tutorial: Build llama.cpp from source and run Qwen3 235B DGX Spark / GB10 Projects llama	25	960	December 12, 2025
Announcing new VLLM container & 3.5X increase in Gen AI Performance in just 5 weeks of Jetson AGX Thor Launch Jetson Thor jetson , llama-31-8b-instruct , llama , nemotron	45	2253	December 12, 2025
"unable to allocate CUDA0 buffer" after Updating Ubuntu Packages Jetson Orin Nano cuda , jetson , generative_ai , llama	149	5458	December 12, 2025
Installing llama.cpp Jetson Orin NX cuda , llama	4	11	December 12, 2025
Very slow mmap on DGX Spark that affects model loading - questions to NVIDIA DGX Spark / GB10 llama	26	925	December 12, 2025
Clarification on NVIDIA embedding/reranker API access and costs Visual AI Agent llama	1	12	December 12, 2025
Building llama.cpp container images for Spark/GB10 DGX Spark / GB10 Projects cuda , llama	5	171	December 11, 2025
Architectural insights needed: Why is the MIG 3g.71gb instance consistently the "Efficiency Sweet Spot" on H200? CUDA Programming and Performance llama	1	38	December 11, 2025
Invalid Certificate error Models llama	1	24	December 9, 2025
Build and Deploy a Multi-Agent Chatbot on a Workstation DGX Spark / GB10 Projects llama	1	79	December 8, 2025
Investigating metrics which cause profiler to fail Nsight Compute llama	0	8	December 8, 2025
When we install an LLM model and start a chat session, the response speed becomes extremely slow DGX Spark / GB10 llama	1	90	December 6, 2025
Llama 3.1 nemotron 70b instruct API access not working correctly NVIDIA Nemotron llama , nemotron	1	16	December 5, 2025
DGX Spark crashes when running tensorrt-llm DGX Spark / GB10 llama	1	48	December 5, 2025
Having issue running NeMo Guardrail Docker Image NVIDIA NeMo nim , llama , nemo-guardrails	0	18	December 4, 2025
DGX Spark – Request AI Enterprise / NIM Entitlement Activation DGX Spark / GB10 nim , llama , nemotron	5	129	December 4, 2025
NIM LLM Containers Fail on DGX Spark (GB10): Triton/vLLM Crash on sm_121 and NGC Permission Errors DGX Spark / GB10 jetson , nim , llama , nemotron	2	108	December 3, 2025
Any tips on running Magistral? DGX Spark / GB10 cuda , llama	11	211	December 3, 2025
My DGX Spark keeps freezing and crashing when I try to run this code no matter the LLM NVIDIA AI Workbench llama	0	34	December 2, 2025
Cannot enable --enable-auto-tool-choice and --tool-call-parser Models nim , llama-31-8b-instruct , llama	0	50	December 1, 2025
LLAMA 3.2 3B Full finetuning much slower than benchmark DGX Spark / GB10 llama	0	53	November 30, 2025
Agentic/vibe coding configuration? Particularly - context window DGX Spark / GB10 llama , agentic-ai	0	56	November 29, 2025
Proposal: AION — Vendor-Agnostic AI-Driven Out-of-Band Operations Node Network Management Products nim , llama , nemotron	0	10	November 27, 2025
Certificate verify failed while installing NIM models Models cuda , nim , llm , llama-31-8b-instruct , llama	0	35	November 26, 2025
Continue.dev agentic alternative: roo code DGX Spark / GB10 llama , agentic-ai	0	60	November 22, 2025
Second NIM container won't start due to less than desired GPU memory utilization DGX Spark / GB10 docker , nim , llama-31-8b-instruct , llama	10	215	December 3, 2025
How to deploy the model NVIDIA NeMo llama	6	53	November 19, 2025
Question Regarding Draft Model Support AnythingLLM via NVIDIA NIM DGX Spark / GB10 nim , llama-31-8b-instruct , llama	2	66	November 19, 2025
Function not found using meta/llama-3.2-11b-vision-instruct Models llama	2	77	November 17, 2025
NVIDIA NIM: LLAMA-4 (Maverick) Image for performance Benchmarking on Nvidia H200 GPUs Models cuda , nim , llama-31-8b-instruct , llama	2	154	November 17, 2025