| Topic | Replies | Views | Activity |
| --- | --- | --- | --- |
| Announcing new VLLM container & 3.5X increase in Gen AI Performance in just 5 weeks of Jetson AGX Thor Launch | 46 | 2259 | December 14, 2025 |
| Cannot enable --enable-auto-tool-choice and --tool-call-parser | 0 | 50 | December 1, 2025 |
| Certificate verify failed while installing NIM models | 0 | 35 | November 26, 2025 |
| Second NIM container won't start due to less than desired GPU memory utilization | 10 | 216 | December 3, 2025 |
| Question Regarding Draft Model Support AnythingLLM via NVIDIA NIM | 2 | 67 | November 19, 2025 |
| NVIDIA NIM: LLAMA-4 (Maverick) Image for performance Benchmarking on Nvidia H200 GPUs | 2 | 154 | November 17, 2025 |
| 401 Unauthorized when calling NVIDIA Integrate API (/v1/chat/completions) from container (API key works for /v1/models but fails for chat) | 0 | 40 | November 6, 2025 |
| Vllm client connection refused | 10 | 97 | October 31, 2025 |
| Trying to Run DAPT-Continual PreTraining on Chip_design Data | 2 | 75 | August 5, 2025 |
| Speculative decoding using vLLM on the Nvidia Jetson AGX Orin 64GB dev kit | 0 | 203 | March 9, 2025 |
| VSS local deployment single gpu: Failed to load VIA stream handler - Guardrails / CA-RAG setup failed | 4 | 169 | July 4, 2025 |
| Example-hybrid-rag | 7 | 177 | June 2, 2025 |
| Model _ request Model Does not exist error | 0 | 62 | May 31, 2025 |
| SOTA inference speed using SGlang and EAGLE-3 speculative decoding on the NVIDIA Jetson AGX Orin | 2 | 889 | March 23, 2025 |
| Batch processing using NVIDIA NIM \| Docker \| Self-hosted | 11 | 633 | January 29, 2025 |
| Running LMdeploy inference engine on the NVIDIA Jetson AGX Orin Devkit | 2 | 202 | January 14, 2025 |
| Failed to MLC-compile mlc-ai/Llama-3.1-8B-Instruct-fp8-MLC on Jetson AGX orin | 5 | 277 | January 13, 2025 |
| Jetson Orin Nano Super Dev Kit Performance | 6 | 1095 | January 28, 2025 |
| How to fix 0 compatible profiles? Where to get compatible profiles? | 4 | 665 | November 26, 2024 |
| Boosting LLM Inference Speed Using Speculative Decoding in MLC-LLM on Nvidia Jetson AGX Orin | 0 | 253 | November 23, 2024 |
| NIM TensorRT-LLM on H100 NVL | 2 | 263 | November 22, 2024 |
| Unable to Run NIM on H100 GPU Due to Profile Compatibility Issue Despite Sufficient GPU Resources | 1 | 285 | November 12, 2024 |
| NIM does not support llama-3.1-8b-instruct and llama-3.1-70b-instruct on GH200 On-Prem deployment | 1 | 325 | November 7, 2024 |
| Reusing a stored model (llama-3.1-8b-instruct) with a proper profile | 0 | 208 | October 30, 2024 |
| LoRA swapping inference Llama-3.1-8b-instruct \| Exception: lora format could not be determined | 4 | 263 | October 22, 2024 |
| Running Ollama / llama3.1 on Jetson AGX Xavier 16gb is it possible? how-to? | 8 | 2593 | October 19, 2024 |
| NIM API key not Found | 4 | 815 | September 21, 2024 |
| API connect | 1 | 257 | September 20, 2024 |
| Problem with installation of Llama 3.1 8b NIM | 1 | 652 | September 4, 2024 |
| Fail to evaluate LLM efficiency using nemo evaluator | 0 | 38 | August 28, 2024 |