When I run the tensorrt-llm docker image and serve llama 4 DGX Spark crashes. Power cycle to recover.
docker run --rm -it --ipc host --gpus all --ulimit memlock=-1 --ulimit stack=67108864 -p 9000:8000 -v ~/.cache/huggingface:/root/.cache/huggingface nvcr.io/nvidia/tensorrt-llm/release:1.2.0rc4
trtllm-serve “nvidia/Llama-4-Scout-17B-16E-Instruct-NVFP4” --host 0.0.0.0