How come TensorRT-LLM is not supported on the AGX Thor? I tried to port my TensorRT-LLM-based project from the Orin to the Thor. After a bit of debugging and going down rabbit holes on this forum, I found out that not only is it no longer supported, but it will be replaced by TensorRT Edge-LLM. Is this a totally different framework? Is it a fork of TensorRT-LLM? Also, is it not available yet?
Hi,
TensorRT-LLM doesn’t work on the Jetson platform.
For Thor, we will support TensorRT Edge-LLM as a replacement, starting with the upcoming release.
Thanks.
Hi,
Do you mean on the newest iteration of the Jetson platform? It works just fine on the older AGX Orin (props to @dusty_nv for the tutorials and container images). I am curious whether my TensorRT-LLM Python code could be easily ported to TensorRT Edge-LLM once it's available.
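For context, most of my code uses the high-level LLM API, so the porting effort really depends on whether TensorRT Edge-LLM exposes something similar. A minimal sketch of the kind of code I'd be porting (the model name is just an example):

```python
# A minimal sketch of TensorRT-LLM's high-level LLM API, as used on the Orin.
# The model name is only an example; any supported HF checkpoint works.
from tensorrt_llm import LLM, SamplingParams

llm = LLM(model="TinyLlama/TinyLlama-1.1B-Chat-v1.0")
params = SamplingParams(max_tokens=64, temperature=0.8)

for output in llm.generate(["What runs faster on Thor than on Orin?"], params):
    print(output.outputs[0].text)
```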
Thanks.
The lack of transparency around TensorRT Edge-LLM, and the missing explanation for why a fork is needed rather than extensions to TensorRT-LLM, is really frustrating, especially since this is a Blackwell GPU. Any information would be super appreciated @nvidia - thanks :-)
Hi both,
As TensorRT Edge-LLM has not been released yet, we are not able to share detailed information on the public forum.
But there is no plan to support TensorRT-LLM on the Jetson platform.
There was a workable version, but we don't officially support it.
Thanks.
Hi,
What is its approximate release date?
Thanks.
Hi,
TensorRT Edge-LLM will come with JetPack 7.1.
We expect to release JetPack 7.1 early next year.
Thanks.
OK, that makes sense; I suppose Thor is an edge device, or, as ChatGPT likes to say, "an edge device that's been to the gym".
Any chance you can share more on the comment "there was a workable version but we don't officially support it"? Was that a solution from another Thor user? I tried to compile TensorRT-LLM from scratch, but I got into dependency hell with cuBLAS and other issues compiling all the CUDA kernels. Ooof.
Hi,
For now, you can try running LLMs with the container below instead:
Thanks.
Hi there,
I couldn’t find a TensorRT-LLM image in the jetson-containers repo — are you working with the 0.12.0-jetson branch?
Is it possible to use the latest version of TensorRT-LLM on Jetson Orin, and could you point me to some related docs or examples?
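For reference, this is the trivial check I run inside a container to see which build I'm on:

```python
# Print the bundled TensorRT-LLM version
import tensorrt_llm
print(tensorrt_llm.__version__)
```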
Thanks a lot!
Hi,
For Orin, it's recommended to try our Ollama or vLLM containers below:
Ollama:
vLLM:
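Both containers can expose an OpenAI-compatible HTTP endpoint, so a quick client-side sanity check could look like the sketch below. The port and model name are assumptions (vLLM serves on 8000 by default, Ollama's compatible endpoint on 11434); adjust them to match how you launch the server.

```python
# A sketch of a sanity check against an OpenAI-compatible server started
# by either container. Base URL and model name are assumptions.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="not-needed")

resp = client.chat.completions.create(
    model="meta-llama/Llama-3.1-8B-Instruct",  # example model id
    messages=[{"role": "user", "content": "Hello from the Orin!"}],
    max_tokens=64,
)
print(resp.choices[0].message.content)
```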
Thanks.