|
Benchmarking and Optimizing Averager algorithm on Jetson Nano
|
|
1
|
1
|
December 14, 2025
|
|
How to benchmark on Thor to get the real FP4/FP8 performance TFOPS
|
|
5
|
78
|
December 11, 2025
|
|
Run hpc_benchmark23.10 HPL with v100GPU
|
|
4
|
1738
|
November 24, 2025
|
|
Clarification on CUDA IPC: Does cudaMemcpyDeviceToDevice guarantee remote memory visibility?
|
|
0
|
37
|
November 13, 2025
|
|
Thor torch.mm benchmark results (float32/float16/float8_e3m2fn)
|
|
5
|
218
|
September 15, 2025
|
|
nVidia nVector - download and documentation
|
|
3
|
3176
|
August 5, 2025
|
|
cuDNN vs cuBLAS performance on GEMMs
|
|
0
|
92
|
June 19, 2025
|
|
Anyone has comparison of LLM engines(TRTLLM/VLLM/MLC)?
|
|
3
|
445
|
June 16, 2025
|
|
Has Anyone Benchmarked (U-Net Segmentation) on Jetson Orin Series?
|
|
2
|
186
|
June 2, 2025
|
|
Source Code of Cutlass GemmKernel from Basic Gemm
|
|
1
|
88
|
April 16, 2025
|
|
Orin nano/nx ResNet-50 benchmark on R36.4.3(jetpack6.2)
|
|
8
|
446
|
March 24, 2025
|
|
Orin nano benchmark on R36.4.3(jetpack6.2)
|
|
14
|
500
|
February 26, 2025
|
|
Issue encountered while executing jetson_benchmarks from GitHub
|
|
3
|
156
|
December 3, 2024
|
|
FPS calculation (estimate) for NVIDIA RTX 2000 Ada Generation Embedded GPU
|
|
0
|
84
|
November 3, 2024
|
|
ONNX engine initialisation/build takes significantly longer in TensorRT 8.5 vs 8.0
|
|
10
|
1561
|
August 20, 2024
|
|
Fp32 precision support on Jetson AGX Orin
|
|
2
|
558
|
June 4, 2024
|
|
Tx2 Benchmarks error
|
|
3
|
306
|
May 21, 2024
|
|
Compare cpu vs gpu execution time with google benchmark
|
|
0
|
585
|
February 15, 2024
|
|
Freeze when running benchmarks
|
|
14
|
1092
|
December 15, 2023
|
|
Jetson Orin Developer Kit - unexpected drop in PCIe transfer speed
|
|
4
|
861
|
December 6, 2023
|
|
Jetson_benchmark Minimum memory requirements
|
|
19
|
1235
|
November 14, 2023
|
|
Jetson_benchmarks got Error opening engine file
|
|
7
|
1035
|
September 7, 2023
|
|
Isaac Sim very slow compared to Mujoco or PyBullet (both physics and rendering)
|
|
5
|
2823
|
April 5, 2024
|
|
L4 Quality vs throughput with FFMPEG
|
|
0
|
686
|
July 21, 2023
|
|
Jetson Xavier NX slower than Jetson TX2 at pytorch inferences
|
|
4
|
628
|
June 29, 2023
|
|
Floating point exception when running HPC-Benchmark:23.3
|
|
0
|
919
|
April 28, 2023
|
|
Questions about whether HPL uses Tensor Core in A100
|
|
3
|
980
|
April 27, 2023
|
|
L40 vs. RTX 6000 Ada FP16/FP8 throughput?
|
|
7
|
15792
|
April 4, 2023
|
|
CUDA benchmark
|
|
2
|
1422
|
March 20, 2023
|
|
Large difference between dcgmproftester and specs
|
|
1
|
1118
|
December 26, 2022
|