mmc0: Timeout waiting for hardware interrupt

Hi NVIDIA,

We need to address an issue where all interrupts are aggregated on and processed by CPU0 only, so we applied the following patch: “re-enable GICv2m for PCIe MSI interrupts”.

R36.3 Patch to re-enable GICv2m for PCIe MSI interrupts and restore I/O performance - Jetson Systems / Jetson AGX Orin - NVIDIA Developer Forums

We are now seeing a problem in testing: occasionally, writes to the MMC disk fail and the system freezes. The log suggests that the MMC interrupt is not being serviced correctly. Could you please advise on how to resolve this?
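
To see which CPU is actually servicing a given device's interrupts, the per-CPU counters in `/proc/interrupts` can be compared before and after a test run. The following is a minimal sketch, not part of the patch: the helper name `irq_per_cpu` is hypothetical, and it assumes the device name (for example `mmc0`) appears in the description column of the interrupt line.

```shell
# irq_per_cpu NAME [FILE]
# Print the per-CPU interrupt counts for every line whose text contains NAME.
# FILE defaults to /proc/interrupts; a saved copy can be passed instead.
# (Hypothetical helper for illustration only.)
irq_per_cpu() {
    name=$1
    file=${2:-/proc/interrupts}
    awk -v name="$name" '
        # Header row: one column label per online CPU (CPU0, CPU1, ...).
        NR == 1 { ncpu = NF; for (i = 1; i <= NF; i++) cpu[i] = $i; next }
        # Matching row: field 1 is the IRQ number, fields 2..ncpu+1 are counts.
        index($0, name) {
            line = $1
            for (i = 1; i <= ncpu; i++) line = line " " cpu[i] "=" $(i + 1)
            print line
        }
    ' "$file"
}
```

Running `irq_per_cpu mmc0` twice, a few seconds apart during a write test, shows whether the counters are increasing at all and on which CPU.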

Basic info: kernel 5.15.148-rt-tegra, JetPack 6.2, Jetson AGX Orin 64GB

# cat /etc/nv_tegra_release 
# R36 (release), REVISION: 4.3, GCID: 38968081, BOARD: generic, EABI: aarch64, DATE: Wed Jan  8 01:49:37 UTC 2025
# KERNEL_VARIANT: oot
TARGET_USERSPACE_LIB_DIR=nvidia
TARGET_USERSPACE_LIB_DIR_PATH=usr/lib/aarch64-linux-gnu/nvidia

20260525-172127.log (1.4 MB)

mmc0: Timeout waiting for hardware interrupt.
[   28.872612] mmc0: running CQE recovery
[   28.874490] mmc0: cache flush error -110
[   40.159220] mmc0: running CQE recovery
[   40.161197] mmc0: cache flush error -110
[   18.763753] mmc0: running CQE recovery
[   19.265601] mmc0: cqhci: Failed to halt
[   30.177597] mmc0: Timeout waiting for hardware interrupt.
[   30.177603] mmc0: sdhci: ============ SDHCI REGISTER DUMP ===========
[   30.177607] mmc0: sdhci: Sys addr:  0x00000000 | Version:  0x00000505
[   30.177610] mmc0: sdhci: Blk size:  0x00007200 | Blk cnt:  0x00000080
[   30.177613] mmc0: sdhci: Argument:  0x401d0008 | Trn mode: 0x00000033
[   30.177615] mmc0: sdhci: Present:   0x11fb00f0 | Host ctl: 0x00000039
[   30.177618] mmc0: sdhci: Power:     0x0000000f | Blk gap:  0x00000000
[   30.177620] mmc0: sdhci: Wake-up:   0x00000000 | Clock:    0x0000000f
[   30.177623] mmc0: sdhci: Timeout:   0x0000000e | Int stat: 0x00000000
[   30.177625] mmc0: sdhci: Int enab:  0x00ff0003 | Sig enab: 0x00fc0003
[   30.177628] mmc0: sdhci: ACmd stat: 0x00000000 | Slot int: 0x00000000
[   30.177630] mmc0: sdhci: Caps:      0x3f6cd08c | Caps_1:   0x18002f73
[   30.177632] mmc0: sdhci: Cmd:       0x00002c1e | Max curr: 0x00000000
[   30.177635] mmc0: sdhci: Resp[0]:   0x00000900 | Resp[1]:  0x200021ae
[   30.177637] mmc0: sdhci: Resp[2]:   0x26468000 | Resp[3]:  0x00000000
[   30.177639] mmc0: sdhci: Host ctl2: 0x0000300d
[   30.177642] mmc0: sdhci: ADMA Err:  0x00000000 | ADMA Ptr: 0x0000007ffffe7090
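
A few fields of the dump above can be decoded by hand. Error `-110` is `ETIMEDOUT` on Linux. The sketch below uses plain shell arithmetic on values copied from the dump; the bit positions follow the standard SDHCI register layout (command index in bits 13:8 of the Command register, block size in bits 11:0 of the Block Size register), and the variable names are chosen here for illustration only.

```shell
# Values copied from the SDHCI register dump in the log.
cmd=0x00002c1e
blk_size=0x00007200
blk_cnt=0x00000080
int_stat=0x00000000

# Command register: bits 13:8 hold the command index.
cmd_index=$(( (cmd >> 8) & 0x3f ))
# Block Size register: bits 11:0 hold the transfer block size in bytes.
block_bytes=$(( blk_size & 0xfff ))
# Block Count register: number of blocks in the current transfer.
blocks=$(( blk_cnt & 0xffff ))
transfer_bytes=$(( block_bytes * blocks ))

echo "command index: CMD$cmd_index"
echo "transfer size: $blocks blocks x $block_bytes bytes = $transfer_bytes bytes"
if [ $(( int_stat )) -eq 0 ]; then
    echo "Int stat is zero: no interrupt status bit was pending at dump time"
fi
```

With these values the command index is 44, a command-queue command in eMMC 5.1, which is consistent with the CQE recovery messages, and the transfer is 128 blocks of 512 bytes. `Int stat` being zero only says that no status bit was set when the timeout fired; it does not by itself show whether the controller never raised the interrupt or the interrupt was raised and lost.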

Hi,

Could you help dump this information?

cat /sys/class/mmc_host/mmc0/mmc0:0001/manfid

# cat /sys/class/mmc_host/mmc0//mmc0:0001/manfid
0x000045
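
If more card details are useful, the other identification attributes in the same sysfs directory can be collected in one go. This is a sketch only: the helper name `dump_mmc_card` is hypothetical, the attribute list is an assumption about what is relevant here, and attributes that a given kernel or card does not expose are skipped.

```shell
# dump_mmc_card DIR
# Print selected identification attributes of an MMC card from its sysfs
# directory, e.g. /sys/class/mmc_host/mmc0/mmc0:0001
# (Hypothetical helper for illustration only.)
dump_mmc_card() {
    dir=$1
    for attr in manfid oemid name date hwrev fwrev prv life_time pre_eol_info; do
        # Skip attributes that are missing or unreadable on this system.
        [ -r "$dir/$attr" ] && printf '%s: %s\n' "$attr" "$(cat "$dir/$attr")"
    done
    return 0
}
```

Example: `dump_mmc_card /sys/class/mmc_host/mmc0/mmc0:0001`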

Let me check this internally.

Has there been any progress on the issue?

We will update you with the result soon. Thanks for your patience.