-
Notifications
You must be signed in to change notification settings - Fork 3.7k
Closed
Labels
ep:CUDAissues related to the CUDA execution providerissues related to the CUDA execution provider
Description
Describe the issue
I tried to run cuda graph on unet model, and there is error like
2023-03-07 19:26:21.706256103 [E:onnxruntime:, sequential_executor.cc:494 ExecuteKernel] Non-zero status code returned while running Attention node. Name:'Attention_0' Status Message: /onnxruntime_src/onnxruntime/core/framework/bfc_arena.cc:368 void* onnxruntime::BFCArena::AllocateRawInternal(size_t, bool, onnxruntime::Stream*, bool, onnxruntime::WaitNotificationFn) Failed to allocate memory for requested buffer of size 15728896
If I disable cuda graph, the model can run smoothly.
To reproduce
Download the unet.zip and uncompress to a folder.
The command line: python test_unet.py -i unet.onnx.
Urgency
No response
Platform
Linux
OS Version
Ubuntu 20.04
ONNX Runtime Installation
Released Package
ONNX Runtime Version or Commit ID
1.14.1
ONNX Runtime API
Python
Architecture
X64
Execution Provider
CUDA
Execution Provider Library Version
No response
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
ep:CUDAissues related to the CUDA execution providerissues related to the CUDA execution provider