[Unity] CUDA Graph update #15320

masahi · 2023-07-14T20:06:52Z

Allow capturing input buffers passed from a user if num_input attribute is set. The last func->params.size() - num_inputs inputs are assumed to be fixed and thus they can be captured into a cuda graph. Users need to be careful if they intend to pass different parameter tensors, for example in LoRa deployment.
Cache the instantiated exec object rather than the captured graph, since we assume that the graph is fixed anyway. I found that calling cudaGraphLaunch on every launch is super expensive.
Support CUDA graph for CUTLASS BYOC.

tvm-bot · 2023-07-14T20:06:55Z

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

cc @quic-sanirudh _{See #10317 for details}

_{Generated by tvm-bot}

* allow capturing input parameters in a cuda graph * remove unnecessary cudaGraphLaunch * support cuda graph for cutlass * add test * add test for cutlass * revert LiftTransformParams change * comment * update test * update builtin * update * delete exec properly * run cuda graph twice in the test to make sure cached launch works

masahi added 12 commits July 13, 2023 11:19

allow capturing input parameters in a cuda graph

b495144

remove unnecessary cudaGraphLaunch

6183d1d

support cuda graph for cutlass

0789236

add test

a7e3574

add test for cutlass

64f85ad

revert LiftTransformParams change

85ed523

comment

4b1b8d6

update test

f9b7eea

update builtin

26ccb15

update

ad7cbf9

delete exec properly

c34f38c

run cuda graph twice in the test to make sure cached launch works

644b345

vinx13 approved these changes Jul 14, 2023

View reviewed changes

masahi merged commit 783b467 into apache:unity Jul 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Unity] CUDA Graph update #15320

[Unity] CUDA Graph update #15320

Uh oh!

masahi commented Jul 14, 2023 •

edited

Loading

Uh oh!

tvm-bot commented Jul 14, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[Unity] CUDA Graph update #15320

[Unity] CUDA Graph update #15320

Uh oh!

Conversation

masahi commented Jul 14, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tvm-bot commented Jul 14, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

masahi commented Jul 14, 2023 •

edited

Loading