fix memory profile for partial graph run by pengwa · Pull Request #11911 · microsoft/onnxruntime

pengwa · 2022-06-20T08:09:34Z

Description: fix memory profile for partial graph run

Memory Profile is helpful for analyze memory peak used in static ORT planning.

This PR targets to fix the issues when enable memory profile in ORTModule partial graph run. Two issues:

Loose the failure if some tensors are missing in track.
Only summarized after backward partial graph run.

Validated with
./build.sh --config $flavor --use_cuda --enable_training --build_wheel --skip_tests --cuda_version=11.3 --parallel 8 --enable_training_torch_interop --cmake_extra_defines onnxruntime_ENABLE_MEMORY_PROFILE=ON

and

pytest -k test_forward_call_single_positional_argument orttraining/orttraining/test/python/orttraining_test_ortmodule_api.py

Motivation and Context

Why is this change required? What problem does it solve?
If it fixes an open issue, please link to the issue here.

This reverts commit fb60beb.

…o pengwa/memory_profile_fix

pengwa added 3 commits June 15, 2022 08:45

fix mpi build for gcc8 or higher

fb60beb

fix memory profile for partial graph run

b38197a

Revert "fix mpi build for gcc8 or higher"

3f656e9

This reverts commit fb60beb.

pengwa added the training issues related to ONNX Runtime training; typically submitted using template label Jun 20, 2022

pengwa requested review from Lafi7e, askhade, tlh20 and wezuo June 20, 2022 08:10

pengwa added 4 commits June 20, 2022 08:17

remove debug code

7b20929

Merge branch 'master' of https://github.com/microsoft/onnxruntime int…

5ecc5f1

…o pengwa/memory_profile_fix

fix build

5e72519

fix build

0d64157

wezuo previously approved these changes Jun 23, 2022

View reviewed changes

fix cpplint and python black format

0fdd885

pengwa dismissed wezuo’s stale review via 0fdd885 June 23, 2022 02:40

Merge branch 'master' of https://github.com/microsoft/onnxruntime int…

510d94e

…o pengwa/memory_profile_fix

Lafi7e approved these changes Jun 24, 2022

View reviewed changes

pengwa merged commit 0d6cbc6 into master Jun 24, 2022

pengwa deleted the pengwa/memory_profile_fix branch June 24, 2022 05:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix memory profile for partial graph run#11911

fix memory profile for partial graph run#11911
pengwa merged 9 commits intomasterfrom
pengwa/memory_profile_fix

pengwa commented Jun 20, 2022 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

pengwa commented Jun 20, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

pengwa commented Jun 20, 2022 •

edited

Loading