[Profiler] Add CUDA Overhead to Auto-trace #142271

sraikund16 · 2024-12-06T23:13:16Z

Summary: We already have CUDA OVERHEAD events enabled in on-demand so we should also add them to auto-trace

Test Plan: Tested using internal performance suites and found no noticeable performance change

Differential Revision: D66904879

pytorch-bot · 2024-12-06T23:13:20Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/142271

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 40845a2 with merge base d3d1a78 ():

UNSTABLE - The following job failed but was likely due to flakiness present on trunk and has been marked as unstable:

Lint / workflow-checks / linux-job (gh) (#142485)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2024-12-06T23:13:38Z

This pull request was exported from Phabricator. Differential Revision: D66904879

netlify · 2024-12-06T23:14:01Z

✅ Deploy Preview for chimerical-cranachan-793287 ready!

Name	Link
🔨 Latest commit	`e52cb9b`
🔍 Latest deploy log	https://app.netlify.com/sites/chimerical-cranachan-793287/deploys/6753850ffcb6ef00083cd12c
😎 Deploy Preview	https://deploy-preview-142271--chimerical-cranachan-793287.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

Summary: We already have CUDA OVERHEAD events enabled in on-demand so we should also add them to auto-trace Test Plan: Tested using servicelab and found no performance difference: kineto_benchmark duration_ms: 21668 number_of_events: 26542 profiler_prepare_call_duration_us: 970 profiler_enable_call_duration_us: 616474 profiling_window_duration_us: 2188525 profiler_disable_call_duration_us: 148628 parse_kineto_call_duration_us: 1672536 function_events_build_tree_call_duration_us: 285939 kineto_benchmark duration_ms: 21718 number_of_events: 26556 profiler_prepare_call_duration_us: 885 profiler_enable_call_duration_us: 7037 profiling_window_duration_us: 1772481 profiler_disable_call_duration_us: 174122 parse_kineto_call_duration_us: 1983683 function_events_build_tree_call_duration_us: 333582 Differential Revision: D66904879

facebook-github-bot · 2024-12-10T02:14:35Z

This pull request was exported from Phabricator. Differential Revision: D66904879

sraikund16 · 2024-12-10T07:57:30Z

@pytorchbot rebase

pytorchmergebot · 2024-12-10T07:59:01Z

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

pytorchmergebot · 2024-12-10T07:59:04Z

Tried to rebase and push PR #142271, but it was already up to date. Try rebasing against main by issuing:
@pytorchbot rebase -b main

sraikund16 · 2024-12-10T08:00:16Z

@pytorchbot rebase -b main

pytorchmergebot · 2024-12-10T08:01:45Z

@pytorchbot started a rebase job onto refs/remotes/origin/main. Check the current status here

Summary: We already have CUDA OVERHEAD events enabled in on-demand so we should also add them to auto-trace Test Plan: Tested using servicelab and found no performance difference: kineto_benchmark duration_ms: 21668 number_of_events: 26542 profiler_prepare_call_duration_us: 970 profiler_enable_call_duration_us: 616474 profiling_window_duration_us: 2188525 profiler_disable_call_duration_us: 148628 parse_kineto_call_duration_us: 1672536 function_events_build_tree_call_duration_us: 285939 kineto_benchmark duration_ms: 21718 number_of_events: 26556 profiler_prepare_call_duration_us: 885 profiler_enable_call_duration_us: 7037 profiling_window_duration_us: 1772481 profiler_disable_call_duration_us: 174122 parse_kineto_call_duration_us: 1983683 function_events_build_tree_call_duration_us: 333582 Differential Revision: D66904879

pytorchmergebot · 2024-12-10T08:01:48Z

Successfully rebased export-D66904879 onto refs/remotes/origin/main, please pull locally before adding more changes (for example, via git checkout export-D66904879 && git pull --rebase)

sraikund16 · 2024-12-10T18:32:34Z

@pytorchmergebot merge

pytorchmergebot · 2024-12-10T18:34:19Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Summary: We already have CUDA OVERHEAD events enabled in on-demand so we should also add them to auto-trace Test Plan: Tested using internal performance suites and found no noticeable performance change Differential Revision: D66904879 Pull Request resolved: pytorch#142271 Approved by: https://github.com/ngimel

facebook-github-bot added the fb-exported label Dec 6, 2024

ngimel approved these changes Dec 6, 2024

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Dec 6, 2024

ngimel added the release notes: profiler release notes category label Dec 9, 2024

sraikund16 force-pushed the export-D66904879 branch from e52cb9b to 02e2c3e Compare December 10, 2024 02:14

pytorchmergebot force-pushed the export-D66904879 branch from 02e2c3e to 40845a2 Compare December 10, 2024 08:01

sraikund16 added the topic: improvements topic category label Dec 10, 2024

pytorchmergebot added the merging label Dec 10, 2024

pytorchmergebot closed this in d102cfa Dec 10, 2024

pytorchmergebot added Merged and removed merging labels Dec 10, 2024

[Profiler] Add CUDA Overhead to Auto-trace #142271

[Profiler] Add CUDA Overhead to Auto-trace #142271

Uh oh!

Conversation

sraikund16 commented Dec 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Dec 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/142271

✅ You can merge normally! (1 Unrelated Failure)

Uh oh!

facebook-github-bot commented Dec 6, 2024

Uh oh!

netlify bot commented Dec 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for chimerical-cranachan-793287 ready!

Uh oh!

facebook-github-bot commented Dec 10, 2024

Uh oh!

sraikund16 commented Dec 10, 2024

Uh oh!

pytorchmergebot commented Dec 10, 2024

Uh oh!

pytorchmergebot commented Dec 10, 2024

Uh oh!

sraikund16 commented Dec 10, 2024

Uh oh!

pytorchmergebot commented Dec 10, 2024

Uh oh!

pytorchmergebot commented Dec 10, 2024

Uh oh!

sraikund16 commented Dec 10, 2024

Uh oh!

pytorchmergebot commented Dec 10, 2024

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

sraikund16 commented Dec 6, 2024 •

edited

Loading

pytorch-bot bot commented Dec 6, 2024 •

edited

Loading

netlify bot commented Dec 6, 2024 •

edited

Loading