PR #13603: NVTX: name threads, CUDA devices and CUDA streams #70044
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
PR #13603: NVTX: name threads, CUDA devices and CUDA streams
Imported from GitHub PR openxla/xla#13603
This aims to improve the profiling experience. These names are shown in the Nsight Systems UI.
Device names:

Stream names:

Thread names:

This also provides a missing link between replica IDs in the HLO and the physical devices in the profile.
Copybara import of the project:
--
5b3121c58db8aa1b6529f0aeb8573be8bf2cde80 by Olli Lupton [email protected]:
NVTX: name threads, CUDA devices and CUDA streams
--
d973674de6218fcee88473d85bb43ba345652fdf by Olli Lupton [email protected]:
Address review comments
--
918cf3e7b87150e9d666b218bbd9aca0cae606a4 by Olli Lupton [email protected]:
Alternative for @jbaiocchi
--
1d1978437e64c0dac97e97ea4320a6dcb3945296 by Olli Lupton [email protected]:
Address more review comments
Merging this change closes #13603
FUTURE_COPYBARA_INTEGRATE_REVIEW=openxla/xla#13603 from olupton:name-devices-streams-and-threads 1d1978437e64c0dac97e97ea4320a6dcb3945296