Upgrade ROCm wheels to manylinux2_28 - 1a of 2 (docker images) #140681
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/140681
Note: Links to docs will display an error until the docs builds have been completed.
❌ 1 New Failure
As of commit 6642fb5 with merge base cb8c956: NEW FAILURE - The following job has failed:
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@huydhn @atalman Got this error for the rocm6.1 docker build job:
I have created the missing ECR record
Thanks, can you please do it for rocm6.2 as well? That will also need an ECR record.
@huydhn Also, I suppose this means we will need to request you to create a new ECR record for every ROCm upgrade?
That's the current process. Also, due to #140958 earlier today, I haven't been able to deploy the ECR change yet. I will need to check with @jeanschmidt when we can resume infra deployment. We could use
@pytorchbot rebase
@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here
Successfully rebased
Force-pushed from b0b44ca to 1a87ee6
Force-pushed from 1a87ee6 to 3dd5cb0
I just created it and am starting the infra deployment now, so give it about 15 minutes to finish.
Force-pushed from 3dd5cb0 to 2a4560f
@pytorchbot rebase
@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here
Force-pushed from 2a4560f to cb09ef8
Force-pushed from 434d09a to 6642fb5
@pytorchbot merge -f "Manylinux/libtorch/CI docker image builds for ROCm completed with build duration 40-45min; other CI failures unrelated"
Merge started
Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes).
Questions? Feedback? Please reach out to the PyTorch DevX Team
…er images) (#141609)
Upgrade gcc version from 9 to 11 on ROCm manylinux images. Needed for #141423, since the almalinux8-based manylinux2_28 images for ROCm (#140681) install gcc-toolset-9, which installs [gcc 9.2.1](https://pkgs.org/download/gcc-toolset-9-gcc-c++). However, PyTorch CMakeLists.txt enforces a [minimum gcc version of 9.3](https://github.com/pytorch/pytorch/blob/5318bf8baf19fecda365c185cd81196e3cfb08e3/CMakeLists.txt#L61).
Pull Request resolved: #141609
Approved by: https://github.com/jeffdaily
Co-authored-by: Jithun Nair
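The version mismatch described in the commit message above (gcc 9.2.1 installed, 9.3 required) can be illustrated with a small check. This is only a sketch and not part of the PR or of PyTorch's build scripts: the helper names `parse_gcc_version` and `meets_minimum` are hypothetical, and the sample banner strings are illustrative rather than captured from the actual images.

```python
import re

# Minimum gcc version quoted in the commit message above (major, minor).
MIN_GCC = (9, 3)


def parse_gcc_version(text):
    """Return the first dotted version number found in text as a tuple of ints.

    Hypothetical helper: it expects something shaped like the first line of
    `gcc --version`, e.g. "gcc (GCC) 9.2.1 20191120".
    """
    match = re.search(r"(\d+)\.(\d+)(?:\.(\d+))?", text)
    if match is None:
        raise ValueError(f"no version number found in {text!r}")
    return tuple(int(part) for part in match.groups() if part is not None)


def meets_minimum(version, minimum=MIN_GCC):
    """Compare only as many components as the minimum specifies."""
    return tuple(version[: len(minimum)]) >= tuple(minimum)


if __name__ == "__main__":
    # Illustrative banners, not real output from the docker images.
    for banner in ("gcc (GCC) 9.2.1 20191120", "gcc (GCC) 11.2.1 20220127"):
        version = parse_gcc_version(banner)
        print(version, "ok" if meets_minimum(version) else "too old")
```

Comparing tuples rather than strings avoids the usual pitfall where "11" sorts before "9" lexicographically.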
…h#140681)
Fixes pytorch#140631
Highlights:
* Use `cpu_final` base for ROCm in `.ci/docker/manywheel/Dockerfile_2_28`
* Cleans up install_miopen.sh to remove old ROCm references
* Install `gcc-gfortran` package to build magma for ROCm on almalinux
Needs builder PR pytorch/builder#2043 (merged) so that GCC_ABI expected value is updated.
Pull Request resolved: pytorch#140681
Approved by: https://github.com/jeffdaily
Depends on #140681 and #141609
Highlights:
* Upgrade binaries to ROCm6.2.4 to use latest docker images
* Remove pre-cxx11 builds for libtorch on ROCm
* Use manylinux2_28 docker images for ROCm
* Set `DESIRED_DEVTOOLSET=cxx-abi` (and hence `_GLIBCXX_USE_CXX11_ABI=1`) for ROCm manylinux2_28 wheels (ROCm RHEL8 packages also have GCC_ABI=1, so it keeps it consistent)
Pull Request resolved: #141423
Approved by: https://github.com/jeffdaily
Co-authored-by: Jeff Daily
Co-authored-by: Pruthvi Madugundu
Fixes #140631
Highlights:
* Use `cpu_final` base for ROCm in `.ci/docker/manywheel/Dockerfile_2_28`
* Cleans up install_miopen.sh to remove old ROCm references
* Install `gcc-gfortran` package to build magma for ROCm on almalinux
Needs builder PR pytorch/builder#2043 (merged) so that GCC_ABI expected value is updated.
cc @jeffdaily @sunway513 @pruthvistony @ROCmSupport @dllehr-amd @jataylo @hongxiayang @naromero77amd