Fix conda install on distributions with strict POSIX sh #91371
See also conda/conda#10431 Signed-off-by: Edward Z. Yang <[email protected]> [ghstack-poisoned]
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/91371
Note: Links to docs will display an error until the docs builds have been completed.
❌ 2 Failures
As of commit 8a925e0: NEW FAILURES - The following jobs have failed:
This comment was automatically generated by Dr. CI and updates every 15 minutes.
See also conda/conda#10431 Signed-off-by: Edward Z. Yang <ezyangfb.com> ghstack-source-id: 38cc3d1 Pull Request resolved: #91371
@pytorchbot merge -f "the rest of the problems look like preexisting conditions"
Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
But I don't see how this PR can be possibly related...
@pytorchbot revert -m "trunk / cuda11.6-py3.10-gcc7-sm86 / test (slow, 1, 2, linux.g5.4xlarge.nvidia.gpu) started to fail after this PR with mypy error" -c ignoredsignal
@pytorchbot successfully started a revert job. Check the current status here.
@ezyang your PR has been successfully reverted.
…)" This reverts commit 57dcd93. Reverted #91371 on behalf of https://github.com/kit1980 due to trunk / cuda11.6-py3.10-gcc7-sm86 / test (slow, 1, 2, linux.g5.4xlarge.nvidia.gpu) started to fail after this PR with mypy error
I still don't understand what's going on, but with the revert of this PR "trunk / cuda11.6-py3.10-gcc7-sm86 / test (slow, 1, 2, linux.g5.4xlarge.nvidia.gpu)" passed.
Let's try again after #91410
@pytorchbot rebase
@pytorchbot successfully started a rebase job. Check the current status here
See also conda/conda#10431 Signed-off-by: Edward Z. Yang <ezyangfb.com> [ghstack-poisoned]
Successfully rebased
See also conda/conda#10431 Signed-off-by: Edward Z. Yang <ezyangfb.com> ghstack-source-id: 9a39ea1 Pull Request resolved: #91371
@pytorchbot merge -f "Fix docker builds"
Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
See also conda/conda#10431 Signed-off-by: Edward Z. Yang <[email protected]> Pull Request resolved: pytorch#91371 Approved by: https://github.com/albanD
We require the same fix that was made on upstream pytorch
pytorch/pytorch#91371
ROCm/pytorch@b72ec7c

Without this change install_conda.sh stage fails
```
#21 6.254 CondaFileIOError: '/opt/conda/pkgs/envs/*/env.txt'. [Errno 2] No such file or directory: '/opt/conda/pkgs/envs/*/env.txt'
#21 6.254
#21 ERROR: executor failed running [/bin/sh -c bash ./install_conda.sh && rm install_conda.sh]: exit code: 1
------
 > [conda 2/3] RUN bash ./install_conda.sh && rm install_conda.sh:
------
executor failed running [/bin/sh -c bash ./install_conda.sh && rm install_conda.sh]: exit code: 1
```
Locally tested with the `/builder/libtorch/build_docker.sh`
The issue was first solved in [/pull/91371] for CI/CD, but the main Dockerfile in the repo root still has it, which affects people trying to build a custom image manually. Without the fix, the build fails while installing miniconda:
```
#14 3.802 Preparing transaction: ...working... done
#14 4.087 Executing transaction: ...working... done
#14 5.713 /root/miniconda.sh: 438: /root/miniconda.sh: [[: not found
#14 5.713
#14 5.713 Installing * environment...
#14 5.713
#14 5.714 /root/miniconda.sh: 444: /root/miniconda.sh: [[: not found
#14 6.050
#14 6.050 CondaFileIOError: '/opt/conda/pkgs/envs/*/env.txt'. [Errno 2] No such file or directory: '/opt/conda/pkgs/envs/*/env.txt'
#14 6.050
```
With the modification, the build succeeds locally with `make -f ./docker.Makefile`, as instructed in the README.

Pull Request resolved: #92702
Approved by: https://github.com/seemethere, https://github.com/malfet
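
The `[[: not found` lines in the miniconda log suggest the installer was interpreted by a shell that lacks the bash-only `[[ ... ]]` test, which appears to be what happens when `/bin/sh` is a strict POSIX shell. The sketch below illustrates one common workaround: pass the script to `bash` explicitly so the shebang line is not used. It is an illustration under stated assumptions, not the change made in the PR; the script body is a hypothetical stand-in for the real installer, and it assumes `bash` is on the `PATH`.

```shell
# Hypothetical stand-in for an installer whose shebang says /bin/sh
# but whose body relies on the bash-only [[ ... ]] construct.
installer="$(mktemp)"
cat > "$installer" <<'EOF'
#!/bin/sh
if [[ "$1" == "-b" ]]; then
    echo "batch mode"
else
    echo "interactive mode"
fi
EOF

# Naming the interpreter on the command line bypasses the shebang,
# so [[ is understood no matter what /bin/sh points at.
result="$(bash "$installer" -b)"
echo "$result"

# Clean up the temporary script.
rm -f "$installer"
```

Executing the same file directly would hand it to whatever `/bin/sh` resolves to, and a shell without `[[` would report a "not found" error similar to the ones in the log.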