python3Packages.torch: 2.7.1 -> 2.8.0#431973
Conversation
|
@zeuner Any idea? |
It fails fetching the submodule Looking further for a good way to mitigate this. |
Thanks a lot! |
3109d97 to
bb6a34d
Compare
bb6a34d to
4340d29
Compare
4340d29 to
013fafc
Compare
|
Hello, I don't know if this will help much, but I believe I have a working setup for pytorch 2.8.0 with ROCm. I don't think all of the things I did were strictly necessary. But maybe some of the patches will help you. Feel free to take as little or as much from that repo as you wish :) |
Thanks for proposing. |
013fafc to
ffeee28
Compare
|
Hi, I got around this in my derivation by intentionally downloading setuptools 79.0.1 as you can see here: https://github.com/benjaminhottell/nix-rocm-pytorch/blob/main/derivations/setuptools-79_0_1.nix I then set up a special python for pytorch's build process with that specific setuptools included: This is because pytorch only supports (to my knowledge) versions under version 80.0.0 I believe the version of setuptools that you get from Skimming through your PR, I believe you are currently patching out the requirements to make a version greater than 80.0.0 work anyway. I have not tried this, and I am unsure if that will work. In my experience the setup.py script is rather smart and you may need to patch more than the pyproject.toml entry. |
|
Now failing during compilation ( It's weird because Probably introduced in pytorch/pytorch@9b2a45a. |
|
|
|
This should work: # path override breaks finding site packages
substituteInPlace torch/CMakeLists.txt \
--replace-fail 'env PYTHONPATH=' 'env PYTHONPATH_unused=' |
ffeee28 to
cf9c05a
Compare
|
ROCm build still needs some work. Some substitutes no longer apply. LoadHIP changed and is crashing finding rocm-core. Can probably comment out |
|
Now failing with: |
Does that make any sense? |
|
Seems like rocmPackages.clr in buildInputs messes up the python 3.12 env. :/ |
|
Should we just mark broken if python version == 3.12 + rocmSupport enabled? That's probably an issue in rocmPackages.clr and I can try to address it separately. |
Sure, fine by me. |
Why exactly? |
@NickCao I think what's happening is that clr propagates rocminfo which propagates python3 which is ending up as python3.13 because rocmPackages aren't part of the python package set. rocminfo probably needs this dropped: https://github.com/NixOS/nixpkgs/blame/2a54616c742ed35e28b1e599b528c4ead690af7f/pkgs/development/rocm-modules/6/rocminfo/default.nix#L39 - but might need more patching on top of that and will trigger rebuilds of the entire ROCm package set which includes CK and takes 5ever to review run. Could try to fix it in this PR but the root cause been messed up for 3 years and is only breaking 3.12 torchWithRocm so I think addressing it in a followup PR is fine. |
Breaking all non default python versions? Mark all of them as broken then. |
|
Do you have a suggestion for the specific broken = expression? |
|
rocminfo python leakage fix PR: #438399 |
4f6645e to
15e06b1
Compare
I managed to mark it as broken on non-default python versions. |
There was a problem hiding this comment.
This is supposed to be set externally by us, as an envvar.
There was a problem hiding this comment.
Turns out we can drop the ROCM_VERSION change entirely and don't need to add an env var.
I think this was working around the rocm version header being broken in an earlier iteration of nixpkgs rocm (6.0?), and hasn't been needed since rocmPackages.clr learned to include a proper version header.
Test with the version patch dropped:
python3.13-torch> hip VERSION: 6.3.42134
python3.13-torch> -- Reading ROCM version from: /nix/store/6j8iah88is5dpqpyq32bppm3r6h56qw6-clr-6.3.3/include/rocm-core/rocm_version.h
python3.13-torch> -- Content:
python3.13-torch>
python3.13-torch> ***** ROCm version from rocm_version.h ****
python3.13-torch>
python3.13-torch> ROCM_VERSION_DEV: 6.3.3
python3.13-torch> ROCM_VERSION_DEV_MAJOR: 6
python3.13-torch> ROCM_VERSION_DEV_MINOR: 3
python3.13-torch> ROCM_VERSION_DEV_PATCH: 3
python3.13-torch> ROCM_VERSION_DEV_INT: 60303
python3.13-torch> HIP_VERSION_MAJOR: 6
python3.13-torch> HIP_VERSION_MINOR: 3
python3.13-torch> TORCH_HIP_VERSION: 603
There was a problem hiding this comment.
Is this handwritten or generated, this looks like a maintenance nightmare.
There was a problem hiding this comment.
This is generated by python3Packages.torch.unroll-src.
15e06b1 to
b2d1c50
Compare
b2d1c50 to
ab2d7e4
Compare
|
|
|
Thanks everyone for your help! |
Things done
Update the torch ecosystem:
torch: 2.7.1 -> 2.8.0 https://github.com/pytorch/pytorch/releases/tag/v2.8.0cc @teh @thoughtpolice @tscholak
cc @junjihashimoto
torchaudio: 2.7.1 -> 2.8.0 https://github.com/pytorch/audio/releases/tag/v2.8.0cc @junjihashimoto
torchvision: 0.22.1 -> 0.23.0 https://github.com/pytorch/vision/releases/tag/v0.23.0cc @ericsagnes
cc @junjihashimoto
passthru.tests.nixpkgs-reviewon this PR. See nixpkgs-review usage../result/bin/.Add a 👍 reaction to pull requests you find important.