
Update ollama to 0.3.9 and add +cuda variant #46204

Merged
alalazo merged 8 commits into spack:develop from brettviren:develop
Sep 27, 2024

Conversation

@brettviren
Member

No description provided.

@brettviren
Member Author

This does build with and without CUDA, but I am unable to figure out how to make a model actually use my GPUs (2x 4090). I suspect it has to do with not finding all of the CUDA libraries from the Spack cuda package.

FWIW, I can download the binary from ollama's GitHub release page (which includes its own copy of CUDA libs) and get a model running on the GPU.

@teaguesterling
Contributor

@spackbot fix style

@spackbot-app

spackbot-app bot commented Sep 7, 2024

Let me see if I can fix that for you!

@spackbot-app

spackbot-app bot commented Sep 7, 2024

I was able to run spack style --fix for you!

spack style --fix
==> Running style checks on spack
  selected: isort, black, flake8, mypy
==> Modified files
  var/spack/repos/builtin/packages/ollama/package.py
==> Running isort checks
  isort checks were clean
==> Running black checks
reformatted var/spack/repos/builtin/packages/ollama/package.py
All done! ✨ 🍰 ✨
1 file reformatted.
  black checks were clean
==> Running flake8 checks
  flake8 checks were clean
==> Running mypy checks
lib/spack/spack/version/version_types.py:145: error: Argument 2 to "StandardVersion" has incompatible type "*Tuple[Tuple[Any, ...], Tuple[Any, ...]]"; expected "Tuple[Tuple[Any, ...], Tuple[Any, ...]]"  [arg-type]
lib/spack/spack/version/version_types.py:452: error: Argument 2 to "StandardVersion" has incompatible type "*Tuple[Tuple[Any, ...], Tuple[Any, ...]]"; expected "Tuple[Tuple[Any, ...], Tuple[Any, ...]]"  [arg-type]
lib/spack/spack/version/version_types.py:481: error: Argument 2 to "StandardVersion" has incompatible type "*Tuple[Tuple[Any, ...], Tuple[Any, ...]]"; expected "Tuple[Tuple[Any, ...], Tuple[Any, ...]]"  [arg-type]
Found 3 errors in 1 file (checked 620 source files)
  mypy found errors
Keep in mind that I cannot fix your flake8 or mypy errors, so if you have any you'll need to fix them and update the pull request. If I was able to push to your branch, you will need to pull from your updated branch before pushing further changes.

I've updated the branch with style fixes.

@teaguesterling
Contributor

Thanks for taking this on! I was hoping to get back to this and get a GPU build going. I don't have many good CUDA resources available for testing on my dev system, but I'll see if I can help sort out the build details.

@brettviren
Member Author

I think the CUDA runtime "problem" I had was a mirage, caused by a mix of wrong expectations about Spack's cuda package and misleading "warning" messages from ollama serve.

I had thought Spack's cuda actually provided libcuda.so but in fact the package.py says:

Note: This package does not currently install the drivers necessary
to run CUDA. These will need to be installed manually. See:
https://docs.nvidia.com/cuda/ for details.

The stubs/libcuda.so that is provided holds dummy implementations to satisfy link-time dependencies on systems that lack a "real" libcuda.so.

With that knowledge in hand, I let ollama serve pick up Debian's libcuda.so and can now see the GPUs finally being used:
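The distinction between the stub and a real driver library can be checked at runtime. Here is a minimal sketch (my own, not part of the package) that tries to load the driver's libcuda.so.1 with only the Python standard library; the function name `probe_cuda_driver` is hypothetical, and the behavior of the stub's dummy implementations is an assumption:

```python
import ctypes

def probe_cuda_driver():
    """Try to load the CUDA driver library and query its version.

    Returns the driver version as an int (e.g. 12050 for CUDA 12.5),
    or None if no usable driver library is available.
    """
    try:
        # A real driver installation ships libcuda.so.1; Spack's
        # stubs/libcuda.so only satisfies link-time dependencies.
        lib = ctypes.CDLL("libcuda.so.1")
    except OSError:
        return None
    version = ctypes.c_int(0)
    # cuDriverGetVersion returns CUDA_SUCCESS (0) on a working driver;
    # a stub's dummy implementation is not expected to report success.
    if lib.cuDriverGetVersion(ctypes.byref(version)) != 0:
        return None
    return version.value

print(probe_cuda_driver())
```

On a machine with the Debian driver packages installed this should report a nonzero version; on a driver-less build host it returns None, which is exactly the situation the stub library exists to paper over at link time.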

$ spack install ollama+cuda
$ spack load ollama
$ ollama serve
$ ollama run llama3.1
$ nvidia-smi|grep ollama
|    0   N/A  N/A   1097881      C   ...unners/cuda_v12/ollama_llama_server       6142MiB |

There was also some prior confusion due to WARN level messages from ollama serve:

time=2024-09-12T11:56:49.553-04:00 level=WARN source=gpu.go:669 msg="unable to locate gpu dependency libraries"
time=2024-09-12T11:56:49.553-04:00 level=WARN source=gpu.go:669 msg="unable to locate gpu dependency libraries"

However, with OLLAMA_DEBUG=true ollama serve, in addition to the WARN I get more comforting DEBUG messages:

time=2024-09-12T11:57:22.128-04:00 level=DEBUG source=gpu.go:491 msg="gpu library search" globs="[libcuda.so* /home/wcwc/libcuda.so* /usr/local/cuda*/targets/*/lib/libcuda.so* /usr/lib/*-linux-gnu/nvidia/current/libcuda.so* /usr/lib/*-linux-gnu/libcuda.so* /usr/lib/wsl/lib/libcuda.so* /usr/lib/wsl/drivers/*/libcuda.so* /opt/cuda/lib*/libcuda.so* /usr/local/cuda/lib*/libcuda.so* /usr/lib*/libcuda.so* /usr/local/lib*/libcuda.so*]"
time=2024-09-12T11:57:22.131-04:00 level=DEBUG source=gpu.go:525 msg="discovered GPU libraries" paths=[/usr/lib/x86_64-linux-gnu/nvidia/current/libcuda.so.555.42.06]
CUDA driver version: 12.5
time=2024-09-12T11:57:22.255-04:00 level=DEBUG source=gpu.go:119 msg="detected GPUs" count=2 library=/usr/lib/x86_64-linux-gnu/nvidia/current/libcuda.so.555.42.06
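The "gpu library search" DEBUG line above is just a series of filesystem globs. As a rough re-creation for anyone debugging discovery on their own box, this sketch replays the absolute patterns from that log (the working-directory and $HOME entries are omitted, and `find_cuda_libraries` is a name of my own invention, not ollama's):

```python
import glob

# Absolute glob patterns copied from ollama's DEBUG output above.
CUDA_LIB_GLOBS = [
    "/usr/local/cuda*/targets/*/lib/libcuda.so*",
    "/usr/lib/*-linux-gnu/nvidia/current/libcuda.so*",
    "/usr/lib/*-linux-gnu/libcuda.so*",
    "/usr/lib/wsl/lib/libcuda.so*",
    "/usr/lib/wsl/drivers/*/libcuda.so*",
    "/opt/cuda/lib*/libcuda.so*",
    "/usr/local/cuda/lib*/libcuda.so*",
    "/usr/lib*/libcuda.so*",
    "/usr/local/lib*/libcuda.so*",
]

def find_cuda_libraries():
    """Return every libcuda.so* path matched by the glob patterns."""
    found = []
    for pattern in CUDA_LIB_GLOBS:
        found.extend(glob.glob(pattern))
    return sorted(set(found))

print(find_cuda_libraries())
```

Note that none of these patterns look inside a Spack prefix, which is consistent with the conclusion above: the driver library has to come from the host system, not from the Spack cuda package.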

So in the end, all seems good. A fresh push is imminent.

@teaguesterling
Contributor

Sorry to keep dragging you along here! I'm not as familiar with CUDA packages as I am with other things. I took a bit more time to review this morning and had a few more suggestions:

  • The CudaPackage class actually adds both variant("cuda",...) and depends_on("cuda", when="+cuda", ...) for you, so we can actually remove those. (It also adds a few other things).
  • The package auditing standards appear to have changed (just in the last few days, it seems) and now the setup_build_environment method needs to be added to the Builder class. I've tested this and just moving the method works as expected.
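Taken together, the two suggestions above point at a recipe shaped roughly like this sketch. It is illustrative only, not the merged package.py: the build-system base class, URLs, and the `CUDA_LIB_DIR` environment variable are assumptions, and versions/checksums are elided.

```python
# Illustrative sketch of a Spack recipe; names marked below are assumptions.
import spack.build_systems.go
from spack.package import *


class Ollama(GoPackage, CudaPackage):
    """Run large language models locally (sketch)."""

    homepage = "https://ollama.com"
    git = "https://github.com/ollama/ollama.git"

    # No explicit variant("cuda") or depends_on("cuda", when="+cuda"):
    # the CudaPackage mixin contributes both, along with cuda_arch
    # handling and related conflicts.


class GoBuilder(spack.build_systems.go.GoBuilder):
    # Per the newer package-audit rules, setup_build_environment
    # belongs on the Builder class rather than the package class.
    def setup_build_environment(self, env):
        if self.spec.satisfies("+cuda"):
            # Hypothetical variable name: point the build at the
            # Spack-provided CUDA toolkit libraries.
            env.set("CUDA_LIB_DIR", self.spec["cuda"].libs.directories[0])
```

The point of the move is purely structural: the method body is unchanged, it just lives on the builder so the audit can find it where multi-build-system packages expect it.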


@teaguesterling (Contributor) left a comment


Sorry to give you the run-around on these changes. I'll open a PR into your branch with them as well to make it easier to implement.

@teaguesterling
Contributor

All stylistic considerations aside: I was able to more or less confirm that this works for building with CUDA. I have an ancient CUDA-compatible card in a machine. It was able to compile with CUDA support and got as far as detecting that my card was a dinosaur before dropping back to CPU.

Fixing style audits and simplifying dependencies
@teaguesterling
Contributor

@spackbot fix style

@spackbot-app

spackbot-app bot commented Sep 16, 2024

Let me see if I can fix that for you!

@teaguesterling
Contributor

@spackbot rerun pipeline

@teaguesterling
Contributor

@spackbot rerun pipeline

@brettviren something seems broken in the CI unrelated to the package. Rerunning again in hopes it's resolved.

@teaguesterling
Contributor

@spackbot rerun pipeline

@spack spack deleted a comment from spackbot-app bot Sep 24, 2024
@spack spack deleted a comment from spackbot-app bot Sep 24, 2024
@spack spack deleted a comment from spackbot-app bot Sep 24, 2024
@teaguesterling
Contributor

Lgtm!

@alalazo alalazo merged commit 3637c08 into spack:develop Sep 27, 2024


3 participants