CLI upgrades by rshemet · Pull Request #504 · cactus-compute/cactus

rshemet · 2026-03-06T15:03:18Z

Adding a few features:

1. RAM usage display in `cactus run`:

You: hey how are you?
Assistant: I am doing well, thank you for asking! How are you? I'm feeling a little bit flexible with my responses.

Could you tell me about your day?

[36 tokens | latency: 0.026s | total: 0.233s | 154 tok/s | RAM: 101.5 MB]

`--system` flag for custom system prompt in `cactus run`:

Example: cactus run google/gemma-3-270m-it --system "Speak like Shakespeare"

Image handler in `cactus run` for VLMs:

List models and see which ones are downloaded using `cactus list`

includes an optional --downloaded flag to list downloaded models only
moves model list to a shared model.json so we don't duplicate the logic between this and the publish to HF yml

Signed-off-by: Roman Shemet <[email protected]>

- Add `cactus list` to show all supported models grouped by type - Add `--downloaded` flag to filter to local models only - Extract model registry from workflow YAML into shared models.json - Both CLI and publish workflow now read from single source of truth - Add `quantization=` field to config.txt for accurate precision display - Green ⬇ indicator, tags, disk size, and quantization for downloaded models Signed-off-by: Roman Shemet <[email protected]>

Signed-off-by: Roman Shemet <[email protected]>

rshemet added 8 commits March 6, 2026 12:46

display RAM usage in chat stats

c6cb092

Signed-off-by: Roman Shemet <[email protected]>

add --system flag for system prompt injection

0ffdf43

Signed-off-by: Roman Shemet <[email protected]>

add /image command and --image flag for VLM chat

55c1d13

Signed-off-by: Roman Shemet <[email protected]>

guard image features for non-VLM models

1f208d3

Signed-off-by: Roman Shemet <[email protected]>

clean up trailing whitespace in chat.cpp

54ae77d

Signed-off-by: Roman Shemet <[email protected]>

merge main: sync model registry with upstream changes

46eaaf4

Signed-off-by: Roman Shemet <[email protected]>

add Qwen3.5-0.8B and Qwen3.5-2B to model registry

0dad10a

Signed-off-by: Roman Shemet <[email protected]>

HenryNdubuaku merged commit 687a610 into main Mar 6, 2026
3 of 5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CLI upgrades#504

CLI upgrades#504
HenryNdubuaku merged 8 commits intomainfrom
cli/feature-upgrades

rshemet commented Mar 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

rshemet commented Mar 6, 2026

1. RAM usage display in cactus run:

--system flag for custom system prompt in cactus run:

Image handler in cactus run for VLMs:

List models and see which ones are downloaded using cactus list

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

1. RAM usage display in `cactus run`:

`--system` flag for custom system prompt in `cactus run`:

Image handler in `cactus run` for VLMs:

List models and see which ones are downloaded using `cactus list`