Cli reconvert by jakmro · Pull Request #357 · cactus-compute/cactus

jakmro · 2026-02-16T04:01:40Z

No description provided.

Signed-off-by: jakmro <[email protected]>

Copilot

Pull request overview

This PR updates the CLI and publishing workflow to support downloading pre-converted model weights from a Hugging Face org by default (with an option to force reconversion), and refactors the Hugging Face publishing script/workflow to publish models individually (optionally including Apple exports).

Changes:

Default CLI download path now attempts to fetch pre-converted weights from a Hugging Face org, with --reconvert to force source conversion.
Refactors publish_to_hf.py to publish a single model per invocation (via --task export_model) and optionally export Apple weights (--apple).
Updates VAD default model ID to snakers4/silero-vad and modernizes the publish GitHub Action to accept workflow inputs + publish lists from env.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 4 comments.

File	Description
`tests/run.sh`	Updates default VAD model ID used by the test runner.
`python/src/publish_to_hf.py`	Refactors publishing into explicit tasks; adds per-model publish flow and optional Apple export zip generation.
`python/src/cli.py`	Adds HF pre-converted weights download flow and `--reconvert` flag plumbing across commands.
`.github/workflows/publish_to_hf.yml`	Adds workflow_dispatch inputs, moves model lists into env, publishes models in a loop, and updates org README separately.

Comments suppressed due to low confidence (2)

python/src/publish_to_hf.py:250

main() returns status codes, but __main__ calls main() without sys.exit(...), so the process will always exit 0 and the GitHub Action can succeed even when publishing fails. Import sys and call sys.exit(main()) (or otherwise propagate the return code).

if __name__ == "__main__":
    main()

python/src/publish_to_hf.py:107

The log message "Failed to export pro weights" is now triggered by the --apple path and is confusing. Also, export_pro_weights() can return None, which will raise inside shutil.move(...) and get swallowed. Consider explicitly handling the None case and updating the message to reflect Apple export, optionally including the underlying exception for debugging.

    if export_apple:
        try:
            mlpackage = export_pro_weights(model_id, bits)

            shutil.move(mlpackage, weights_out)

            model_pro_zip = stage / "weights" / f"{model_name_lower}-apple.zip"
            zip_dir(weights_out, model_pro_zip)

            fingerprint.update(sha256(model_pro_zip).encode())
            config["bits"] = bits
        except Exception:
            print("Failed to export pro weights")

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-02-16T06:17:00Z

python/src/cli.py

+        print_color(YELLOW, "Extracting model weights...")
+        with zipfile.ZipFile(zip_path, 'r') as zip_ref:
+            zip_ref.extractall(weights_dir)


zipfile.ZipFile.extractall() is used on a downloaded zip without validating member paths. A malicious zip can write outside weights_dir (Zip Slip). Validate that each member resolves within weights_dir (or implement a safe extraction helper) before extracting.

Copilot · 2026-02-16T06:17:01Z

python/src/cli.py

+        weights_dir.mkdir(parents=True, exist_ok=True)
+
+        print_color(YELLOW, "Extracting model weights...")
+        with zipfile.ZipFile(zip_path, 'r') as zip_ref:
+            zip_ref.extractall(weights_dir)


download_from_hf() extracts into weights_dir without clearing it first. If weights_dir already exists but is incomplete/corrupt (e.g., missing config.txt), this can mix old and new files and still fail later. Consider removing the directory (or extracting into a temp dir and atomically replacing) before extractall().

Copilot · 2026-02-16T06:17:01Z

python/src/cli.py

+    model_name = get_model_dir_name(model_id)
+    org = "cactus-compute"
+    repo_id = f"{org}/{model_id.split('/')[-1]}"


The Hugging Face org is hard-coded as cactus-compute here, while the publish workflow defaults to Cactus-Compute. If the namespace casing differs, downloads will fail. Consider making the org configurable (flag/env) and/or using a single constant shared with the publishing path.

Copilot · 2026-02-16T06:17:01Z

.github/workflows/publish_to_hf.yml

+      org:
+        description: Organization
+        required: false
+        default: Cactus-Compute


The workflow defaults the publish org to Cactus-Compute, but the CLI download path expects pre-converted weights under cactus-compute. If these differ on Hugging Face, publishing and downloads won’t line up. Align the default org (and casing) across tooling, or pass the same org value into both publishing and download logic.

Suggested change

default: Cactus-Compute

default: cactus-compute

* pre-converted model downloads from HuggingFace Signed-off-by: jakmro <[email protected]> * clean Signed-off-by: jakmro <[email protected]> --------- Signed-off-by: jakmro <[email protected]>

jakmro added 3 commits February 16, 2026 03:09

pre-converted model downloads from HuggingFace

4513583

Signed-off-by: jakmro <[email protected]>

Merge branch 'main' into cli_reconvert

67226be

Signed-off-by: jakmro <[email protected]>

clean

15c39ea

Signed-off-by: jakmro <[email protected]>

jakmro marked this pull request as ready for review February 16, 2026 06:12

Copilot AI review requested due to automatic review settings February 16, 2026 06:12

Copilot started reviewing on behalf of jakmro February 16, 2026 06:12 View session

Copilot AI reviewed Feb 16, 2026

View reviewed changes

HenryNdubuaku merged commit 1d6df24 into main Feb 16, 2026
7 of 8 checks passed

ncylich pushed a commit that referenced this pull request Feb 24, 2026

Cli reconvert (#357)

a171991

* pre-converted model downloads from HuggingFace Signed-off-by: jakmro <[email protected]> * clean Signed-off-by: jakmro <[email protected]> --------- Signed-off-by: jakmro <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cli reconvert#357

Cli reconvert#357
HenryNdubuaku merged 3 commits intomainfrom
cli_reconvert

jakmro commented Feb 16, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Feb 16, 2026

Uh oh!

Copilot AI Feb 16, 2026

Uh oh!

Copilot AI Feb 16, 2026

Uh oh!

Copilot AI Feb 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

jakmro commented Feb 16, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Feb 16, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 16, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 16, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 16, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants