Fix/issue#490#491

Merged
HenryNdubuaku merged 3 commits into cactus-compute:main from lennartvoelz:fix/issue#490
Mar 4, 2026
Conversation

@lennartvoelz
Contributor

@lennartvoelz lennartvoelz commented Mar 4, 2026

This pull request fixes issue #490.

The cactus convert pipeline broke in two related ways when working with merged LoRA models:

Tokenizer detection failed: The tokenizer lookup relied on model_id being a Hugging Face repo identifier (e.g. google/functiongemma-270m-it). When merging a LoRA adapter, model_id is set to a local filesystem path, which poisoned the name-based detection logic (e.g. special-token handling for Gemma models) and caused the SentencePiece model lookup to fail entirely.

Merged model treated as HF repo: After merging, the merged weights are written to a temporary directory. The conversion pipeline then attempted to "download" this temp directory as if it were a remote Hugging Face repo ID.
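The first failure mode can be illustrated with a minimal sketch (the function and paths here are hypothetical, not the actual cactus code): substring checks on model_id work for HF repo ids but silently fall through when model_id is a merged-LoRA temp directory.

```python
import os

def detect_family(model_id: str) -> str:
    # Name-based detection: fine for HF repo ids like
    # "google/functiongemma-270m-it", which contain the model name.
    if "gemma" in model_id.lower():
        return "gemma"
    return "generic"

# HF repo id: detection works as intended.
assert detect_family("google/functiongemma-270m-it") == "gemma"

# Merged-LoRA case: model_id is a temp path that carries no model name,
# so Gemma-specific special-token handling is silently skipped.
merged_dir = os.path.join("/tmp", "cactus_merge_ab12cd")
assert detect_family(merged_dir) == "generic"
```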

Fix

Tokenizer (fix: tokenizer detection failed for LoRA merged models)

  • Introduced a separate name variable for model-name-based detection, decoupled from model_id (which may be a local path)
  • The SentencePiece model is now copied from the LoRA directory directly, with a fallback to the base model in the HF cache
  • Tokenizer config is read locally when available
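The first two bullets can be sketched as follows. This is an illustrative reconstruction, not the merged code: the function name, signature, and the simplified HF cache layout are all assumptions.

```python
import os
import shutil

def resolve_tokenizer(model_id: str, lora_dir: str, base_model: str,
                      out_dir: str) -> str:
    # Decouple name-based detection from model_id: after a LoRA merge,
    # model_id is a local temp path, so fall back to the base model's
    # repo id for name-based logic (e.g. Gemma special tokens).
    name = base_model if os.path.isdir(model_id) else model_id

    # Prefer the SentencePiece model shipped alongside the LoRA adapter.
    candidate = os.path.join(lora_dir, "tokenizer.model")
    if not os.path.isfile(candidate):
        # Fallback: the base model's copy in the HF cache
        # (cache path layout simplified here for illustration).
        cache = os.path.expanduser("~/.cache/huggingface/hub")
        candidate = os.path.join(
            cache, "models--" + base_model.replace("/", "--"),
            "tokenizer.model")
    shutil.copy(candidate, os.path.join(out_dir, "tokenizer.model"))
    return name
```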

LoRA merge path handling (fix: treat merged LoRA temp dir as local model path)

  • Added a check for whether model_id is a local directory before invoking the download/conversion step
  • If it is a local path (i.e. the merged temp directory), the path is passed directly into weight conversion, skipping the HF download logic entirely
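The dispatch described above amounts to a small guard; a minimal sketch, with an illustrative stand-in for the actual HF download helper:

```python
import os

def download_from_hub(repo_id: str) -> str:
    # Stand-in for the real Hub download step
    # (e.g. huggingface_hub.snapshot_download).
    raise RuntimeError("would download " + repo_id + " from the Hub")

def prepare_weights(model_id: str) -> str:
    # If model_id is a local directory (the merged-LoRA temp dir),
    # pass it straight to weight conversion and skip the HF download.
    if os.path.isdir(model_id):
        return model_id
    return download_from_hub(model_id)
```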

Impact of the fixes

Before:
[Screenshot 2026-03-04 at 15:10:32]

After:
[Screenshot 2026-03-04 at 15:12:44]

-> The output now also shows the special Gemma tokens.

I ran the test suite and everything passes.

When `cactus convert` merges a LoRA adapter, it saves the merged model to a temp
directory and tries to “download” it as if it were a Hugging Face repo id.
The fix checks whether model_id is a local directory and, if so, passes the temp path directly into weight conversion, skipping the HF download path.

Signed-off-by: Lennart <[email protected]>
- copy SentencePiece model from LoRA directory (or base model in the HF cache as a fallback) and read the tokenizer config locally if possible
- the model_id is a path in the merge case, which poisoned the whole conversion chain -> replaced with a separate name variable for name-based detection (e.g. Gemma special tokens)

Signed-off-by: Lennart <[email protected]>
@lennartvoelz
Contributor Author

I added one more fix so that the tokenizer config is also copied from the LoRA directory if present.
[Screenshot 2026-03-04 at 15:32:14]

@HenryNdubuaku
Collaborator

thanks for this @lennartvoelz !!!

@HenryNdubuaku HenryNdubuaku merged commit f2307c7 into cactus-compute:main Mar 4, 2026
1 check passed