
fix: make FunctionGemma prompt formatting strict #502

Merged
HenryNdubuaku merged 1 commit into cactus-compute:main from lennartvoelz:fix/issue#501
Mar 7, 2026
Conversation

@lennartvoelz
Contributor

  • Remove hardcoded “When you decide to call…” guidance and arg example from developer turn
  • Fix FunctionGemma trigger string (no trailing period) and append tools declarations directly
  • Wrap tool responses in a developer turn and allow stacking multiple tool responses
  • Stop wrapping tool outputs in value:; pass through {...} payload as-is
  • Close pending tool-response developer turn before next user/model turn to avoid malformed prompts

Follows the official docs
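A minimal sketch of the prompt-assembly rules the bullets above describe: tool declarations appended directly after a trigger string with no trailing period, tool responses stacked inside a single developer turn with their `{...}` payloads passed through as-is, and any pending tool-response turn closed before the next user/model turn. The turn markers and the trigger wording here are illustrative assumptions, not the exact tokens from the FunctionGemma template or this PR's implementation.

```python
import json

# Assumed trigger string; note: no trailing period.
TRIGGER = "You have access to the following functions"


def build_prompt(messages, tools):
    parts = []
    pending_tool_turn = False  # True while a tool-response developer turn is open

    # Tool declarations are appended directly after the trigger string.
    decls = "\n".join(json.dumps(t) for t in tools)
    parts.append(f"<start_of_turn>developer\n{TRIGGER}\n{decls}<end_of_turn>\n")

    for msg in messages:
        if msg["role"] == "tool":
            # Stack multiple tool responses inside one developer turn,
            # passing each {...} payload through as-is (no `value:` wrapper).
            if not pending_tool_turn:
                parts.append("<start_of_turn>developer\n")
                pending_tool_turn = True
            parts.append(msg["content"] + "\n")
        else:
            # Close a pending tool-response developer turn before the next
            # user/model turn so the prompt stays well-formed.
            if pending_tool_turn:
                parts.append("<end_of_turn>\n")
                pending_tool_turn = False
            role = "model" if msg["role"] == "assistant" else "user"
            parts.append(f"<start_of_turn>{role}\n{msg['content']}<end_of_turn>\n")

    if pending_tool_turn:
        parts.append("<end_of_turn>\n")
    return "".join(parts)
```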

…mple, fix tool response wrapping)

Signed-off-by: Lennart <[email protected]>
@HenryNdubuaku
Collaborator

@lennartvoelz thanks for this! One thing: this fails more tool-call tests than the old setup, any insights as to why?

@lennartvoelz
Contributor Author

lennartvoelz commented Mar 6, 2026

What did you test it against? @HenryNdubuaku

@lennartvoelz
Contributor Author

lennartvoelz commented Mar 7, 2026

@HenryNdubuaku

For the standard weights, I would say it is just noise, but there are a few things off with the FunctionGemma setup right now. I'll push some changes later that make the model much more stable. As mentioned, I tested against the Hackathon dataset. With my changes, the effect is especially visible for fine-tuned models.
Fine-tuned & new setup: [screenshot 2026-03-07 14:13:23]
Fine-tuned & old setup: [screenshot 2026-03-07 14:13:47]
Base weights & new setup: [screenshot 2026-03-07 14:14:27]
Base weights & old setup: [screenshot 2026-03-07 14:14:02]

With the changes in the Gemma implementation, the effect of the extra tokens on latency is much smaller. However, I believe it is better practice to keep it standard (which also works better in this case!).
I am still working on reducing the execution time while maintaining the much better sampling quality.

@HenryNdubuaku
Collaborator

thanks @lennartvoelz I will merge this now, thanks for testing

@HenryNdubuaku HenryNdubuaku merged commit f8b714c into cactus-compute:main Mar 7, 2026
5 of 6 checks passed
