
fix: truncation agent token calculations#915

Merged
kalvinnchau merged 7 commits into main from kalvin/truncate-agent-updates
Jan 30, 2025
Conversation

@kalvinnchau
Contributor

@kalvinnchau commented Jan 29, 2025

truncation agent token calculations

  • propagate and display any errors that come back from truncate_messages instead of ignoring them
  • add a note that users should restart in a fresh session if they hit the context limit
  • self.token_counter.count_tokens(&msg.as_concat_text()) was only counting content of type Message::Text, ignoring any ToolResponses, which tend to contain a large number of tokens
  • use self.token_counter.count_chat_tokens("", std::slice::from_ref(msg), &[]) to get a full count of the message tokens
  • update context_limit calculation to subtract the total amount of system_prompt and tools tokens already in use
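To illustrate the counting gap the first two bullets describe, here is a minimal, self-contained sketch. Message, Content, and TokenCounter are hypothetical stand-ins using a naive whitespace tokenizer, not goose's real types or tokenizer:

```rust
// Hypothetical stand-ins for illustration; goose's real Message and
// TokenCounter types differ.
enum Content {
    Text(String),
    ToolResponse(String),
}

struct Message {
    content: Vec<Content>,
}

impl Message {
    // Concatenates only Text content, mirroring the bug: ToolResponses
    // contribute zero tokens to the old count.
    fn as_concat_text(&self) -> String {
        self.content
            .iter()
            .filter_map(|c| match c {
                Content::Text(t) => Some(t.as_str()),
                Content::ToolResponse(_) => None,
            })
            .collect::<Vec<_>>()
            .join(" ")
    }
}

struct TokenCounter;

impl TokenCounter {
    // Naive whitespace "tokenizer", just for the sketch.
    fn count_tokens(&self, text: &str) -> usize {
        text.split_whitespace().count()
    }

    // Counts every content item, including tool responses.
    fn count_chat_tokens(&self, msg: &Message) -> usize {
        msg.content
            .iter()
            .map(|c| match c {
                Content::Text(t) => self.count_tokens(t),
                Content::ToolResponse(r) => self.count_tokens(r),
            })
            .sum()
    }
}

fn main() {
    let counter = TokenCounter;
    let msg = Message {
        content: vec![
            Content::Text("run the tests".into()),
            Content::ToolResponse("all 42 tests passed in 3.1s".into()),
        ],
    };
    // Old path misses the tool response entirely.
    println!("old: {}", counter.count_tokens(&msg.as_concat_text())); // 3
    // New path sees all 9 tokens.
    println!("new: {}", counter.count_chat_tokens(&msg)); // 9
}
```

With a large tool response (a file dump, a command's output), the old count could undercount a message by orders of magnitude, which is why truncation kept overshooting the context limit.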

…ssages

previously count_tokens(&msg.as_concat_text()) would not count
ToolResponses; update that to use count_chat_tokens on each individual
message

make count_tokens_for_tools a public method so we can account for tool
token counts

update the context_limit to account for the system_prompt and tool
request token counts
at this point we already know which message is being removed; the
total_tokens count isn't used after this point, so we just remove the
paired message
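The context_limit adjustment the commit messages describe boils down to simple subtraction; this sketch uses made-up token counts, and remaining_context_limit is a hypothetical helper name, not goose's API:

```rust
// Illustrative arithmetic only; real counts come from the tokenizer.
fn remaining_context_limit(
    model_context_limit: usize,
    system_prompt_tokens: usize,
    tools_tokens: usize,
) -> usize {
    // Tokens already committed before any conversation messages:
    // the system prompt plus the serialized tool definitions.
    // saturating_sub avoids underflow if the fixed overhead alone
    // exceeds the model's limit.
    model_context_limit.saturating_sub(system_prompt_tokens + tools_tokens)
}

fn main() {
    // e.g. a 128k-token model with a 1,200-token system prompt and
    // 2,300 tokens of tool definitions (all numbers invented)
    let limit = remaining_context_limit(128_000, 1_200, 2_300);
    println!("{limit}"); // 124500
}
```

Truncation then targets this reduced budget rather than the raw model limit, so the system prompt and tool definitions no longer eat into the space it thinks the messages can occupy.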
@kalvinnchau kalvinnchau changed the title truncation agent updates fix:truncation agent updates Jan 29, 2025
@kalvinnchau kalvinnchau changed the title fix:truncation agent updates fix:truncation agent token calculations Jan 29, 2025
let context_limit = remaining_tokens;

// Calculate current token count of each message, use count_chat_tokens to ensure we
// capture the full content of the message

let's mention tool responses in the comment


@salman1993 left a comment


lgtm!

@kalvinnchau kalvinnchau marked this pull request as ready for review January 29, 2025 23:58
// Calculate current token count
// Account for the system prompt and our tools input, and subtract that from the
// remaining context limit
let system_prompt_token_count = self.token_counter.count_tokens(system_prompt);

do we pass the system_prompt back and forth in the reply loop?

if we do, i wonder if subtracting:

count_tokens(system_prompt) * num_user_exchanges

would be accurate

Contributor Author


i believe we just pass it in once, not within the messages in the loop


@wendytang left a comment


nice!

@salman1993 salman1993 changed the title fix:truncation agent token calculations fix: truncation agent token calculations Jan 30, 2025
@kalvinnchau kalvinnchau merged commit ff71de4 into main Jan 30, 2025
@kalvinnchau kalvinnchau deleted the kalvin/truncate-agent-updates branch January 30, 2025 15:50
salman1993 added a commit that referenced this pull request Jan 30, 2025
* origin/main:
  fix: clarify linux cli install only (#927)
  feat: update ui for ollama host (#912)
  feat: add CONFIGURE=false option in install script (#920)
  fix: truncation agent token calculations (#915)
  fix: request payload for o1 models (#921)
  Update SupportedEnvironments.js so others don't get confused on why they can not open the macos app on x86 (#888)
  fix: improve configure process with error message (#919)
  docs: Goose on Windows via WSL (#901)
  fix: more graceful handling of missing usage in provider response (#907)
  feat: rm uv.lock cause it points to square artifactory (#917)
  feat: Update issue templates for bug report for goose (#913)
  fix: post endpoint url on sse endpoint event (#900)
michaelneale added a commit that referenced this pull request Jan 30, 2025
* main:
  chore: remove gpt-3.5-turbo UI suggestion, as it is deprecated (#959)
  chore: remove o1-mini suggestion from UI add model view (#957)
  fix: missing field in request (#956)
  docs: update provider docs, fix rate limit link (#943)
  fix: clarify linux cli install only (#927)
  feat: update ui for ollama host (#912)
  feat: add CONFIGURE=false option in install script (#920)
  fix: truncation agent token calculations (#915)
  fix: request payload for o1 models (#921)
michaelneale added a commit that referenced this pull request Jan 31, 2025
* main: (28 commits)
  ci: per semver build metadata should be after + (#971)
  fix: temp fix to make CI workflow pass (#970)
  chore: bump patch version to 1.0.3 (#967)
  fix: load shell automatically from env for GUI (#948)
  fix: update versions in release and canary workflows (#911)
  docs: fix typo, name (#963)
  docs: typo fix (#961)
  chore: remove gpt-3.5-turbo UI suggestion, as it is deprecated (#959)
  chore: remove o1-mini suggestion from UI add model view (#957)
  fix: missing field in request (#956)
  docs: update provider docs, fix rate limit link (#943)
  fix: clarify linux cli install only (#927)
  feat: update ui for ollama host (#912)
  feat: add CONFIGURE=false option in install script (#920)
  fix: truncation agent token calculations (#915)
  fix: request payload for o1 models (#921)
  Update SupportedEnvironments.js so others don't get confused on why they can not open the macos app on x86 (#888)
  fix: improve configure process with error message (#919)
  docs: Goose on Windows via WSL (#901)
  fix: more graceful handling of missing usage in provider response (#907)
  ...
cbruyndoncx pushed a commit to cbruyndoncx/goose that referenced this pull request Jul 20, 2025

3 participants