feat(agent): memory condensation for longer context #1457
kalvinnchau merged 14 commits into block:main
Conversation
The previous implementation is actually buggy when the context length is greater than `2 * model_limit`. We should apply incremental summarization.
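A minimal sketch of what incremental summarization could look like. The function and tokenizer names here (`token_len`, `summarize_chunk`) are hypothetical placeholders, not the PR's actual code; the point is that folding the history into the summary chunk by chunk keeps every individual summarization call under the model limit, even when the full context exceeds `2 * model_limit`:

```rust
// Crude stand-in for a real tokenizer: count whitespace-separated words.
fn token_len(s: &str) -> usize {
    s.split_whitespace().count()
}

// Placeholder for a model call that condenses `summary` + `chunk`.
// A real implementation would call the LLM; here we just concatenate.
fn summarize_chunk(summary: &str, chunk: &str) -> String {
    format!("{} | {}", summary, chunk)
}

fn incremental_summarize(messages: &[String], model_limit: usize) -> String {
    let mut summary = String::new();
    let mut chunk = String::new();
    for msg in messages {
        // Flush the chunk before it (plus the running summary) would
        // overflow the limit for a single summarization call.
        if token_len(&summary) + token_len(&chunk) + token_len(msg) > model_limit {
            summary = summarize_chunk(&summary, &chunk);
            chunk.clear();
        }
        chunk.push_str(msg);
        chunk.push(' ');
    }
    if !chunk.is_empty() {
        summary = summarize_chunk(&summary, &chunk);
    }
    summary
}
```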
I think it would be more efficient to save the compressed chat in the log file rather than compress the long chat every time. Since this isn't directly related to compression strategies, I will open another PR to implement it.
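One way the cached summary could be kept valid is to key it on the exact messages it covers. This is only an illustrative sketch of that idea (the `CachedSummary` type and function names are made up, not goose's code):

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

// Hypothetical persisted record: the summary plus a hash of the
// messages it was computed over.
struct CachedSummary {
    covered_hash: u64,
    summary: String,
}

fn hash_messages(messages: &[String]) -> u64 {
    let mut h = DefaultHasher::new();
    messages.hash(&mut h);
    h.finish()
}

// Reuse the stored summary only if it was computed over exactly these
// messages; otherwise the caller must re-summarize.
fn summary_for<'a>(cache: Option<&'a CachedSummary>, messages: &[String]) -> Option<&'a str> {
    cache
        .filter(|c| c.covered_hash == hash_messages(messages))
        .map(|c| c.summary.as_str())
}
```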
I like that idea - we had something like this in the old Python version (but it would summarize each time, which loses prompt caching). Any reason why truncate.rs is changed? Would this ideally be a completely separate alternative?
Oh, I modified truncate.rs to fit in the uniform
I have undone the unnecessary changes :)
I like the idea here too, but I do think this should be a separate agent from truncate, since this introduces a large change to the default behavior. It would be found and set via:

```
❯ goose agents
Available agent versions:
* truncate (default)
  reference
```

```
❯ cat ~/.config/goose/config.yaml
GOOSE_AGENT: "summarize"
```
Thanks for your guidance! I have moved the feature to a separate "summarize" agent, and also tweaked the CLI so it displays the current agent version:

```
❯ ./target/debug/goose agents
Available agent versions:
  reference
* summarize
  truncate (default)
```
nice, yes that seems clearer now. @alexhancock @baxen @salman1993 I think this is interesting and potentially low risk to try. Especially now that we have sessions across GUI and CLI, people are likely to come across larger and larger sessions, so having some summarization would be interesting to try.
That's cool! Please let me know if there's anything else I could do.
michaelneale
left a comment
I like this - it adds a new agent for people to try, and I like the sound of this approach; seems worth a try.
* main:
  * bugfix: refactor workdirs to be async-safe, and simpler (#1558)
  * feat: split required_extensions in bench to builtin/external (#1547)
  * fix: continue to use resumed session after confirmation is cancelled (#1548)
  * feat: add image tool to developer mcp (#1515)
  * docs: using gooseignore (#1554)
  * ci: use cargo update --workspace to ensure Cargo.lock is updated (#1539)
  * fix: respond to interrupted tool calls with a ToolResponseMessageContent (#1557)
  * fix: get tool def back to chat mode (#1538)
  * ui: add default icon (#1553)
  * fix: fix summarize agent, use session_id and add provider fn (#1552)
  * feat(agent): memory condensation for longer context (#1457)
  * docs: goose tips blog (#1550)
  * docs: update to provider view (#1546)
  * docs: resuming sessions (#1543)
  * feat: goose bench framework for functional and regression testing
  * feat: use refresh_tokens from databricks api (#1517)
  * feat: use Ctrl/Cmd + ↑/↓ to navigate message history (#1501)
  * feat: remove tools from chat mode (#1533)
  * feat: use dropdown for goose selection (#1531)
  * docs: goosehints in desktop (#1529)
When the context grows beyond the model's limit, the current implementation cuts off older messages. This PR introduces a more graceful solution that lets the model itself summarize the earlier chat history and use the summary as part of the context, enabling the model to retain crucial information for longer.
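The contrast between the two strategies can be sketched like this. Everything here is illustrative (the `condense` function and the `[summary]` marker are assumptions, not goose's actual implementation): truncation drops the oldest messages outright, while summarization replaces them with a single model-generated summary message so their information is retained.

```rust
// Illustrative sketch: keep the most recent `limit - 1` messages
// verbatim and fold the older ones into one summary message at the
// front, instead of simply discarding them as truncation would.
fn condense(
    messages: Vec<String>,
    limit: usize,
    summarize: impl Fn(&[String]) -> String, // stand-in for an LLM call
) -> Vec<String> {
    if messages.len() <= limit {
        return messages;
    }
    let split = messages.len() - (limit - 1);
    let (old, recent) = messages.split_at(split);
    let mut out = vec![format!("[summary] {}", summarize(old))];
    out.extend_from_slice(recent);
    out
}
```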