fix(a2a): drain output_rx until Flush to prevent response shift (#2326, #2313) by bug-ops · Pull Request #2328 · bug-ops/zeph

bug-ops · 2026-03-28T09:03:19Z

Summary

bug(a2a): message/send responses shifted by one — request N receives response to request N-1 #2326: Replace the non-blocking try_recv drain in AgentTaskProcessor::process() with a blocking recv().await-until-Flush loop. The previous drain was a TOCTOU race: the agent loop continued emitting Usage/Flush tail events after try_recv had already returned empty, leaving them buffered in output_rx for the next message/send call. Each subsequent request then consumed the previous response, producing the observed one-position shift. Flush is the universal end-of-turn sentinel across all agent code paths.
fix(mcp): tx.send() error in embedding guard warm path silently discarded #2313: Replace let _ = tx.send(result) with an is_err() check and tracing::warn!() in EmbeddingAnomalyGuard::check_async() — both the cold-start and warm async paths.
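
The race described for #2326 can be sketched with a synchronous analogue. This is a minimal illustration using `std::sync::mpsc` and threads, not the actual `AgentTaskProcessor::process()` code, which is async. The `Event` enum and the `drain_nonblocking`, `drain_until_flush`, and `spawn_turn` helpers are hypothetical names introduced only for this sketch.

```rust
use std::sync::mpsc::{self, Receiver};
use std::thread;
use std::time::Duration;

/// Hypothetical stand-in for the agent's output events.
#[derive(Debug, PartialEq)]
enum Event {
    FullMessage(String),
    Usage(u32),
    Flush,
}

/// Racy drain: stops as soon as the channel is momentarily empty,
/// so tail events sent later stay buffered for the next request.
fn drain_nonblocking(rx: &Receiver<Event>) -> usize {
    let mut drained = 0;
    while rx.try_recv().is_ok() {
        drained += 1;
    }
    drained
}

/// Blocking drain: waits for the Flush sentinel, or exits when every
/// sender has been dropped (the channel-close fallback).
fn drain_until_flush(rx: &Receiver<Event>) -> usize {
    let mut drained = 0;
    loop {
        match rx.recv() {
            Ok(Event::Flush) => return drained + 1,
            Ok(_) => drained += 1,
            Err(_) => return drained, // channel closed before Flush
        }
    }
}

/// Simulates one agent turn: the reply arrives at once, the
/// Usage/Flush tail arrives after a short delay.
fn spawn_turn(tx: mpsc::Sender<Event>, reply: &str) -> thread::JoinHandle<()> {
    let reply = reply.to_string();
    thread::spawn(move || {
        tx.send(Event::FullMessage(reply)).unwrap();
        thread::sleep(Duration::from_millis(50));
        tx.send(Event::Usage(42)).unwrap();
        tx.send(Event::Flush).unwrap();
    })
}
```

Calling `drain_nonblocking` right after reading the `FullMessage` would most likely return before the delayed tail is sent, which is the shift this PR removes. `drain_until_flush` returns only once `Flush` has been consumed or the channel has closed.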

Test plan

cargo +nightly fmt --check passes
cargo clippy --features full --workspace -- -D warnings passes (0 warnings)
cargo nextest run --workspace --features full --lib --bins — 6927 passed (+1 new test)
New test a2a_response_shift_drain_until_flush_prevents_leak verifies the drain loop consumes FullMessage + delayed Usage + Flush and leaves the channel empty before the next request
Follow-up: consider filing P3 issue for a configurable drain timeout to guard against agent loop panics mid-turn (currently recv().await falls back to None on channel close)
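
One possible shape for the drain timeout mentioned in the follow-up, again as a synchronous sketch on `std::sync::mpsc` rather than the async channel the PR touches. `DrainOutcome`, `drain_with_deadline`, and the `budget` parameter are assumptions made for illustration; nothing here reflects what #2329 proposes in detail.

```rust
use std::sync::mpsc::{self, Receiver, RecvTimeoutError};
use std::time::{Duration, Instant};

/// Hypothetical stand-in for the agent's output events.
#[derive(Debug, PartialEq)]
enum Event {
    FullMessage(String),
    Usage(u32),
    Flush,
}

/// Why the drain loop stopped.
#[derive(Debug, PartialEq)]
enum DrainOutcome {
    Flushed,
    Closed,
    TimedOut,
}

/// Drains until Flush, but gives up once `budget` has elapsed so a
/// producer that stalls mid-turn cannot block the caller forever.
fn drain_with_deadline(rx: &Receiver<Event>, budget: Duration) -> DrainOutcome {
    let deadline = Instant::now() + budget;
    loop {
        // Time left on the overall budget; zero once the deadline passed.
        let remaining = deadline.saturating_duration_since(Instant::now());
        match rx.recv_timeout(remaining) {
            Ok(Event::Flush) => return DrainOutcome::Flushed,
            Ok(_) => continue, // tail event, keep draining
            Err(RecvTimeoutError::Disconnected) => return DrainOutcome::Closed,
            Err(RecvTimeoutError::Timeout) => return DrainOutcome::TimedOut,
        }
    }
}
```

Using a single deadline for the whole drain, rather than a per-event timeout, bounds the total wait regardless of how many tail events arrive.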

Replace the non-blocking try_recv drain with a blocking recv-until-Flush loop after process() exits on FullMessage. The previous try_recv was a TOCTOU race: the agent loop continued emitting Usage/Flush tail events after the drain completed, leaving them buffered for the next request and causing each message/send to return the previous request's response. Flush is the universal end-of-turn sentinel across all agent paths. The drain loop handles channel close (None) as a fallback exit.

Also fix a silent tx.send() error discard in EmbeddingAnomalyGuard check_async(): log a tracing::warn when the result channel is closed instead of silently swallowing the error (#2313).

Fixes #2326, #2313
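
The #2313 change amounts to checking the send result instead of discarding it. A minimal sketch of that pattern, assuming a plain `std::sync::mpsc` sender and `eprintln!` in place of `tracing::warn!()`; the `send_or_warn` helper and its `path` label are hypothetical and are not part of `EmbeddingAnomalyGuard`.

```rust
use std::sync::mpsc;

/// Sends `result` and reports a closed receiver instead of ignoring it.
/// Returns `true` when the value was delivered to the channel.
fn send_or_warn<T>(tx: &mpsc::Sender<T>, result: T, path: &str) -> bool {
    // Before: `let _ = tx.send(result);` hid the failure entirely.
    if tx.send(result).is_err() {
        // Stand-in for tracing::warn!(); the real code logs via tracing.
        eprintln!("warn: result dropped on {} path: receiver closed", path);
        return false;
    }
    true
}
```

Calling `send_or_warn(&tx, value, "warm")` after the receiver has been dropped returns `false` and emits the warning, where the discarded form would have given no signal at all.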

github-actions bot added the documentation, rust, bug, and size/M (Medium PR, 51-200 lines) labels Mar 28, 2026

This was linked to issues Mar 28, 2026

fix(mcp): tx.send() error in embedding guard warm path silently discarded #2313

Closed

bug(a2a): message/send responses shifted by one — request N receives response to request N-1 #2326

Closed

bug-ops enabled auto-merge (squash) March 28, 2026 09:03

bug-ops mentioned this pull request Mar 28, 2026

fix(a2a): add configurable drain timeout in AgentTaskProcessor to guard against agent loop crash mid-turn #2329

Closed

bug-ops merged commit 9b4a67a into main Mar 28, 2026
25 checks passed

bug-ops deleted the 2326-a2a-message-send-shifted branch March 28, 2026 09:10

bug-ops merged 1 commit into main from 2326-a2a-message-send-shifted
