fixes breakage of Proposal A6: Retry policies #2577

vroldanbet · 2025-09-25T14:37:20Z

See https://github.com/grpc/proposal/blob/master/A6-client-retries.md#when-retries-are-valid

As I attempted to implement an example client using retry policies, I found that they didn't work.
This was due to the fact SpiceDB's response violated the expectations from that spec, notably:

In certain cases it is not valid to retry an RPC. These cases occur when the RPC has been committed, and thus it does not make sense to perform the retry.
The reasoning behind the first scenario is that the Response-Headers include initial metadata
from the server. The metadata (or its absence) it is transmitted to the client application. This may fundamentally change the state of the client, so we cannot safely retry if a failure occurs later in the RPC’s life.

The RequestID middleware was sending the requestID as a metadata header, which causes the stream to send a frame just to send the headers before sending the actual response. This, in turn, causes the stream to become committed, which, according to the A6 spec, disables retry policies.

The TL;DR is: use trailers for metadata unless there is an explicit reason to use metadata headers; otherwise, retry policies will become disabled.

This also fixes the version middleware, which was subject to the same problem.

⚠️ This is a breaking change because it moves two values from header to trailer. I'm aware zed makes use of this.

codecov · 2025-09-25T14:39:56Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 77.72%. Comparing base (b1ed3fd) to head (3f03e66).
⚠️ Report is 4 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #2577      +/-   ##
==========================================
- Coverage   77.76%   77.72%   -0.04%     
==========================================
  Files         440      440              
  Lines       54379    54378       -1     
==========================================
- Hits        42284    42260      -24     
- Misses       9481     9496      +15     
- Partials     2614     2622       +8

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

tstirrat15

LGTM; see comments

pkg/middleware/requestid/requestid.go

See https://github.com/grpc/proposal/blob/master/A6-client-retries.md#when-retries-are-valid As I was trying to implement an example client using retry policies, I found they didn't work. This was due to the fact SpiceDB response violated the expectations from that spec, notably: > In certain cases it is not valid to retry an RPC. These cases occur when the RPC has been committed, and thus it does not make sense to perform the retry. > The reasoning behind the first scenario is that the Response-Headers include initial metadata from the server. The metadata (or its absence) it is transmitted to the client application. This may fundamentally change the state of the client, so we cannot safely retry if a failure occurs later in the RPC’s life. The RequestID middleware was sending the requestID as a metadata header, which causes the stream to send a frame just to send the headers before sending the actual response. This in turn causes the stream to become committed, which as per A6 spec causes retry policies to be disabled. The TL;DR is: use trailers for metadata unless there is an explicit reason to use metadata headers, otherwise retry policies will become disabled.

just like with the requestID middleware, the solution is to turn it into a trailer instead of a header. This is a breaking change for zed.

tstirrat15

I really appreciate the regression test, and I'm glad you were able to surface this. Thank you!

tstirrat15 · 2025-10-01T18:31:45Z

I added a zed issue to update its version command accordingly.

vroldanbet requested a review from a team as a code owner September 25, 2025 14:37

github-actions bot added the area/tooling Affects the dev or user toolchain (e.g. tests, ci, build tools) label Sep 25, 2025

vroldanbet marked this pull request as draft September 25, 2025 14:55

tstirrat15 previously approved these changes Sep 25, 2025

View reviewed changes

pkg/middleware/requestid/requestid.go Outdated Show resolved Hide resolved

vroldanbet dismissed tstirrat15’s stale review via 3767950 October 1, 2025 17:28

vroldanbet force-pushed the requestid-mw-bug branch from 2c8c8d3 to 3767950 Compare October 1, 2025 17:28

github-actions bot added the area/cli Affects the command line label Oct 1, 2025

vroldanbet force-pushed the requestid-mw-bug branch 3 times, most recently from 43bd093 to 2ea3cd3 Compare October 1, 2025 17:55

vroldanbet added 3 commits October 1, 2025 19:03

test: demonstrate version middleware also breaks retry policies

783d5fc

fix(api): fixes version middleware breaking retry policies

3f03e66

just like with the requestID middleware, the solution is to turn it into a trailer instead of a header. This is a breaking change for zed.

vroldanbet force-pushed the requestid-mw-bug branch from 55568d1 to 3f03e66 Compare October 1, 2025 18:03

vroldanbet marked this pull request as ready for review October 1, 2025 18:03

vroldanbet requested a review from tstirrat15 October 1, 2025 18:03

vroldanbet self-assigned this Oct 1, 2025

vroldanbet requested a review from josephschorr October 1, 2025 18:05

vroldanbet enabled auto-merge October 1, 2025 18:20

tstirrat15 mentioned this pull request Oct 1, 2025

Update version command to look at both headers and trailers authzed/zed#552

Closed

tstirrat15 approved these changes Oct 1, 2025

View reviewed changes

vroldanbet added this pull request to the merge queue Oct 1, 2025

Merged via the queue into main with commit 5748bec Oct 1, 2025
46 checks passed

vroldanbet deleted the requestid-mw-bug branch October 1, 2025 18:42

github-actions bot locked and limited conversation to collaborators Oct 1, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fixes breakage of Proposal A6: Retry policies #2577

fixes breakage of Proposal A6: Retry policies #2577

Uh oh!

vroldanbet commented Sep 25, 2025 •

edited

Loading

Uh oh!

codecov bot commented Sep 25, 2025 •

edited

Loading

Uh oh!

tstirrat15 left a comment

Uh oh!

Uh oh!

tstirrat15 left a comment

Uh oh!

tstirrat15 commented Oct 1, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fixes breakage of Proposal A6: Retry policies #2577

fixes breakage of Proposal A6: Retry policies #2577

Uh oh!

Conversation

vroldanbet commented Sep 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Sep 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

tstirrat15 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

tstirrat15 left a comment

Choose a reason for hiding this comment

Uh oh!

tstirrat15 commented Oct 1, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

vroldanbet commented Sep 25, 2025 •

edited

Loading

codecov bot commented Sep 25, 2025 •

edited

Loading