[xDS e2e test] fix flake in HcmConfigUpdatedWithoutRdsChange test #39810

Closed
markdroth wants to merge 1 commit into grpc:master from markdroth:xds_cluster_e2e_test_flake_fix

@markdroth
Member

The test was assuming that as soon as the ADS server saw the client's ACK, the client had already switched to the new config. In practice, that's not a good assumption, because the client will send the ACK before it actually swaps the new config into place. So instead, we simply send RPCs until we detect that the client has seen the change.
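The race described above can be illustrated with a small, self-contained sketch. It is not the gRPC test code: `FakeXdsClient`, `send_rpcs_until`, and the RPC-counted delay are hypothetical names and simplifications chosen only to show why "ACK seen" does not imply "config in use", and why polling with RPCs is the more robust check.

```python
from typing import Callable, Optional


class FakeXdsClient:
    """Hypothetical stand-in for a client that ACKs before applying a config."""

    def __init__(self, initial_config: str, apply_delay_rpcs: int) -> None:
        self._active = initial_config
        self._pending: Optional[str] = None
        self._apply_delay_rpcs = apply_delay_rpcs
        self._rpcs_until_swap = 0

    def receive_update(self, new_config: str) -> str:
        # The ACK is returned immediately; the swap happens later.
        self._pending = new_config
        self._rpcs_until_swap = self._apply_delay_rpcs
        return "ACK"

    def send_rpc(self) -> str:
        # Each RPC advances the simulated delay and reports which config
        # handled it (assumption: delay is counted in RPCs, not wall time).
        if self._pending is not None:
            if self._rpcs_until_swap == 0:
                self._active = self._pending
                self._pending = None
            else:
                self._rpcs_until_swap -= 1
        return self._active


def send_rpcs_until(
    client: FakeXdsClient,
    predicate: Callable[[str], bool],
    max_attempts: int = 100,
) -> int:
    """Send RPCs until predicate(result) holds; return the attempts used."""
    for attempt in range(1, max_attempts + 1):
        if predicate(client.send_rpc()):
            return attempt
    raise TimeoutError(f"change not observed after {max_attempts} RPCs")


client = FakeXdsClient(initial_config="old", apply_delay_rpcs=3)
ack = client.receive_update("new")

# Racy check: the ACK has arrived, yet this RPC still uses the old config.
racy_result = client.send_rpc()

# Robust check: keep sending RPCs until the new config is actually observed.
attempts = send_rpcs_until(client, lambda result: result == "new")
```

In a real test the bound would more likely be a wall-clock deadline than an attempt count; the attempt count is used here only to keep the sketch deterministic.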

I believe this flake was triggered by #39736, but that PR wasn't at fault; it just changed the timing enough to expose the race condition in this test.

markdroth requested a review from eugeneo Jun 9, 2025 22:31
markdroth added the "release notes: no" label (indicates the PR should not be in release notes) Jun 9, 2025
markdroth deleted the xds_cluster_e2e_test_flake_fix branch Jun 10, 2025 15:35
anniefrchz pushed a commit to anniefrchz/grpc that referenced this pull request Jun 25, 2025
…pc#39810)

The test was assuming that as soon as the ADS server saw the client's ACK, the client had already switched to the new config.  In practice, that's not a good assumption, because the client will send the ACK before it actually swaps the new config into place.  So instead, we simply send RPCs until we detect that the client has seen the change.

I believe this flake was triggered by grpc#39736, but that PR wasn't at fault; it just changed the timing enough to expose the race condition in this test.

Closes grpc#39810

COPYBARA_INTEGRATE_REVIEW=grpc#39810 from markdroth:xds_cluster_e2e_test_flake_fix 5d7f84d
PiperOrigin-RevId: 769643991
paulosjca pushed a commit to paulosjca/grpc that referenced this pull request Aug 23, 2025