test(ilp): fix race condition in testSymbolCapacityReloadFuzz() by jerrinot · Pull Request #6389 · questdb/questdb

jerrinot · 2025-11-13T09:21:42Z

Easy to reproduce spurious failure. It's sufficient to inject randomized pauses to Sender threads:

for (int i = 0; i < N; i++) {
    sender1.table("mytable")
            .symbol("sym1", rnd1.nextString(10))
            .symbol("sym2", rnd1.nextString(2))
            .doubleColumn("dd", rnd1.nextDouble())
            .atNow();
    if (i % 1000 == 0) {
        Os.sleep(ThreadLocalRandom.current().nextInt(10));
    }
}
sender1.flush();

With this extra sleep the test fails consistently on my local.

Root cause: The test uses ILP@TCP - so sender.flush() only guarantees ILP lines are successfully written to TCP send buffer. There can be an arbitrary long pause between the flush and when the just flushed lines are observed by TCP receiver, let alone WAL apply job.

The fix acknowledges this delay and relaxes the assertion. It also runs the WalApply job even after the main WAL Apply thread is done, becasue the WAL Apply Thread might finish before it had a chance to observe all lines.

Easy to reproduce spurious failure. It's sufficient to inject randomized pauses to Sender threads: ```java for (int i = 0; i < N; i++) { sender1.table("mytable") .symbol("sym1", rnd1.nextString(10)) .symbol("sym2", rnd1.nextString(2)) .doubleColumn("dd", rnd1.nextDouble()) .atNow(); if (i % 1000 == 0) { Os.sleep(ThreadLocalRandom.current().nextInt(10)); } } sender1.flush(); ``` With this extra sleep the test fails consistently on my local. Root cause: The test uses ILP@TCP - so sender.flush() only guarantees ILP lines are successfully written to TCP send buffer. There can be an arbitrary long pause between the flush and when the just flushed lines are observed by TCP receiver, let alone WAL apply job. The fix acknowledges this delay and relaxes the assertion. It also runs the WalApply job even after the main WAL Apply thread is done, becasue the WAL Apply Thread might finish before it had a chance to observe all lines.

coderabbitai · 2025-11-13T09:21:49Z

Important

Review skipped

Auto reviews are disabled on this repository.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Walkthrough

A test in LineTcpSenderTest.java was refactored to replace explicit latch and pool listener-based synchronization with an assertEventually pattern. The testSymbolCapacityReloadFuzz method now uses eventual assertion to validate table state after draining the WAL queue instead of waiting for pool return events.

Changes

Cohort / File(s)	Summary
Test synchronization refactoring `core/src/test/java/io/questdb/test/cutlass/line/tcp/LineTcpSenderTest.java`	Added import for `TestUtils.assertEventually`. Removed `writerIsBack` latch and pool listener tracking `EV_RETURN` events in `testSymbolCapacityReloadFuzz`. Replaced with eventual assertion that drains WAL queue and validates final table count via SQL assertion.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~10 minutes

Verify that the assertEventually-based approach correctly replaces the latch/listener synchronization logic
Confirm the WAL queue draining and table count validation are equivalent to the previous behavior
Check that the timeout and polling behavior of assertEventually is appropriate for this test scenario

Suggested reviewers

puzpuzpuz
bluestreak01

Pre-merge checks and finishing touches

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%.	You can run `@coderabbitai generate docstrings` to improve docstring coverage.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title clearly and specifically describes the main change: fixing a race condition in testSymbolCapacityReloadFuzz(), which directly matches the file modified and the core objective of the PR.
Description check	✅ Passed	The description is directly related to the changeset, providing detailed context about the race condition, reproduction steps, root cause analysis, and the specific fixes applied to the test.

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

jerrinot · 2025-11-13T09:45:53Z

@coderabbitai review pls

coderabbitai · 2025-11-13T09:46:08Z

@jerrinot: I'll review the changes in this PR.

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

jerrinot added the Bug Incorrect or unexpected behavior label Nov 13, 2025

jerrinot mentioned this pull request Nov 13, 2025

perf(sql): improve SQL latency by moving munmap() to a background job #6386

Merged

Merge branch 'master' into jh_flaky_testSymbolCapacityReloadFuzz

0687a20

ideoma approved these changes Nov 13, 2025

View reviewed changes

bluestreak01 merged commit 813dc07 into master Nov 13, 2025
40 checks passed

bluestreak01 deleted the jh_flaky_testSymbolCapacityReloadFuzz branch November 13, 2025 18:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test(ilp): fix race condition in testSymbolCapacityReloadFuzz()#6389

test(ilp): fix race condition in testSymbolCapacityReloadFuzz()#6389
bluestreak01 merged 2 commits intomasterfrom
jh_flaky_testSymbolCapacityReloadFuzz

jerrinot commented Nov 13, 2025

Uh oh!

coderabbitai bot commented Nov 13, 2025 •

edited

Loading

Review skipped

Uh oh!

jerrinot commented Nov 13, 2025

Uh oh!

coderabbitai bot commented Nov 13, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

jerrinot commented Nov 13, 2025

Uh oh!

coderabbitai bot commented Nov 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Walkthrough

Changes

Estimated code review effort

Suggested reviewers

Pre-merge checks and finishing touches

Uh oh!

jerrinot commented Nov 13, 2025

Uh oh!

coderabbitai bot commented Nov 13, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

coderabbitai bot commented Nov 13, 2025 •

edited

Loading