perf(core): optimize mat view writing by skipping transactions that are fully replaced by future commits or full refresh by ideoma · Pull Request #6416 · questdb/questdb

ideoma · 2025-11-19T14:42:45Z

Pull Request Overview

This PR introduces an optimization to skip processing WAL (Write-Ahead Log) transactions with replace ranges that are fully covered by future transactions or if a TRUNCATE operation is visible in the future transactions, improving performance when handling multiple overlapping replace range operations.

Also, this PR fixed a few minor issues:

Removes error logging about lock errors of wa*.lock files; these are now logged as a debug category
Fixes wrong writerTxn in wal_tables query, sometimes sporadically observed.

…re fully replaced by future commits

ideoma · 2025-11-19T14:43:00Z

@coderabbitai review

coderabbitai · 2025-11-19T14:43:08Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

core/src/main/java/io/questdb/cairo/TableWriter.java

Copilot

Pull Request Overview

This PR introduces an optimization to skip processing WAL (Write-Ahead Log) transactions with replace ranges that are fully covered by future transactions, improving performance when handling multiple overlapping replace range operations.

Key changes:

Added isReplaceRangeCoveredByFutureTransactions() method to detect when a transaction's replace range is superseded by a future transaction
Integrated the optimization check into processWalCommit() to skip redundant transaction processing
Added stress test testStressReplaceLastMinuteRepeatedly() to validate the optimization with 1,000 consecutive replace operations

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 7 comments.

File	Description
core/src/main/java/io/questdb/cairo/TableWriter.java	Implements the skip optimization logic for replace range transactions, including commented-out experimental code and the new coverage detection method
core/src/test/java/io/questdb/test/cairo/wal/WalWriterReplaceRangeTest.java	Adds stress test for repeated replace range operations to validate the optimization works correctly under high load

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

core/src/test/java/io/questdb/test/cairo/wal/WalWriterReplaceRangeTest.java

core/src/main/java/io/questdb/cairo/TableWriter.java

Co-authored-by: Copilot <[email protected]>

+ minor fixes

ideoma · 2025-11-20T17:52:58Z

@coderabbitai review

coderabbitai · 2025-11-20T17:53:04Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai

Actionable comments posted: 4

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 5555187 and 7547af1.

📒 Files selected for processing (6)

core/src/main/java/io/questdb/cairo/TableUtils.java (2 hunks)
core/src/main/java/io/questdb/cairo/TableWriter.java (1 hunks)
core/src/main/java/io/questdb/cairo/wal/ApplyWal2TableJob.java (4 hunks)
core/src/main/java/io/questdb/cairo/wal/WalPurgeJob.java (1 hunks)
core/src/test/java/io/questdb/test/cairo/wal/WalWriterReplaceRangeTest.java (3 hunks)
core/src/test/java/io/questdb/test/cutlass/http/ExpParquetExportTest.java (1 hunks)

🧰 Additional context used

🧠 Learnings (1)

📚 Learning: 2025-11-07T00:59:31.522Z

Learnt from: bluestreak01
Repo: questdb/questdb PR: 0
File: :0-0
Timestamp: 2025-11-07T00:59:31.522Z
Learning: In QuestDB's Cairo engine, transaction (_txn) files have a strong invariant: they are never truncated below TX_BASE_HEADER_SIZE. Once created, they are either fully formed (size >= header size) or completely removed along with the entire table directory when the table is dropped.

Applied to files:

core/src/main/java/io/questdb/cairo/wal/ApplyWal2TableJob.java

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (34)

GitHub Check: New pull request (SelfHosted Running tests with cover on linux-other)
GitHub Check: New pull request (SelfHosted Running tests with cover on linux-pgwire)
GitHub Check: New pull request (SelfHosted Running tests with cover on linux-cairo-sub)
GitHub Check: New pull request (SelfHosted Running tests with cover on linux-cairo-root)
GitHub Check: New pull request (SelfHosted Running tests with cover on linux-fuzz2)
GitHub Check: New pull request (SelfHosted Running tests with cover on linux-fuzz1)
GitHub Check: New pull request (SelfHosted Running tests with cover on linux-griffin-sub)
GitHub Check: New pull request (Rust Test and Lint on linux-jdk17)
GitHub Check: New pull request (SelfHosted Other tests on linux-x86-graal)
GitHub Check: New pull request (SelfHosted Other tests on linux-x64-zfs)
GitHub Check: New pull request (SelfHosted Running tests with cover on linux-griffin-root)
GitHub Check: New pull request (SelfHosted Other tests on linux-arm64)
GitHub Check: New pull request (Hosted Running tests on windows-other-2)
GitHub Check: New pull request (Hosted Running tests on windows-other-1)
GitHub Check: New pull request (Hosted Running tests on windows-pgwire)
GitHub Check: New pull request (Hosted Running tests on windows-cairo-2)
GitHub Check: New pull request (Hosted Running tests on windows-cairo-1)
GitHub Check: New pull request (Hosted Running tests on windows-fuzz2)
GitHub Check: New pull request (Hosted Running tests on windows-fuzz1)
GitHub Check: New pull request (Hosted Running tests on windows-griffin-sub)
GitHub Check: New pull request (Hosted Running tests on windows-griffin-base)
GitHub Check: New pull request (Hosted Running tests on mac-other)
GitHub Check: New pull request (Hosted Running tests on mac-pgwire)
GitHub Check: New pull request (Hosted Running tests on mac-cairo-fuzz)
GitHub Check: New pull request (Hosted Running tests on mac-cairo)
GitHub Check: New pull request (SelfHosted Cairo tests on linux-x64-zfs)
GitHub Check: New pull request (Hosted Running tests on mac-griffin)
GitHub Check: New pull request (SelfHosted Griffin tests on linux-x86-graal)
GitHub Check: New pull request (SelfHosted Cairo tests on linux-arm64)
GitHub Check: New pull request (SelfHosted Cairo tests on linux-x86-graal)
GitHub Check: New pull request (SelfHosted Griffin tests on linux-arm64)
GitHub Check: New pull request (SelfHosted Griffin tests on linux-x64-zfs)
GitHub Check: New pull request (Check Changes Check changes)
GitHub Check: build

🔇 Additional comments (7)

core/src/test/java/io/questdb/test/cutlass/http/ExpParquetExportTest.java (1)

124-124: Verify the connection between this test change and the PR objectives.

The date change from '1971-01-01' to '1970-01-02' reduces the generated dataset from approximately 1 year (~6.3M rows) to 1 day (~17K rows) of 5-second intervals. While this makes the test more efficient and appropriate for a basic sanity check, it's unclear how this relates to the PR's stated objectives around WAL transaction skipping optimization for materialized views.

Is this test adjustment:

An incidental correction discovered during development?

Related to underlying changes in date/time processing from the WAL optimization work?

Addressing a previous test error?

Please confirm this change is intentional and clarify its relationship to the PR scope.

core/src/main/java/io/questdb/cairo/wal/WalPurgeJob.java (1)

285-285: LGTM! Appropriate logging level for routine WAL discovery.

The change to debug-level logging (verbose=false, logDebug=true) is appropriate for this context where lock failures during WAL segment discovery are routine operations, not exceptional conditions. This aligns well with the existing debug-level logging for successful lock acquisition on line 287.

core/src/main/java/io/questdb/cairo/TableUtils.java (3)

53-53: LGTM! Import required for new logging pattern.

The LogRecord import is necessary to support the conditional log level selection in the updated lock() method implementations.

1091-1128: LGTM! Well-designed logging control for lock operations.

The refactored lock() method provides fine-grained control over logging behavior:

Error paths (lines 1099-1119): Log at ERROR level when verbose=true and logDebug=false, otherwise at DEBUG level when logDebug=true

Success path (lines 1124-1126): Always logs at DEBUG level when logging is enabled (appropriate for routine operations)

The overload chain maintains backward compatibility while enabling callers to choose appropriate logging levels based on operational context (e.g., expected vs. exceptional lock failures).

1130-1132: LGTM! Backward compatibility maintained.

The existing convenience overload continues to delegate appropriately, ensuring no breaking changes for existing callers.

core/src/main/java/io/questdb/cairo/wal/ApplyWal2TableJob.java (2)

621-636: Verification complete: atomic behavior of trySkipWalTransactions is correctly implemented.

The method exhibits proper all-or-nothing semantics:

Precondition check before state modification: The lag row/txn count check at line 3075 occurs before any state modifications via commitSeqTxn, ensuring the precondition (no pending WAL writes) is validated first.

Atomic state transitions:

If the precondition fails (lag counts non-zero), the method returns false immediately without modifying writer state.

If the precondition passes, it calls commitSeqTxn(seqTxn + skipTxnCount - 1) and returns true.

Safe fallback: When trySkipWalTransactions returns false at lines 624-627 in ApplyWal2TableJob.java, the writer state remains unchanged, allowing the fallback to commitWalInsertTransactions at lines 631-635 to proceed safely.

The implementation fully addresses the verification requirements outlined in the original review comment.

638-681: Forward progress is guaranteed by both code paths; no issues found.

The verification confirms that the return value calculation at line 681 is safe:

Skip path (line 3080): Explicitly calls commitSeqTxn(seqTxn + skipTxnCount - 1), ensuring getAppliedSeqTxn() >= seqTxn.

Commit path (lines 1277–1363): The unconditional assertion at line 1361—assert txWriter.getLagTxnCount() == (seqTxn - txWriter.getSeqTxn())—guarantees that getAppliedSeqTxn() (which equals txWriter.getSeqTxn() + txWriter.getLagTxnCount()) always equals the method's seqTxn parameter at completion, regardless of whether data was committed to the main table or buffered in LAG.

Consequently, the loop at line 643 (line 643: for (long s = seqTxn; s <= lastCommittedSeqTxn; s++)) always executes at least once, ensuring the return value (int) (lastCommittedSeqTxn - seqTxn + 1) is at least 1, satisfying the assertion at line 484.

core/src/main/java/io/questdb/cairo/TableWriter.java

core/src/main/java/io/questdb/cairo/wal/ApplyWal2TableJob.java

core/src/test/java/io/questdb/test/cairo/wal/WalWriterReplaceRangeTest.java

coderabbitai · 2025-11-21T10:46:07Z

Important

Review skipped

Auto reviews are disabled on this repository.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Note

Other AI code review bot(s) detected

CodeRabbit has detected other AI code review bot(s) in this pull request and will avoid duplicating their findings in the review comments. This may lead to a less comprehensive review.

✨ Finishing touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch feat-optmise-mat-view-wal-apply

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

nwoolmer

Looks good, a nit and maybe an additional safety check

core/src/main/java/io/questdb/cairo/TableUtils.java

core/src/main/java/io/questdb/cairo/wal/ApplyWal2TableJob.java

nwoolmer

LGTM

bluestreak01 · 2025-11-21T19:18:03Z

I would like to merge this after Ent release, to de-risk Enterprise

…ew-wal-apply

glasstiger · 2025-11-25T19:16:34Z

[PR Coverage check]

😍 pass : 51 / 51 (100.00%)

file detail

	path	covered line	new line	coverage
🔵	io/questdb/cairo/TableWriter.java	9	9	100.00%
🔵	io/questdb/cairo/wal/ApplyWal2TableJob.java	42	42	100.00%

…s that are fully replaced by future commits or full refresh (#6416)" This reverts commit 818d227.

feat(core): optimize mat view writing by skipping transactions that a…

c0684b2

…re fully replaced by future commits

ideoma requested a review from Copilot November 19, 2025 14:42

Copilot started reviewing on behalf of ideoma November 19, 2025 14:43 View session

Merge branch 'master' into feat-optmise-mat-view-wal-apply

6377d49

Copilot finished reviewing on behalf of ideoma November 19, 2025 14:45

cleanup

a2e32bb

nwoolmer reviewed Nov 19, 2025

View reviewed changes

core/src/main/java/io/questdb/cairo/TableWriter.java Outdated Show resolved Hide resolved

Copilot AI reviewed Nov 19, 2025

View reviewed changes

nwoolmer reviewed Nov 19, 2025

View reviewed changes

core/src/main/java/io/questdb/cairo/TableWriter.java Outdated Show resolved Hide resolved

ideoma and others added 10 commits November 19, 2025 15:10

Apply suggestions from code review

25f93b4

Co-authored-by: Copilot <[email protected]>

improve transaction skipping to be more lightweight

80ae53d

add tests

565da05

+ minor fixes

Merge branch 'master' into feat-optmise-mat-view-wal-apply

b5f77fb

fix skipping if with ALTERs

d2560db

Merge branch 'master' into feat-optmise-mat-view-wal-apply

8193481

fix skipping with wal txn lag

4aee27d

test for truncate optimization

5df9c10

test for truncate optimization

2e04194

refactoring

7547af1

questdb deleted a comment from coderabbitai bot Nov 20, 2025

ideoma changed the title ~~feat(core): optimize mat view writing by skipping transactions that are fully replaced by future commits~~ perf(core): optimize mat view writing by skipping transactions that are fully replaced by future commits or full refresh Nov 20, 2025

coderabbitai bot reviewed Nov 20, 2025

View reviewed changes

ideoma added 2 commits November 21, 2025 10:15

fix test, address review comments

69c0151

Merge branch 'master' into feat-optmise-mat-view-wal-apply

5f78ce1

questdb deleted a comment from coderabbitai bot Nov 21, 2025

nwoolmer reviewed Nov 21, 2025

View reviewed changes

core/src/main/java/io/questdb/cairo/TableUtils.java Outdated Show resolved Hide resolved

core/src/main/java/io/questdb/cairo/wal/ApplyWal2TableJob.java Show resolved Hide resolved

ideoma added 3 commits November 21, 2025 13:48

implement nit

68c14bb

attempt to fix wrong writerTxn value in wal_tables query

595c00c

fix parquet copy test

a2d55a3

nwoolmer previously approved these changes Nov 21, 2025

View reviewed changes

bluestreak01 added the DO NOT MERGE These changes should not be merged to main branch label Nov 21, 2025

ideoma mentioned this pull request Nov 24, 2025

fix(core): fix rare suspended WAL table when ALTER and RENAME are executed concurrently #6440

Merged

bluestreak01 removed the DO NOT MERGE These changes should not be merged to main branch label Nov 25, 2025

Merge remote-tracking branch 'origin/master' into feat-optmise-mat-vi…

65a6cb7

…ew-wal-apply

bluestreak01 dismissed nwoolmer’s stale review via 65a6cb7 November 25, 2025 18:53

bluestreak01 approved these changes Nov 25, 2025

View reviewed changes

bluestreak01 merged commit 818d227 into master Nov 25, 2025
41 checks passed

bluestreak01 deleted the feat-optmise-mat-view-wal-apply branch November 25, 2025 19:52

ideoma added a commit that referenced this pull request Nov 26, 2025

Revert "perf(core): optimize mat view writing by skipping transaction…

b199f91

…s that are fully replaced by future commits or full refresh (#6416)" This reverts commit 818d227.

This was referenced Nov 26, 2025

chore(core): revert mat view WAL Apply skipping transactions #6460

Merged

perf(core): optimize mat view writing by skipping transactions that are fully replaced by future commits or full refresh (2) #6495

Merged

Conversation

ideoma commented Nov 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Overview

Uh oh!

ideoma commented Nov 19, 2025

Uh oh!

coderabbitai bot commented Nov 19, 2025

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ideoma commented Nov 20, 2025

Uh oh!

coderabbitai bot commented Nov 20, 2025

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot commented Nov 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Other AI code review bot(s) detected

Uh oh!

nwoolmer left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

nwoolmer left a comment

Choose a reason for hiding this comment

Uh oh!

bluestreak01 commented Nov 21, 2025

Uh oh!

glasstiger commented Nov 25, 2025

[PR Coverage check]

file detail

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

ideoma commented Nov 19, 2025 •

edited

Loading

coderabbitai bot commented Nov 21, 2025 •

edited

Loading