feat(sql): optimized ASOF JOIN on single symbol key where RHS symbol is low-frequency by mtopolnik · Pull Request #6208 · questdb/questdb

mtopolnik · 2025-09-30T14:16:10Z

Keyed ASOF JOIN on a non-indexed symbol column uses linear scan to find the matching RHS row. For each LHS row, it repeats the RHS search from scratch. This works fine as long as the RHS table has plenty occurrences of all symbols.

However, if some symbol occurs in the RHS table only rarely, a great many rows will have to be scanned to find it. Then, when we move on after finding it, and encounter the same symbol in a future LHS row, we'll have to repeat the entire scan, only to find the exact same RHS symbol.

This PR saves time by remembering the location of each symbol it encounters in its search for the matching RHS row, and then reusing that knowledge when scanning again. It also remembers where it didn't find a symbol, allowing it to skip over the already-scanned region of the RHS table.

Benchmark

The benchmark uses two tables, prices as the RHS in the join, and orders as LHS.

We set up the data using the Zipf (long-tail) distribution of 5,000 symbols in prices, but hand-pick only the 500 rarest symbols for orders. Each of these symbols occurs just 5-9 times among the table's 320 million rows.

We set up the timestamps such that orders in orders happen later than almost all the data in prices, resulting in a large search space for the matching ASOF JOIN row.

Currency price table:

CREATE TABLE prices (
      ts TIMESTAMP,
      sym SYMBOL,
      price DOUBLE
  ) timestamp(ts) PARTITION BY DAY;
INSERT INTO prices
  SELECT
      dateadd('s', x::int, '2010-01-01T00:00:00.000000Z') as ts,
      rnd_symbol_zipf(5_000, 2.0),
      rnd_double() * 10.0 + 5.0
      FROM long_sequence(320_000_000);

Trade orders table:

CREATE TABLE orders (
    id LONG,
    order_ts TIMESTAMP,
    sym1 SYMBOL CAPACITY 1024,
    sym2 SYMBOL CAPACITY 1024,
    unit_price DOUBLE,
    volume DOUBLE
) TIMESTAMP(order_ts) PARTITION BY DAY;

WITH order_basics AS (SELECT * FROM (
  SELECT
    x AS id,
    dateadd('s', (x * 10)::int, '2020-01-01T00:00:00.000000Z') AS order_ts,
    rnd_symbol('sym4893','sym4774','sym4535', ... 500 rarest symbols in prices) AS sym1,
    rnd_symbol('sym4893','sym4774','sym4535', ... 500 rarest symbols in prices) AS sym2,
    rnd_double() * 20 + 10 AS volume
  FROM long_sequence(1000))
  TIMESTAMP(order_ts)
  ),
  join1 AS (
    SELECT ob.*, p.price price1 FROM order_basics ob ASOF JOIN prices p ON (ob.sym1 = p.sym)
  ),
  join2 AS (
    SELECT j1.*, p.price price2 FROM join1 j1 ASOF JOIN prices p ON (j1.sym2 = p.sym)
  )
INSERT INTO orders
SELECT id, order_ts, sym1, sym2, price1/price2 AS unit_price, volume FROM join2;

The query:

WITH
  offsets AS (
    SELECT sec_offs, 1_000_000 * sec_offs usec_offs 
    FROM (SELECT x-601 AS sec_offs FROM long_sequence(1201))
  ),
  points AS (SELECT * FROM (
    SELECT id, order_ts, sym1, sym2, unit_price, volume, sec_offs, order_ts + usec_offs AS ts
    FROM orders CROSS JOIN offsets
    ORDER BY order_ts + usec_offs
  ) TIMESTAMP(ts)),
  join1 AS (
    SELECT t.*, p.price AS sym1_price
    FROM points as t
    ASOF JOIN prices as p
    ON (t.sym1 = p.sym)
  ),
  join2 AS (
    SELECT t.*, p.price AS sym2_price
    FROM join1 as t
    ASOF JOIN prices as p
    ON (t.sym2 = p.sym)
  ),
  markouts AS (
  SELECT
    sec_offs, 
    volume,
    volume * (unit_price - (sym1_price/sym2_price)) AS weighted_markout
  FROM join2
  )
SELECT sec_offs, sum(weighted_markout) / sum(volume) AS avg_weigthed_markout 
FROM markouts;

This query results in 1,201,000 rows.

With the SQL hint that disables the new algo, SELECT /*+ asof_fast_search(t p)*/, the query times out.
Using Indexed Scan with SELECT /*+ asof_index_search(t p)*/ ...., the query times out again. Now the search is able to skip over entire partitions where the symbol doesn't exist, but it must still go through the partitions one by one, repeatedly.
Using the new default Memoized Scan, the query takes around 15 seconds on a cold disk cache, and 8.5 seconds when repeated.

core/src/main/java/io/questdb/griffin/engine/join/AsOfJoinMemoizedRecordCursorFactory.java

bluestreak01

refactoring in this PR introduced previously non-existent possibility of NPE in SQLOptimiser. Intellij produces warnings that should be investigated:

mtopolnik · 2025-10-27T13:13:06Z

refactoring in this PR introduced previously non-existent possibility of NPE in SQLOptimiser. Intellij produces warnings that should be investigated:

The changes I see in SqlOptimiser are unrelated to the warning.

I think the reason why this is showing up now isn't any change in this PR, but an improvement to the static analysis in IDEA. It can see that a few lines above we have a check if (innerVirtualModel != null), but don't have it later on, where this warning is raised. The reason why I think it's safe without the check is that the parameter addColumnToInnerVirtualModel is only true when innerVirtualModel is non-null. But I haven't 100% confirmed this.

glasstiger · 2025-10-27T13:50:56Z

[PR Coverage check]

😍 pass : 280 / 296 (94.59%)

file detail

	path	covered line	new line	coverage
🔵	io/questdb/griffin/engine/join/ChainedSymbolColumnAccessHelper.java	0	4	00.00%
🔵	io/questdb/griffin/engine/join/NoopColumnAccessHelper.java	2	5	40.00%
🔵	io/questdb/griffin/engine/join/SingleVarcharColumnAccessHelper.java	6	7	85.71%
🔵	io/questdb/griffin/engine/join/SingleStringColumnAccessHelper.java	7	8	87.50%
🔵	io/questdb/griffin/engine/join/AsOfJoinMemoizedRecordCursorFactory.java	165	171	96.49%
🔵	io/questdb/griffin/SqlCodeGenerator.java	54	55	98.18%
🔵	io/questdb/std/CharSequenceLongHashMap.java	1	1	100.00%
🔵	io/questdb/std/LowerCaseCharSequenceIntHashMap.java	1	1	100.00%
🔵	io/questdb/std/IntLongHashMap.java	1	1	100.00%
🔵	io/questdb/griffin/engine/join/AsOfJoinFastRecordCursorFactory.java	12	12	100.00%
🔵	io/questdb/griffin/engine/join/AsofJoinColumnAccessHelper.java	1	1	100.00%
🔵	io/questdb/std/Utf8SequenceLongHashMap.java	1	1	100.00%
🔵	io/questdb/griffin/SqlHints.java	3	3	100.00%
🔵	io/questdb/griffin/engine/join/AsOfJoinIndexedRecordCursorFactory.java	5	5	100.00%
🔵	io/questdb/griffin/SqlOptimiser.java	3	3	100.00%
🔵	io/questdb/griffin/engine/join/SingleSymbolColumnAccessHelper.java	16	16	100.00%
🔵	io/questdb/std/CharSequenceIntHashMap.java	2	2	100.00%

mtopolnik added 30 commits September 12, 2025 12:59

WIP

7f2dd55

style

6590bf2

Improve index scan logic

1287052

Better variable name

e28d47a

Restore slave cursor position at entry istead of end

4384af5

Remove unnecessary checks

8965896

Remove unused fields

0e667ab

Log test params on start of test

bbb44f2

Restore slaveKeySink field

9606df9

Delete unused variables

7c5c54c

Extract variable

e728cd5

Improve performKeyMatching

50f9c20

Apply row ID offset

3bd2d53

Map absolute rowID to localRowID

43a7a0f

Re-acquire index reader after changing partition

1ec1973

Use masterSymbolColumnIndex to access master symbol column

4a32d5d

Remove fallback to linear search

b40ece8

Map back to physical table column index to acces symbol index

29ab4fe

Remove unused method

29131cc

Remove default interface method impl

e52d6b1

Implement new methods in SelectedRecordCursorFactory

1b772cd

Add getPhysicalColumnIndex to TimeFrameRecordCursor

9d57ce3

Properly get physical symbol column index

1875fbd

Calculate row offset at each frame again

b64da3c

Remove redundant local variable timeFrame

8cd917f

Remove useless partitionIndex

0c1bf7b

Remove local variable timeFrame

06cb3d8

Use auxiliary frameRecA for local computation

811e6ee

Fix impl of getBimtapIndexReader

9c5e2b0

Update frameIndex and rowMax on each iteration

75b04c3

mtopolnik and others added 7 commits October 22, 2025 11:35

Adapt tests to new default Asof Scan

7b06a5e

Clean up increment methods in hash maps

c2ee416

Reformat

63c0835

Avoid redundant map lookup

2ed789b

Reformat

043ce20

Improve comments

4631900

More comment cleanup

29717c9

puzpuzpuz self-requested a review October 22, 2025 14:32

puzpuzpuz previously approved these changes Oct 22, 2025

View reviewed changes

bluestreak01 reviewed Oct 23, 2025

View reviewed changes

core/src/main/java/io/questdb/griffin/engine/join/AsOfJoinMemoizedRecordCursorFactory.java Outdated Show resolved Hide resolved

bluestreak01 requested changes Oct 23, 2025

View reviewed changes

bluestreak01 added 2 commits October 23, 2025 18:18

tidy-up

f83d96d

Merge remote-tracking branch 'origin/master' into mt_asof-join-fast

30b3d70

bluestreak01 dismissed puzpuzpuz’s stale review via 30b3d70 October 23, 2025 22:19

Merge branch 'master' into mt_asof-join-fast

6113a4e

mtopolnik added 2 commits October 27, 2025 14:17

Remove unused symbolTable

e35bb5c

Merge branch 'master' into mt_asof-join-fast

f6a36ee

bluestreak01 approved these changes Oct 28, 2025

View reviewed changes

bluestreak01 merged commit f51825a into master Oct 28, 2025
34 checks passed

bluestreak01 deleted the mt_asof-join-fast branch October 28, 2025 02:53

This was referenced Oct 31, 2025

perf(sql): improve performance of linear as-of join algo #6338

Merged

chore(sql): speedup tests #6347

Merged

chore(core): refactor and clean up ASOF/LT JOIN code #6348

Merged

tris0laris mentioned this pull request Nov 18, 2025

Real-time markouts for capital markets questdb/roadmap#98

Open

coderabbitai bot mentioned this pull request Jan 1, 2026

feat(sql): add rowCount, txn and timestamp columns to tables() #6581

Merged

4 tasks

coderabbitai bot mentioned this pull request Jan 13, 2026

fix(sql): fix ASOF JOIN crash when ON clause has symbol and other columns #6634

Merged

This was referenced Jan 23, 2026

fix(sql): proper error message when timestamp used along another join key in ASOF/LT join #6698

Merged

feat(core): optimize parquet partition read with late materialization, zero-copy page reading, and use raw array encoding #6675

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(sql): optimized ASOF JOIN on single symbol key where RHS symbol is low-frequency#6208

feat(sql): optimized ASOF JOIN on single symbol key where RHS symbol is low-frequency#6208
bluestreak01 merged 140 commits intomasterfrom
mt_asof-join-fast

mtopolnik commented Sep 30, 2025 •

edited

Loading

Uh oh!

Uh oh!

bluestreak01 left a comment

Uh oh!

mtopolnik commented Oct 27, 2025

Uh oh!

glasstiger commented Oct 27, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

mtopolnik commented Sep 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Benchmark

Uh oh!

Uh oh!

bluestreak01 left a comment

Choose a reason for hiding this comment

Uh oh!

mtopolnik commented Oct 27, 2025

Uh oh!

glasstiger commented Oct 27, 2025

[PR Coverage check]

file detail

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

mtopolnik commented Sep 30, 2025 •

edited

Loading