perf(sql): optimized Markout Horizon CROSS JOIN by mtopolnik · Pull Request #6283 · questdb/questdb

mtopolnik · 2025-10-17T07:26:14Z

This is the Markout Curve query we're optimizing:

WITH
  offsets AS (
    SELECT sec_offs, 1_000_000 * sec_offs usec_offs 
    FROM (SELECT x-601 AS sec_offs FROM long_sequence(1201))
  ),
  points AS (SELECT * FROM (
    SELECT id, order_ts, sym1, sym2, unit_price, volume, sec_offs, order_ts + usec_offs AS ts
    FROM orders CROSS JOIN offsets
    ORDER BY order_ts + usec_offs
  ) TIMESTAMP(ts)),
  join1 AS (
    SELECT t.*, p.price AS sym1_price
    FROM points as t
    ASOF JOIN prices as p
    ON (t.sym1 = p.sym)
  ),
  join2 AS (
    SELECT t.*, p.price AS sym2_price
    FROM join1 as t
    ASOF JOIN prices as p
    ON (t.sym2 = p.sym)
  ),
  markouts AS (
  SELECT
    sec_offs, 
    volume,
    volume * (unit_price - (sym1_price/sym2_price)) AS weighted_markout
  FROM join2
  )
SELECT sec_offs, sum(weighted_markout) / sum(volume) AS avg_weighted_markout
FROM markouts;

This subquery generates the sampling points over the markout horizon of every order:

SELECT id, order_ts, sym1, sym2, unit_price, volume, sec_offs, order_ts + usec_offs AS ts
FROM orders CROSS JOIN offsets
ORDER BY order_ts + usec_offs

It performs poorly on a larger number of orders, because the CROSS JOIN output must be fully materialized and then sorted.

This PR introduces a special-case cursor factory that emits the CROSS JOIN output directly in the required order, without having to materialize the orders table. It does materialize offsets because it needs random access to it, but this table is of a very limited size, coming from long_sequence().
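One way to picture emitting the CROSS JOIN rows in timestamp order without a full sort is a k-way merge over per-order iterators. The sketch below is not the code from this PR: the function name `markout_points` and the heap-based merge are illustrative assumptions. It relies on both inputs being sorted ascending, keeps the small offsets list in memory for random access, and admits an order only once its first sampling point is due.

```python
import heapq


def markout_points(order_ts, offsets):
    """Yield (timestamp, order_index, offset_index) in ascending timestamp order.

    Hypothetical sketch: order_ts and offsets must both be sorted ascending.
    Only orders whose horizons overlap the current output position are live.
    """
    if not offsets:
        return
    heap = []        # one entry per live order: (next timestamp, order idx, offset idx)
    next_order = 0   # first order that has not been admitted yet
    n = len(order_ts)
    while heap or next_order < n:
        # Admit every order whose first point is not later than the current head.
        while next_order < n and (
            not heap or order_ts[next_order] + offsets[0] <= heap[0][0]
        ):
            heapq.heappush(heap, (order_ts[next_order] + offsets[0], next_order, 0))
            next_order += 1
        ts, i, j = heapq.heappop(heap)
        yield ts, i, j
        # Advance this order's iterator to its next offset, if any remain.
        if j + 1 < len(offsets):
            heapq.heappush(heap, (order_ts[i] + offsets[j + 1], i, j + 1))


if __name__ == "__main__":
    for row in markout_points([0, 5, 7, 100], [-10, 0, 10]):
        print(row)
```

In this sketch the number of live heap entries is bounded by the number of orders whose horizons overlap at any moment, rather than by the size of the full cross product.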

Example usage and benchmark:

CREATE TABLE orders (
      ts TIMESTAMP,
      price DOUBLE
  ) timestamp(ts) PARTITION BY DAY;
INSERT INTO orders
  SELECT
      generate_series as timestamp,
      rnd_double() * 10.0 + 5.0
      FROM generate_series('2025-01-02', '2025-01-02T00:10', '200u');

This creates 3,000,001 orders spaced at 200 µs.

WITH
  offsets AS (
    SELECT sec_offs, 10_000_000 * sec_offs usec_offs 
    FROM (SELECT x-61 AS sec_offs FROM long_sequence(121))
  )
  SELECT /*+ markout_horizon(orders offsets) */ sum(price) FROM (SELECT * FROM (
    SELECT price, ts + usec_offs AS timestamp
    FROM orders CROSS JOIN offsets
    ORDER BY ts + usec_offs
  ) TIMESTAMP(timestamp));

This creates the markout horizon sampling grid spanning 10 minutes on either side of each order, spaced at 10 seconds. There are 121 sampling points for each order, so the query produces 121 * 3,000,001 = 363,000,121 rows.
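The counts quoted for this benchmark can be re-derived with a few lines of arithmetic. This is only a cross-check, assuming (as the 3,000,001 figure implies) that generate_series includes both endpoints; the variable names are made up for the example.

```python
from datetime import datetime, timedelta

# Orders: one every 200 microseconds from 00:00 to 00:10, both ends included (assumed).
start = datetime(2025, 1, 2)
end = datetime(2025, 1, 2, 0, 10)
step = timedelta(microseconds=200)
order_count = (end - start) // step + 1

# Offsets: x - 61 for x in 1..121, scaled by 10_000_000 microseconds (10 seconds).
usec_offsets = [10_000_000 * (x - 61) for x in range(1, 122)]

total_rows = order_count * len(usec_offsets)

print(order_count)                          # number of orders
print(usec_offsets[0], usec_offsets[-1])    # first and last offset in microseconds
print(total_rows)                           # rows produced by the CROSS JOIN
```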

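The query is built to emphasize the worst case for the Markout Horizon algorithm in terms of RAM usage: tight spacing of orders relative to the markout horizon. All orders fall within a 10-minute window, which is shorter than the horizon, so every order's horizon overlaps every other's and the algorithm must hold 3 million iterator structures in RAM at once. It uses 40 bytes per iterator.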
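As a rough estimate of what those iterators cost, the two figures above can be multiplied out. This ignores any other per-query overhead, and the constant names are only for illustration.

```python
LIVE_ITERATORS = 3_000_000   # iterator structures held at once (from the description)
BYTES_PER_ITERATOR = 40      # per-iterator footprint (from the description)

iterator_bytes = LIVE_ITERATORS * BYTES_PER_ITERATOR
iterator_gb = iterator_bytes / 1e9

print(f"{iterator_bytes:,} bytes, about {iterator_gb:.2f} GB")
```

That is on the order of a tenth of a gigabyte for the iterator state.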

I benchmarked it on a r7a.4xlarge EC2 box.

Without the markout hint, the query took 135 seconds, and RAM usage went from 2.3 GB baseline to 10.7 GB.

With the hint, the query took 17 seconds, with an even split between aggregation and row generation. RAM usage went from 2.3 to 2.4 GB.
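The two runs can be compared with simple arithmetic on the reported figures. This is a back-of-the-envelope sketch, not an additional measurement; the dictionary layout and names are invented for the example.

```python
TOTAL_ROWS = 363_000_121  # rows produced by the CROSS JOIN in this benchmark

runs = {
    "without hint": {"seconds": 135, "ram_before_gb": 2.3, "ram_after_gb": 10.7},
    "with hint": {"seconds": 17, "ram_before_gb": 2.3, "ram_after_gb": 2.4},
}

speedup = runs["without hint"]["seconds"] / runs["with hint"]["seconds"]
ram_growth_gb = {
    name: round(r["ram_after_gb"] - r["ram_before_gb"], 1) for name, r in runs.items()
}
rows_per_second = {name: TOTAL_ROWS / r["seconds"] for name, r in runs.items()}

print(f"speedup: {speedup:.1f}x")
print(f"RAM growth (GB): {ram_growth_gb}")
for name, rate in rows_per_second.items():
    print(f"{name}: {rate / 1e6:.1f} million rows/s")
```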

mtopolnik · 2025-11-19T14:57:45Z

I've noticed https://github.com/questdb/questdb/pull/6362/files#r2537236278 while debugging the new factory. Please take a look.

Addressed in a1abc77

core/src/main/java/io/questdb/griffin/SqlCodeGenerator.java

core/src/main/java/io/questdb/griffin/engine/join/MarkoutHorizonRecordCursorFactory.java

core/src/test/java/io/questdb/test/griffin/engine/join/AsOfJoinFuzzTest.java

glasstiger · 2025-11-21T00:30:34Z

[PR Coverage check]

😍 pass : 334 / 353 (94.62%)

file detail

status | path | covered lines | new lines | coverage
🔵 | io/questdb/griffin/engine/orderby/RecordTreeChain.java | 0 | 1 | 0.00%
🔵 | io/questdb/griffin/engine/join/MarkoutHorizonRecordCursorFactory.java | 253 | 269 | 94.05%
🔵 | io/questdb/griffin/SqlCodeGenerator.java | 78 | 80 | 97.50%
🔵 | io/questdb/cairo/RecordChain.java | 1 | 1 | 100.00%
🔵 | io/questdb/cairo/RecordArray.java | 1 | 1 | 100.00%
🔵 | io/questdb/griffin/SqlHints.java | 1 | 1 | 100.00%

mtopolnik added 30 commits September 12, 2025 12:59

WIP

7f2dd55

style

6590bf2

Improve index scan logic

1287052

Better variable name

e28d47a

Restore slave cursor position at entry instead of end

4384af5

Remove unnecessary checks

8965896

Remove unused fields

0e667ab

Log test params on start of test

bbb44f2

Restore slaveKeySink field

9606df9

Delete unused variables

7c5c54c

Extract variable

e728cd5

Improve performKeyMatching

50f9c20

Apply row ID offset

3bd2d53

Map absolute rowID to localRowID

43a7a0f

Re-acquire index reader after changing partition

1ec1973

Use masterSymbolColumnIndex to access master symbol column

4a32d5d

Remove fallback to linear search

b40ece8

Map back to physical table column index to access symbol index

29ab4fe

Remove unused method

29131cc

Remove default interface method impl

e52d6b1

Implement new methods in SelectedRecordCursorFactory

1b772cd

Add getPhysicalColumnIndex to TimeFrameRecordCursor

9d57ce3

Properly get physical symbol column index

1875fbd

Calculate row offset at each frame again

b64da3c

Remove redundant local variable timeFrame

8cd917f

Remove useless partitionIndex

0c1bf7b

Remove local variable timeFrame

06cb3d8

Use auxiliary frameRecA for local computation

811e6ee

Fix impl of getBitmapIndexReader

9c5e2b0

Update frameIndex and rowMax on each iteration

75b04c3

mtopolnik added 4 commits November 19, 2025 14:28

Reuse MarkoutHorizonInfo

40a1dda

Fix ORDER BY direction check

dd7a2b3

Cover branches in detectMarkoutHorizonPattern()

d963623

Merge branch 'master' into mt_adaptive-sort

be61972

mtopolnik added 2 commits November 19, 2025 17:43

Remove iterator block freelist

cdcf61b

Revert "configurable freelist length"

f29f695

puzpuzpuz reviewed Nov 20, 2025

core/src/main/java/io/questdb/griffin/SqlCodeGenerator.java

puzpuzpuz reviewed Nov 20, 2025

core/src/main/java/io/questdb/griffin/engine/join/MarkoutHorizonRecordCursorFactory.java (outdated)

Remove Reopenable from RecordChain

dc50e0d

puzpuzpuz reviewed Nov 20, 2025

core/src/main/java/io/questdb/griffin/SqlCodeGenerator.java (outdated)

puzpuzpuz reviewed Nov 20, 2025

core/src/main/java/io/questdb/griffin/engine/join/MarkoutHorizonRecordCursorFactory.java (outdated)

puzpuzpuz reviewed Nov 20, 2025

core/src/main/java/io/questdb/griffin/engine/join/MarkoutHorizonRecordCursorFactory.java

mtopolnik added 5 commits November 20, 2025 09:22

Delete leftover comment

3239a54

Use size variable

0fffd9c

Early return -> assertion

3763dd2

Merge branch 'master' into mt_adaptive-sort

c7b76dc

Use slaveRowCount instead of slaveRecordArray.size()

198f2a6

puzpuzpuz reviewed Nov 20, 2025

core/src/main/java/io/questdb/griffin/engine/join/MarkoutHorizonRecordCursorFactory.java

Address nits

dda9ff8

puzpuzpuz self-requested a review November 20, 2025 09:03

puzpuzpuz previously approved these changes Nov 20, 2025

Merge branch 'master' into mt_adaptive-sort

c2568a7

bluestreak01 reviewed Nov 21, 2025

core/src/test/java/io/questdb/test/griffin/engine/join/AsOfJoinFuzzTest.java

cleanup

e4ab7fa

bluestreak01 dismissed puzpuzpuz’s stale review via e4ab7fa November 21, 2025 00:04

bluestreak01 approved these changes Nov 21, 2025

bluestreak01 merged commit f744c20 into master Nov 21, 2025
41 checks passed

bluestreak01 deleted the mt_adaptive-sort branch November 21, 2025 22:49

bluestreak01 merged 243 commits into master from mt_adaptive-sort
