perf(sql): parallel group by with optional filtering by puzpuzpuz · Pull Request #4032 · questdb/questdb

puzpuzpuz · 2023-12-04T14:57:29Z

Currently, GROUP BY queries run in parallel only in a few cases:

Non-keyed GROUP BY with basic aggregate functions, e.g. select sum(value) from t.
Single INT or SYMBOL-keyed GROUP BY with basic aggregate functions, e.g. select int_key, sum(value) from t.

This patch extends the set of cases in which GROUP BY executes on multiple threads. The implementation builds on top of parallel SQL filters (a.k.a. async offload), so the same scheduling and cancellation behavior applies. The work is split into page frame tasks, aggregated by the shared workers, and accumulated in FastMap (keyed GROUP BY) or SimpleMapValue (non-keyed GROUP BY). FastMap/SimpleMapValue instances are reused across executions of the same query and across different queries.
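
The split-aggregate-accumulate flow described above can be sketched roughly as follows. This is a simplified illustration, not QuestDB code: the class and method names are hypothetical, a plain HashMap stands in for FastMap, a fixed-size row range stands in for a page frame, and each task builds its own partial map instead of reusing a per-worker one.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

// Hypothetical sketch: none of these names exist in QuestDB.
final class ParallelGroupBySketch {

    // Aggregates rows [lo, hi) of one "frame" into a private partial map.
    static Map<String, Long> aggregateFrame(String[] keys, long[] values, int lo, int hi) {
        Map<String, Long> partial = new HashMap<>();
        for (int i = lo; i < hi; i++) {
            partial.merge(keys[i], values[i], Long::sum);
        }
        return partial;
    }

    // Roughly "select key, sum(value) from t" on `workers` threads.
    // frameSize must be positive.
    static Map<String, Long> sumByKey(String[] keys, long[] values, int frameSize, int workers) {
        ExecutorService pool = Executors.newFixedThreadPool(workers);
        try {
            // Dispatch one task per frame to the worker pool.
            List<CompletableFuture<Map<String, Long>>> tasks = new ArrayList<>();
            for (int lo = 0; lo < keys.length; lo += frameSize) {
                final int from = lo;
                final int to = Math.min(lo + frameSize, keys.length);
                tasks.add(CompletableFuture.supplyAsync(
                        () -> aggregateFrame(keys, values, from, to), pool));
            }
            // Accumulate the partial results on the calling thread.
            Map<String, Long> result = new HashMap<>();
            for (CompletableFuture<Map<String, Long>> task : tasks) {
                task.join().forEach((k, v) -> result.merge(k, v, Long::sum));
            }
            return result;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        String[] keys = {"a", "b", "a", "c", "b", "a"};
        long[] values = {1, 2, 3, 4, 5, 6};
        System.out.println(sumByKey(keys, values, 2, 4));
    }
}
```

The final accumulation loop runs on a single thread, which is the part that becomes a bottleneck when the number of groups is large.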

As an optional second step in the query processing, we merge sharded maps in parallel. Shards contain non-intersecting sets of groups, so once each shard is fully merged, its rows can be returned to the caller without any further merging. This behavior kicks in only when the maps are large enough (cairo.sql.parallel.groupby.sharding.threshold, defaults to 10k).
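
A minimal sketch of the sharding idea, under the assumption that groups are assigned to shards by key hash (the PR text does not spell out the assignment rule). All names are hypothetical, and HashMap again stands in for FastMap. Because shard i of every worker holds only keys that hash to i, each shard can be merged by a different thread without locking.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;
import java.util.stream.IntStream;

// Hypothetical sketch: none of these names exist in QuestDB.
final class ShardedMergeSketch {

    // Assumed rule: a key belongs to the shard selected by its hash.
    static int shardOf(String key, int shardCount) {
        return Math.floorMod(key.hashCode(), shardCount);
    }

    // Splits one worker's partial map into shardCount disjoint maps.
    static List<Map<String, Long>> shard(Map<String, Long> partial, int shardCount) {
        List<Map<String, Long>> shards = new ArrayList<>();
        for (int i = 0; i < shardCount; i++) {
            shards.add(new HashMap<>());
        }
        partial.forEach((k, v) -> shards.get(shardOf(k, shardCount)).put(k, v));
        return shards;
    }

    // Merges shard i of every worker into one map; shards are processed in parallel.
    // Each merged map is written by exactly one thread, so no synchronization is needed.
    static List<Map<String, Long>> mergeShards(List<List<Map<String, Long>>> perWorker,
                                               int shardCount) {
        return IntStream.range(0, shardCount).parallel().mapToObj(i -> {
            Map<String, Long> merged = new HashMap<>();
            for (List<Map<String, Long>> workerShards : perWorker) {
                workerShards.get(i).forEach((k, v) -> merged.merge(k, v, Long::sum));
            }
            return merged;
        }).collect(Collectors.toList());
    }

    public static void main(String[] args) {
        Map<String, Long> worker1 = new HashMap<>(Map.of("a", 1L, "b", 2L));
        Map<String, Long> worker2 = new HashMap<>(Map.of("a", 10L, "c", 3L));
        List<List<Map<String, Long>>> perWorker = new ArrayList<>();
        perWorker.add(shard(worker1, 4));
        perWorker.add(shard(worker2, 4));
        // The merged shards are disjoint, so their rows can be emitted one shard after another.
        mergeShards(perWorker, 4).forEach(System.out::println);
    }
}
```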

The implementation also "steals" the filter from the underlying factory, so all of the following sample queries will be executed by the new parallel GROUP BY framework:

-- column keys are supported
select str_col, sum(long_col) from t;
-- function and operation keys are supported; filters are also supported
select concat(str_col1, str_col2), sum(long_col) from t where long_col > 42;
-- non-keyed GROUP BY is also supported
select vwap(price, quantity) from t where quantity > 42;

Currently supported aggregate functions: count(*) and count(col), avg, sum, min/max, vwap (all for fixed-size types).
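
What the listed functions have in common is that their intermediate state is a small fixed-size value that can be combined across workers. A minimal sketch for vwap, assuming the state is the pair (sum of price * quantity, sum of quantity); the class is hypothetical and ignores the NULL handling a real implementation would need.

```java
// Hypothetical sketch: not the QuestDB VwapDoubleGroupByFunction.
final class MergeableVwapSketch {
    private double priceTimesQuantity; // running sum of price * quantity
    private double quantity;           // running sum of quantity

    // Folds one row into this partial state.
    void add(double price, double qty) {
        priceTimesQuantity += price * qty;
        quantity += qty;
    }

    // Combines another worker's partial state into this one.
    void merge(MergeableVwapSketch other) {
        priceTimesQuantity += other.priceTimesQuantity;
        quantity += other.quantity;
    }

    // Final value; NaN when no rows were aggregated.
    double result() {
        return quantity == 0 ? Double.NaN : priceTimesQuantity / quantity;
    }

    public static void main(String[] args) {
        MergeableVwapSketch worker1 = new MergeableVwapSketch();
        worker1.add(10.0, 1.0);
        worker1.add(20.0, 1.0);
        MergeableVwapSketch worker2 = new MergeableVwapSketch();
        worker2.add(30.0, 2.0);
        worker1.merge(worker2);
        System.out.println(worker1.result());
    }
}
```

A function such as count_distinct needs a per-group set rather than a couple of numbers, which is why it does not fit this fixed-size model directly.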

Benchmark results aren't included, but the improvement on my 4c/8t machine varies from 2x to 10x depending on the query.

The new behavior is enabled by default, but can be switched off with cairo.sql.parallel.groupby.enabled=false.

Also includes #4078 (single count_distinct re-write to a parallel GROUP BY for all supported types except symbol).

Other limitations

Aggregate functions that have an additional state, e.g. count_distinct, aren't yet supported.

Next steps

Port aggregate functions with additional state to the new framework.
Get rid of CompactMap and introduce UnorderedMap for the small fixed-size key-value case. That's to speed up key look-ups by avoiding extra access to FastMap's heap once we've determined the hash table slot.
Start building multi-threaded SAMPLE BY factories based on the same approach.

ideoma · 2023-12-22T17:16:01Z

[PR Coverage check]

😍 pass : 2012 / 2272 (88.56%)

file detail

	path	covered line	new line	coverage
🔵	io/questdb/cairo/TestSink.java	0	1	00.00%
🔵	io/questdb/griffin/engine/table/LatestByValueDeferredIndexedFilteredRecordCursorFactory.java	0	1	00.00%
🔵	io/questdb/cairo/map/MapRecord.java	0	3	00.00%
🔵	io/questdb/griffin/engine/groupby/vect/GroupByNotKeyedVectorRecordCursorFactory.java	0	1	00.00%
🔵	io/questdb/griffin/engine/table/LatestByRecordCursorFactory.java	0	1	00.00%
🔵	io/questdb/griffin/engine/groupby/SampleByInterpolateRecordCursorFactory.java	0	1	00.00%
🔵	io/questdb/cairo/map/MapKey.java	0	3	00.00%
🔵	io/questdb/griffin/engine/orderby/SortedLightRecordCursorFactory.java	0	1	00.00%
🔵	io/questdb/griffin/engine/join/RecordAsAFieldRecordCursorFactory.java	0	1	00.00%
🔵	io/questdb/griffin/engine/groupby/DistinctRecordCursorFactory.java	0	1	00.00%
🔵	io/questdb/griffin/engine/table/LatestByLightRecordCursorFactory.java	0	1	00.00%
🔵	io/questdb/cairo/map/MapValue.java	0	1	00.00%
🔵	io/questdb/cairo/map/Map.java	0	2	00.00%
🔵	io/questdb/cutlass/pgwire/CleartextPasswordPgWireAuthenticator.java	0	1	00.00%
🔵	io/questdb/griffin/engine/table/LatestByValueIndexedFilteredRecordCursorFactory.java	0	1	00.00%
🔵	io/questdb/griffin/engine/groupby/SampleByFirstLastRecordCursorFactory.java	0	1	00.00%
🔵	io/questdb/griffin/engine/orderby/SortedRecordCursorFactory.java	0	1	00.00%
🔵	io/questdb/griffin/engine/table/LatestByAllIndexedRecordCursorFactory.java	0	1	00.00%
🔵	io/questdb/griffin/model/RuntimeIntervalModel.java	0	1	00.00%
🔵	io/questdb/griffin/engine/LimitRecordCursorFactory.java	0	1	00.00%
🔵	io/questdb/griffin/engine/window/CachedWindowRecordCursorFactory.java	0	1	00.00%
🔵	io/questdb/griffin/engine/groupby/CountRecordCursorFactory.java	0	1	00.00%
🔵	io/questdb/cairo/CairoConfigurationWrapper.java	0	4	00.00%
🔵	io/questdb/cairo/map/CompactMapValue.java	0	1	00.00%
🔵	io/questdb/griffin/engine/functions/groupby/MaxDoubleGroupByFunction.java	1	6	16.67%
🔵	io/questdb/griffin/engine/functions/groupby/SumLongGroupByFunction.java	1	6	16.67%
🔵	io/questdb/griffin/engine/functions/groupby/MinFloatGroupByFunction.java	1	6	16.67%
🔵	io/questdb/griffin/engine/functions/groupby/MaxCharGroupByFunction.java	1	6	16.67%
🔵	io/questdb/griffin/engine/functions/groupby/MinIPv4GroupByFunction.java	1	6	16.67%
🔵	io/questdb/griffin/engine/functions/groupby/MinDoubleGroupByFunction.java	1	6	16.67%
🔵	io/questdb/griffin/engine/functions/groupby/MaxIPv4GroupByFunction.java	1	6	16.67%
🔵	io/questdb/griffin/engine/functions/groupby/MaxTimestampGroupByFunction.java	1	6	16.67%
🔵	io/questdb/griffin/engine/functions/groupby/MaxDateGroupByFunction.java	1	6	16.67%
🔵	io/questdb/griffin/engine/functions/groupby/MinLongGroupByFunction.java	1	6	16.67%
🔵	io/questdb/griffin/engine/functions/groupby/MinIntGroupByFunction.java	1	6	16.67%
🔵	io/questdb/griffin/engine/functions/groupby/SumIntGroupByFunction.java	1	6	16.67%
🔵	io/questdb/griffin/engine/functions/groupby/MaxIntGroupByFunction.java	1	6	16.67%
🔵	io/questdb/griffin/engine/functions/groupby/AbstractCountGroupByFunction.java	1	4	25.00%
🔵	io/questdb/griffin/engine/functions/groupby/MinTimestampGroupByFunction.java	2	7	28.57%
🔵	io/questdb/griffin/engine/functions/groupby/MaxFloatGroupByFunction.java	2	7	28.57%
🔵	io/questdb/griffin/engine/functions/groupby/MaxLongGroupByFunction.java	2	7	28.57%
🔵	io/questdb/griffin/engine/functions/groupby/MinDateGroupByFunction.java	2	7	28.57%
🔵	io/questdb/griffin/model/ExpressionNode.java	1	3	33.33%
🔵	io/questdb/cairo/sql/StatefulAtom.java	1	2	50.00%
🔵	io/questdb/griffin/engine/table/LatestBySubQueryRecordCursorFactory.java	1	2	50.00%
🔵	io/questdb/griffin/engine/functions/GroupByFunction.java	1	2	50.00%
🔵	io/questdb/griffin/engine/functions/groupby/SumLong256GroupByFunction.java	9	14	64.29%
🔵	io/questdb/griffin/engine/functions/groupby/FirstCharGroupByFunction.java	2	3	66.67%
🔵	io/questdb/griffin/engine/functions/groupby/FirstSymbolGroupByFunction.java	2	3	66.67%
🔵	io/questdb/griffin/engine/functions/groupby/FirstBooleanGroupByFunction.java	2	3	66.67%
🔵	io/questdb/griffin/engine/functions/groupby/FirstTimestampGroupByFunction.java	2	3	66.67%
🔵	io/questdb/cairo/map/ShardedMapCursor.java	63	92	68.48%
🔵	io/questdb/griffin/engine/functions/groupby/SumFloatGroupByFunction.java	13	18	72.22%
🔵	io/questdb/griffin/engine/groupby/GroupByNotKeyedRecordCursorFactory.java	12	16	75.00%
🔵	io/questdb/cairo/map/FastMapValue.java	9	12	75.00%
🔵	io/questdb/griffin/engine/groupby/GroupByMergeShardJob.java	16	20	80.00%
🔵	io/questdb/griffin/engine/groupby/vect/GroupByRecordCursorFactory.java	5	6	83.33%
🔵	io/questdb/griffin/engine/groupby/SimpleMapValue.java	7	8	87.50%
🔵	io/questdb/griffin/engine/table/AsyncGroupByNotKeyedRecordCursorFactory.java	45	51	88.24%
🔵	io/questdb/griffin/SqlCodeGenerator.java	104	116	89.66%
🔵	io/questdb/cairo/map/FastMapVarSizeRecord.java	185	201	92.04%
🔵	io/questdb/griffin/engine/table/AsyncGroupByNotKeyedRecordCursor.java	85	91	93.41%
🔵	io/questdb/griffin/engine/groupby/GroupByRecordCursorFactory.java	15	16	93.75%
🔵	io/questdb/griffin/engine/table/AsyncGroupByRecordCursorFactory.java	75	80	93.75%
🔵	io/questdb/griffin/engine/table/AsyncGroupByNotKeyedAtom.java	53	56	94.64%
🔵	io/questdb/cairo/RecordSinkFactory.java	209	218	95.87%
🔵	io/questdb/griffin/model/QueryModel.java	23	24	95.83%
🔵	io/questdb/griffin/engine/table/AsyncGroupByRecordCursor.java	152	160	95.00%
🔵	io/questdb/cairo/map/FastMapFixedSizeRecord.java	114	120	95.00%
🔵	io/questdb/griffin/engine/table/AsyncGroupByAtom.java	169	173	97.69%
🔵	io/questdb/griffin/engine/groupby/GroupByUtils.java	135	136	99.26%
🔵	io/questdb/cairo/map/FastMap.java	130	131	99.24%
🔵	io/questdb/griffin/engine/functions/groupby/FirstNotNullLongGroupByFunction.java	1	1	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/FirstShortGroupByFunction.java	3	3	100.00%
🔵	io/questdb/MessageBusImpl.java	11	11	100.00%
🔵	io/questdb/griffin/engine/table/AsyncFilteredNegativeLimitRecordCursor.java	1	1	100.00%
🔵	io/questdb/PropServerConfiguration.java	21	21	100.00%
🔵	io/questdb/cairo/sql/async/PageFrameSequence.java	8	8	100.00%
🔵	io/questdb/cairo/ArrayColumnTypes.java	2	2	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/VwapDoubleGroupByFunction.java	10	10	100.00%
🔵	io/questdb/griffin/engine/table/VirtualRecordCursorFactory.java	13	13	100.00%
🔵	io/questdb/griffin/engine/table/AsyncFilteredRecordCursor.java	1	1	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/FirstFloatGroupByFunction.java	3	3	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/FirstDoubleGroupByFunction.java	3	3	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/FirstNotNullTimestampGroupByFunction.java	1	1	100.00%
🔵	io/questdb/griffin/engine/table/SortedSymbolIndexRecordCursorFactory.java	1	1	100.00%
🔵	io/questdb/cutlass/http/HttpConnectionContext.java	1	1	100.00%
🔵	io/questdb/griffin/engine/table/AsyncFilterAtom.java	3	3	100.00%
🔵	io/questdb/griffin/engine/PerWorkerLocks.java	2	2	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/FirstNotNullGeoHashGroupByFunctionFactory.java	4	4	100.00%
🔵	io/questdb/griffin/engine/table/LatestByValuesIndexedFilteredRecordCursorFactory.java	1	1	100.00%
🔵	io/questdb/std/bytes/Bytes.java	1	1	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/FirstNotNullDateGroupByFunction.java	1	1	100.00%
🔵	io/questdb/griffin/engine/orderby/LimitedSizeLongTreeChain.java	7	7	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/LastNotNullSymbolGroupByFunction.java	1	1	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/FirstNotNullIPv4GroupByFunctionFactory.java	1	1	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/LastNotNullTimestampGroupByFunction.java	1	1	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/FirstGeoHashGroupByFunctionLong.java	5	5	100.00%
🔵	io/questdb/griffin/engine/table/FilterOnValuesRecordCursorFactory.java	3	3	100.00%
🔵	io/questdb/cairo/sql/async/PageFrameReduceJob.java	1	1	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/AvgDoubleGroupByFunction.java	8	8	100.00%
🔵	io/questdb/griffin/engine/window/WindowRecordCursorFactory.java	1	1	100.00%
🔵	io/questdb/cairo/DefaultCairoConfiguration.java	4	4	100.00%
🔵	io/questdb/griffin/engine/table/DataFrameRecordCursorFactory.java	1	1	100.00%
🔵	io/questdb/griffin/BasePlanSink.java	4	4	100.00%
🔵	io/questdb/cairo/map/FastMapCursor.java	10	10	100.00%
🔵	io/questdb/griffin/engine/table/FilteredRecordCursorFactory.java	1	1	100.00%
🔵	io/questdb/cairo/CairoEngine.java	1	1	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/LastNotNullLongGroupByFunction.java	1	1	100.00%
🔵	io/questdb/ServerMain.java	2	2	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/FirstNotNullIntGroupByFunction.java	1	1	100.00%
🔵	io/questdb/griffin/engine/table/SelectedRecordCursorFactory.java	1	1	100.00%
🔵	io/questdb/tasks/GroupByMergeShardTask.java	15	15	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/LastNotNullIntGroupByFunction.java	1	1	100.00%
🔵	io/questdb/PropertyKey.java	4	4	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/FirstNotNullCharGroupByFunction.java	1	1	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/LastNotNullDateGroupByFunction.java	1	1	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/FirstLongGroupByFunction.java	3	3	100.00%
🔵	io/questdb/cairo/sql/async/PageFrameReduceTask.java	10	10	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/LastGeoHashGroupByFunctionFactory.java	4	4	100.00%
🔵	io/questdb/griffin/engine/table/AsyncJitFilteredRecordCursorFactory.java	5	5	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/HaversineDistDegreeGroupByFunction.java	5	5	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/SumDoubleGroupByFunction.java	6	6	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/LastNotNullGeoHashGroupByFunctionFactory.java	4	4	100.00%
🔵	io/questdb/cairo/TableWriter.java	1	1	100.00%
🔵	io/questdb/griffin/engine/table/FilterOnExcludedValuesRecordCursorFactory.java	2	2	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/FirstNotNullSymbolGroupByFunction.java	1	1	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/FirstByteGroupByFunction.java	3	3	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/FirstGeoHashGroupByFunctionByte.java	5	5	100.00%
🔵	io/questdb/griffin/SqlOptimiser.java	72	72	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/CountLongConstGroupByFunction.java	5	5	100.00%
🔵	io/questdb/griffin/engine/table/LatestByValueIndexedFilteredRecordCursor.java	1	1	100.00%
🔵	io/questdb/griffin/engine/functions/eq/EqLong256StrFunctionFactory.java	5	5	100.00%
🔵	io/questdb/griffin/engine/groupby/GroupByFunctionsUpdaterFactory.java	19	19	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/LastNotNullCharGroupByFunction.java	1	1	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/FirstGeoHashGroupByFunctionShort.java	5	5	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/FirstDateGroupByFunction.java	3	3	100.00%
🔵	io/questdb/griffin/engine/table/AsyncFilteredRecordCursorFactory.java	5	5	100.00%
🔵	io/questdb/griffin/engine/functions/groupby/FirstGeoHashGroupByFunctionInt.java	5	5	100.00%
🔵	io/questdb/cairo/map/MapFactory.java	1	1	100.00%
🔵	io/questdb/cairo/sql/RecordCursorFactory.java	1	1	100.00%
🔵	io/questdb/griffin/engine/groupby/AbstractSampleByRecordCursorFactory.java	1	1	100.00%
🔵	io/questdb/griffin/SqlUtil.java	9	9	100.00%

perf(sql): parallel group by and optional filtering

caabd1d

puzpuzpuz added the SQL (Issues or changes relating to SQL execution) and Performance (Performance improvements) labels Dec 4, 2023

puzpuzpuz self-assigned this Dec 4, 2023

puzpuzpuz changed the title ~~perf(sql): parallel group by and optional filtering~~ perf(sql): parallel group by with optional filtering Dec 4, 2023

puzpuzpuz and others added 25 commits December 5, 2023 09:02

Remove unused field

a632d0d

Merge remote-tracking branch 'upstream/master' into puzpuzpuz_paralle…

fffee17

…l_group_by

Add FastMap tests

8c35fe0

Port a number of functions to parallel GROUP BY

437ff11

Fix symbol handling

2dbc132

Remove double close

336421b

Remove another double close

3fa61e7

Simplify AsyncGroupByRecordCursor

64b998a

Merge remote-tracking branch 'origin/master' into puzpuzpuz_parallel_…

0cd70b4

…group_by

Merge branch 'puzpuzpuz_parallel_group_by' of https://github.com/ques…

2b73d81

…tdb/questdb into puzpuzpuz_parallel_group_by

Merge remote-tracking branch 'upstream/master' into puzpuzpuz_paralle…

f1942f8

…l_group_by

Fix a few tests

876ccdb

hot path optimisation for FastMap

882c780

Merge remote-tracking branch 'origin/puzpuzpuz_parallel_group_by' int…

8bab697

…o puzpuzpuz_parallel_group_by # Conflicts: # core/src/main/java/io/questdb/cairo/map/FastMap.java

parallel group by tests

aea2b91

Fix non-keyed case

745c809

Fix more tests

b982c07

Add separate factory for non-keyed GROUP BY

cc60d9f

Add more tests

64103b8

Steal filter from base factory if possible

a6cba62

Fix a few tests

ea43e37

Merge remote-tracking branch 'upstream/master' into puzpuzpuz_paralle…

d6edbe8

…l_group_by

Don't go parallel when base factory uses an index

872dea2

Fix a few more tests

7635f85

Fix more tests

b48e158

puzpuzpuz added 4 commits December 21, 2023 15:47

Fix EqLong256StrFunctionFactory

21f31e4

Fix more tests

36f5572

Fix even more tests

2f903b7

Add a test case for count_distinct rewrite

61f4d61

ideoma previously approved these changes Dec 21, 2023

Add Alex' optimization for fixed-size merge

51b11a6

puzpuzpuz dismissed ideoma’s stale review via 51b11a6 December 21, 2023 17:19

ideoma reviewed Dec 21, 2023

core/src/main/java/io/questdb/cairo/map/FastMap.java (outdated, resolved)

ideoma reviewed Dec 21, 2023

core/src/main/java/io/questdb/cairo/map/FastMap.java (outdated, resolved)

puzpuzpuz added 15 commits December 22, 2023 09:23

Address review comment

b57b9fd

Revert default shared worker count and reduce queue size

11ab18d

Introduce fast path in FastMapRecord#addressOfColumn()

4662c05

Optimize mergeVarSizeKey

454ac10

Split FastMapRecord into var-size and fixed-size classes

175b367

Simplify code

0d052a3

Fix tests

112feb0

Fix bug in FastMapFixedSizeRecord and more tests around it

63bd026

Revert changes in Numbers

021ae30

Make maven javadoc plugin happy

ccb972c

Revert Unsafe changes

bf2e176

Change offsets to long[]

cbdee70

Remove GroupByFunctionsUpdater dependency from Map

8c8f37e

Initialize merge method ref only once

9d08218

Extend MapValueMergeFunction in GroupByFunctionsUpdater

6a4796b

ideoma approved these changes Dec 22, 2023

ideoma merged commit fc23804 into master Dec 23, 2023

ideoma deleted the puzpuzpuz_parallel_group_by branch December 23, 2023 23:46

puzpuzpuz mentioned this pull request Dec 29, 2023

Parallel execution for ORDER BY + LIMIT N in GROUP BY queries with large dataset #4085

Open

jerrinot mentioned this pull request Jan 5, 2024

chore(core): FastMap to use Robin Hood hashing #4054

Closed

ideoma merged 96 commits into master from puzpuzpuz_parallel_group_by

5 participants
