[GLUTEN-7311][CH] Support grace aggregate algorithm in partial aggregating stages #7322

Merged
baibaichen merged 4 commits into apache:main from bigo-sg:7311_1 on Oct 15, 2024

lgbo-ustc commented Sep 24, 2024

What changes were proposed in this pull request?

Fixes: #7311

For aggregations that involve DISTINCT, use the grace aggregate algorithm in partial aggregating stages instead of the streaming aggregating algorithm.

The following changes are made:

  1. Introduce GraceAggregatingStep. It is based on the code of GraceMergingAggregatedStep and additionally supports generating intermediate results.
  2. Make Aggregator support multiple aggregate phases. The change is in Kyligence/ClickHouse#504, "[GLUTEN-7311] A patch for supporting multiple aggregate phases in one step".
  3. Override AggregateFunctionPartialMerge::getBaseAggregateFunctionWithSameStateRepresentation to resolve the failure of the column-structure equality check in the aggregate phase INTERMEDIATE_TO_INTERMEDIATE.
  4. Keep requiredChildDistributionExpressions in MergeTwoPhasesHashBaseAggregate so that we can determine whether an aggregate step is in complete mode.
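
As a rough illustration of the general grace hash aggregation idea used in a partial stage, here is a minimal Python sketch. It is not the C++ code of GraceAggregatingStep; the function names `grace_partial_aggregate` and `merge_final`, the `num_buckets` parameter, and the use of in-memory lists in place of spilled bucket files are all assumptions made for the example. Input rows are hash-partitioned by group key into buckets, each bucket is aggregated on its own, and the stage emits intermediate (non-finalized) states that a later stage merges.

```python
from collections import defaultdict
from typing import Dict, Hashable, Iterable, Iterator, List, Set, Tuple

Row = Tuple[Hashable, Hashable]          # (group key, value to count distinct)
Partial = Tuple[Hashable, Set[Hashable]]  # (group key, intermediate state)


def grace_partial_aggregate(rows: Iterable[Row], num_buckets: int = 4) -> Iterator[Partial]:
    """Hypothetical partial stage: partition by key hash, then aggregate per bucket."""
    # Phase 1: scatter rows into buckets. A real engine would spill buckets
    # to disk under memory pressure; plain lists stand in for that here.
    buckets: List[List[Row]] = [[] for _ in range(num_buckets)]
    for key, value in rows:
        buckets[hash(key) % num_buckets].append((key, value))

    # Phase 2: aggregate one bucket at a time. All rows of a key live in the
    # same bucket, so each key is emitted exactly once by this stage.
    for bucket in buckets:
        states: Dict[Hashable, Set[Hashable]] = defaultdict(set)
        for key, value in bucket:
            states[key].add(value)
        # Emit the intermediate state (the set), not the final count.
        yield from states.items()


def merge_final(partials: Iterable[Partial]) -> Dict[Hashable, int]:
    """Hypothetical final stage: merge intermediate states, then finalize."""
    merged: Dict[Hashable, Set[Hashable]] = defaultdict(set)
    for key, state in partials:
        merged[key] |= state
    return {key: len(state) for key, state in merged.items()}
```

For example, feeding the outputs of two independent calls to `grace_partial_aggregate` into `merge_final` yields one distinct count per group key, regardless of how the rows were split between the two partial stages.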

How was this patch tested?

unit tests, manual tests

@github-actions

#7311

@github-actions

Run Gluten Clickhouse CI

zzcclp commented Oct 10, 2024

Is there any performance improvement with this PR?

baibaichen left a comment

LGTM

@lgbo-ustc
Contributor Author

Is there any performance improvement with this PR?

It's not for performance

@baibaichen baibaichen merged commit 74c6641 into apache:main Oct 15, 2024
baibaichen added a commit to Kyligence/gluten that referenced this pull request Oct 15, 2024
baibaichen added a commit that referenced this pull request Oct 15, 2024
* [GLUTEN-1632][CH]Daily Update Clickhouse Version (20241015)

* Fix Build due to ClickHouse/ClickHouse#70135

* Resolve conflict with #7322

* gtest skip since plan is changed due to #7395

(cherry picked from commit 94e1837a922d5a092226b195d6c3079d320878cb)

---------

Co-authored-by: kyligence-git
Co-authored-by: Chang Chen

Development

Successfully merging this pull request may close these issues.

[CH] Result diff when set streaming_aggregate_enabled true
