Use query hash instead of string representation to handle the sample block cache by Algunenano · Pull Request #40065 · ClickHouse/ClickHouse

Algunenano · 2022-08-10T13:18:55Z

Changelog category (leave one):

Performance Improvement

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):

Use query hash instead of string representation to handle the sample block cache

Information about CI checks: https://clickhouse.com/docs/en/development/continuous-integration/

Start pushing to speed up query interpretation, which has been degrading over time. Related to #39996

In this case, change how the sample block cache works to stop using the full query string and instead use the query hash. This is done because the call to getTreeHash is much faster than formatImpl and should be equivalent. In my perf analysis of large queries the query serialization was taking almost 10% of the time:

+    8.88%     0.00%            13  TCPHandler      libclickhouse_parsers.so                             [.] DB::queryToString                                                                                                                              ▒
+    8.87%     0.00%            21  TCPHandler      libclickhouse_parsers.so                             [.] DB::serializeAST

And once implemented I see an almost 10% improvement.

MASTER:

- localhost:9000, queries 200, QPS: 8.331, RPS: 0.000, MiB/s: 0.000, result RPS: 8.331, result MiB/s: 0.003.
- localhost:9000, queries 200, QPS: 8.263, RPS: 0.000, MiB/s: 0.000, result RPS: 8.263, result MiB/s: 0.003.

CHANGES:

- localhost:9000, queries 200, QPS: 8.888, RPS: 0.000, MiB/s: 0.000, result RPS: 8.888, result MiB/s: 0.003.
- localhost:9000, queries 200, QPS: 9.019, RPS: 0.000, MiB/s: 0.000, result RPS: 9.019, result MiB/s: 0.003.

I also added a couple of profile events because that's better than adding logs.

Marking it as draft for now because although I expect this to be equivalent I'm not 100% (almost but not 100%) confident all operations done during query interpretation will be properly reflected in the query hash. Let's see what the test think.

Algunenano · 2022-08-10T15:11:26Z

I had to add the alias to the hash because it was detecting queries as equal that aren't.

A sample of something that breaks (in current code) because of the hash not including aliases is the scalar subquery cache:

SELECT
    (
        SELECT
            1 AS number,
            number
        FROM numbers(1)
    ) AS s,
    (
        SELECT
            1,
            number
        FROM numbers(1)
    ) AS s2

22.7.2.15

┌─s─────┬─s2────┐
│ (1,1) │ (1,1) │
└───────┴───────┘

With these changes:

┌─s─────┬─s2────┐
│ (1,1) │ (1,0) │
└───────┴───────┘

The new one is the correct one, the old one is incorrectly reusing the first scalar result. It's a really odd corner case, but I guess it could happen in real queries.

Algunenano · 2022-08-10T17:28:42Z

Now ASTSubquery hash includes the alias which we probably don't want, specially for scalar caches and so on. I'll look into it and maybe remove it for this case only. Needs more thought in any case.

alexey-milovidov · 2022-08-10T18:07:55Z

Do we still need the sample block cache?

Algunenano · 2022-08-10T19:30:58Z

Do we still need the sample block cache?

The query from the benchmark:

With the cache and the changes: ~9 QPS ('SampleBlockCacheHit':256,'SampleBlockCacheMiss':15)
Without the cache: 0.078 QPS, that is 115x worse

So I'd say yes, at least for now.

…block cache

Algunenano · 2022-08-11T15:19:18Z

It seems there are several other places that relied on the alias not being part of the hash, so this needs more work (probably after I come back from holidays)

…ple_block_cache

Algunenano · 2022-08-23T14:38:40Z

While looking at the failing tests I've detected some other queries that were working incorrectly because the alias was ignored as part of the node hash. I've added some tests for those queries.

The problem with the failed tests arose from, you guessed it, introducing the alias in the hash, as some parts of the code were implicitly relying on that behaviour. To workaround this I've added some helper methods to calculate other hashes were needed: get the hash of the contents of a subquery (ignoring its alias, id or cte name) to prepare and cache sets, and ignore the alias of functions when processing and optimizing queries based on constraints. I'm thinking that maybe the alias shouldn't be part of the node and be a node on itself, but it's likely better to delay massive changes around that until #23194 is clearer.

In any case, the performance seems still good: master (8.454 QPS) vs PR (9.552 QPS)

Algunenano · 2022-08-24T09:59:27Z

Performance improvement seems like the expected 10%:

There are other performance changes but they seem unrelated

…ple_block_cache

Algunenano · 2022-09-06T09:59:49Z

Although the query analyzer is being worked on, I still think this PR is worth including: it improves the performance of the sample block cache (and that is not going away I think) and fixes several bugs in the query interpretation.

alexey-milovidov · 2022-09-30T23:36:00Z

@kitaisreal said that Sample Block Cache is entirely unneeded if Analyzer is used.

Ok, let's continue on this PR, but I did not review it yet...

kitaisreal · 2022-10-03T10:41:32Z

@Algunenano no need to continue this pr.
Sample Block cache is only necessary because we have mess and create Interpreters recursively during analyze multiple times. In Analyzer we analyze query only once, and we do not use AST at all.
Methods getContentHash, getTreeHashWithoutAlias, getTreeHash are hard to understand. I expect that there could be complex bugs with distributed processing and aliases that are not covered by our CI/CD.
Profile events SampleBlockCacheHit should not exists because client should not know anything about SampleBlockCache. Client should not know anything about SampleBlock either.

Algunenano · 2022-10-03T11:12:55Z

Sample Block cache is only necessary because we have mess and create Interpreters recursively during analyze multiple times. In Analyzer we analyze query only once, and we do not use AST at all.

Great!

Profile events SampleBlockCacheHit should not exists because client should not know anything about SampleBlockCache. Client should not know anything about SampleBlock either.

I don't consider profile events are not only for clients, but also for debugging. Clients doesn't need to know anything about ZK either, but we do when investigating the behaviour of the query.

Anyway, I'm closing the PR and I'll keep (already were) an eye on the analyzer PR to see that the issues found here are fixed by the analyzer too.

alexey-milovidov · 2022-10-04T02:46:12Z

@Algunenano thank you!
We need more eyes on the analyzer PR. It is almost ready for review.

robot-ch-test-poll1 added the pr-performance Pull request with some performance improvements label Aug 10, 2022

Algunenano mentioned this pull request Aug 10, 2022

Don't visit the AST for UDFs if none are registered #40069

Merged

Algunenano added 3 commits August 11, 2022 15:29

Add perf test

934f426

Use query hash instead of string representation to handle the sample …

ca50e4c

…block cache

Include node alias in its hash

8afd571

Algunenano force-pushed the interpretation_sample_block_cache branch from e3ebdf3 to 8afd571 Compare August 11, 2022 13:44

Algunenano mentioned this pull request Aug 11, 2022

Reduce the number of clone operations during query interpretation #40132

Closed

Algunenano added 3 commits August 23, 2022 11:12

Merge remote-tracking branch 'blessed/master' into interpretation_sam…

617b5a5

…ple_block_cache

Only consider subquery contents when preparing and comparing sets

cc8cf6e

Ignore function aliases when optimizing using constraints

928abe3

Supress clang-tidy warning

487982a

Algunenano marked this pull request as ready for review August 24, 2022 09:59

Algunenano added 5 commits August 24, 2022 17:26

Move test to the proper file

67efb66

Merge remote-tracking branch 'blessed/master' into interpretation_sam…

7fa10c9

…ple_block_cache

Merge remote-tracking branch 'blessed/master' into interpretation_sam…

5de8979

…ple_block_cache

Better wording

92c0179

Merge remote-tracking branch 'blessed/master' into interpretation_sam…

95534a9

…ple_block_cache

Merge branch 'master' into interpretation_sample_block_cache

c6448d7

Algunenano closed this Oct 3, 2022

Algunenano mentioned this pull request Oct 7, 2022

Added Analyzer, Planner #31796

Merged

Algunenano mentioned this pull request Nov 25, 2022

Support scalar subqueries cache #43640

Merged

Algunenano mentioned this pull request Nov 9, 2023

Fix handling of aliases in query cache #56545

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use query hash instead of string representation to handle the sample block cache#40065

Use query hash instead of string representation to handle the sample block cache#40065
Algunenano wants to merge 13 commits intoClickHouse:masterfrom
Algunenano:interpretation_sample_block_cache

Algunenano commented Aug 10, 2022

Uh oh!

Algunenano commented Aug 10, 2022

Uh oh!

Algunenano commented Aug 10, 2022

Uh oh!

alexey-milovidov commented Aug 10, 2022

Uh oh!

Algunenano commented Aug 10, 2022

Uh oh!

Algunenano commented Aug 11, 2022

Uh oh!

Algunenano commented Aug 23, 2022

Uh oh!

Algunenano commented Aug 24, 2022

Uh oh!

Algunenano commented Sep 6, 2022

Uh oh!

alexey-milovidov commented Sep 30, 2022

Uh oh!

kitaisreal commented Oct 3, 2022

Uh oh!

Algunenano commented Oct 3, 2022

Uh oh!

alexey-milovidov commented Oct 4, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

Algunenano commented Aug 10, 2022

Changelog category (leave one):

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):

Uh oh!

Algunenano commented Aug 10, 2022

Uh oh!

Algunenano commented Aug 10, 2022

Uh oh!

alexey-milovidov commented Aug 10, 2022

Uh oh!

Algunenano commented Aug 10, 2022

Uh oh!

Algunenano commented Aug 11, 2022

Uh oh!

Algunenano commented Aug 23, 2022

Uh oh!

Algunenano commented Aug 24, 2022

Uh oh!

Algunenano commented Sep 6, 2022

Uh oh!

alexey-milovidov commented Sep 30, 2022

Uh oh!

kitaisreal commented Oct 3, 2022

Uh oh!

Algunenano commented Oct 3, 2022

Uh oh!

alexey-milovidov commented Oct 4, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants