
Delete parts by replacing them with empty parts #41145

Merged
CheSema merged 40 commits into ClickHouse:master from CheSema:lock-free-drop-partition on Nov 25, 2022

Conversation

@CheSema
Member

@CheSema CheSema commented Sep 9, 2022

The issue:
#33457

Changelog category (leave one):

  • Improvement

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):

This PR changes how the following queries delete parts: TRUNCATE TABLE, ALTER TABLE DROP PART, ALTER TABLE DROP PARTITION. These queries now create empty parts that cover the old parts. As a result, TRUNCATE works without an exclusive lock, which means concurrent reads are not blocked. These queries are also durable now: if a request succeeds, no resurrected parts appear later. Note that atomicity is achieved only within a transaction scope.

Information about CI checks: https://clickhouse.com/docs/en/development/continuous-integration/

@robot-ch-test-poll robot-ch-test-poll added the pr-improvement Pull request with some product improvements label Sep 9, 2022
@den-crane
Contributor

implements #33457
resolves #15742

@tavplubix
Member

We can also create empty covering part on disk for ReplicatedMergeTree in order to fix #37664 (but we don't need to commit it to ZooKeeper and can even add it to the set of parts in Outdated state)

@CheSema CheSema force-pushed the lock-free-drop-partition branch 3 times, most recently from 779ccc3 to 875946b Compare September 14, 2022 21:41
@CheSema CheSema force-pushed the lock-free-drop-partition branch from 875946b to c33c4ec Compare September 14, 2022 22:07
@CheSema
Member Author

CheSema commented Sep 15, 2022

The test 01825_type_json_schema_race_long failed.

I investigated it. At first glance it looks like Object('json') has to be wrapped in a Tuple.

This PR seems to be related: #41290

UPD: the test is fixed with commit
208b65f5a067d14323c06f5c85aa8abfb09f2da1

UPD2:
The previous way to fix JSON was not right: it remembers all old types inside the JSON object forever, but TRUNCATE TABLE should forget the column history of the JSON object.
Here is the right fix, with a test:
593cf4a59848ebce434f874397b9bd787c03c3ec

@CheSema CheSema force-pushed the lock-free-drop-partition branch 3 times, most recently from 6ae9026 to 2e10652 Compare September 19, 2022 09:21
@CheSema CheSema marked this pull request as ready for review September 19, 2022 09:43
@CheSema CheSema force-pushed the lock-free-drop-partition branch 2 times, most recently from ec419d5 to 9a925e6 Compare September 22, 2022 23:25
@CheSema CheSema force-pushed the lock-free-drop-partition branch from f6d78d8 to 9f2c00d Compare November 23, 2022 15:19
@den-crane
Contributor

Hooray. Dreams come true.

@CheSema
Member Author

CheSema commented Dec 12, 2022

> @CheSema, seems like it makes test_merge_tree_hdfs more flaky: https://play.clickhouse.com/play?

This should help with it: #44154

@tavplubix
Member

@CheSema, test_merge_tree_s3/test.py::test_attach_detach_partition is still flaky: link

jawm added a commit to jawm/ClickHouse that referenced this pull request Jun 6, 2023
I saw a server crash with error `Part 20230603_61687_61697_1 intersects
part 20230603_61690_61695_1 (state Active) It is a bug.
(LOGICAL_ERROR)`. This was using v22.12. I can see that since then the
error message was adjusted, but I don't believe the bug itself has been
resolved.

I believe the bug comes from this change (c976b28).
It's associated with this pull request: (ClickHouse#41145).
Specifically changes to the file `src/Storages/MergeTree/MergeTreeData.cpp`
seem maybe incorrect to me, since the logic used when a part is covered
by another part seems to have changed as described below.

The change modifies the function `renameTempPartAndReplaceImpl`, such
that a call to `getPartHierarchy` is used where previously there was a
call to `getActivePartsToReplace`. These two functions have a similar
implementation, but one notable difference -- in the latter, execution
breaks early if a covering part is found over the new part. In our outer
function, there was a condition looking for this covering part, which if
hit, would print the warning and return false.

Without the check, it instead continues, sees that there is an
intersecting part, and crashes with the exception. I believe that the
"intersecting" part would also be considered a "covering" part, as in
this case it covers the block range from before -> after the range of
the new part.

I believe a solution is to simply re-add the condition which checks for
a covering part, and return early in that case. However, my assessment
could be very wrong, I can't say I understand this code all that well. I
also don't really know how I could write a test for this either.

The other relevant detail: this occurred at the moment when I was
running a mutation on a *different* partition of the table. It happened
on two occasions, so I'm pretty sure it's connected to the mutation, although
I don't know why operations in different partitions would affect each
other in this way.
jawm added a commit to jawm/ClickHouse that referenced this pull request Jun 7, 2023 (same commit message as above)
@techkuz
Contributor

techkuz commented Aug 15, 2023

Does DETACH PART also create an empty part? (DETACH is not listed in the affected queries.)

@tavplubix
Member

Yes, DETACH PART[ITION] works the same way as DROP PART[ITION]

@acmeguy

acmeguy commented Jan 10, 2024

This still seems to be an issue when using a volume with an S3 disk and a cache.

Resurrected partitions are driving us crazy.

Any advice?

@filimonov
Contributor

> Resurrected partitions are driving us crazy.
> Any advice?

Try to create a minimal repro and report the issue.
