Use background thread pool for distributed sends by azat · Pull Request #10263 · ClickHouse/ClickHouse

azat · 2020-04-14T18:21:33Z

Changelog category (leave one):

Improvement

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):
Use background thread pool (background_schedule_pool_size) for distributed sends

Detailed description / Documentation draft:
After #8756 the problem with background threads for distributed sends became even worse (since a thread is now created per volume).

Fixes: #9551
Refs: #8756

See-also: #10315 (same thing for Buffer engine)

src/Storages/Distributed/DirectoryMonitor.cpp

src/Storages/StorageDistributed.cpp

azat · 2020-04-15T06:23:40Z

test_settings_constraints_distributed/test.py::test_insert_clamps_settings
test_dictionaries_mysql/test.py::test_load_mysql_dictionaries

Fails in upstream/master too

test_settings_constraints_distributed can be fixed with SYSTEM FLUSH DISTRIBUTED

azat · 2020-04-15T07:11:48Z

test_settings_constraints_distributed can be fixed with SYSTEM FLUSH DISTRIBUTED

Added

src/Core/Settings.h

azat · 2020-04-18T07:36:58Z

PVS check — Found 1 new errors, total 20 errors

Does not look related

test_insert_into_distributed/test.py::test_inserts_batching

Uses sleep instead of SYSTEM FLUSH DISTRIBUTED, and I decided to just increase the sleep time (maybe it will handle some corner cases, even though I tried to cover all of them)

alexey-milovidov · 2020-04-18T11:57:39Z

test_inserts_batching is still not fixed.

…ibuted sends

After ClickHouse#8756 the problem with one thread per (distributed table, disk) pair for distributed sends became even worse (since there can be multiple disks), so use a predefined thread pool for these tasks, which can be controlled with the background_distributed_schedule_pool_size knob.
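To illustrate the idea in the commit message above, here is a minimal sketch of a fixed-size pool that runs send tasks, as opposed to one dedicated thread per (distributed table, disk). This is not ClickHouse's actual scheduling code: `SendPool`, `runSends` and the dummy task are hypothetical names made up for the example, and the real pool also supports delayed rescheduling, which is omitted here.

```cpp
#include <cassert>
#include <condition_variable>
#include <cstddef>
#include <functional>
#include <mutex>
#include <queue>
#include <set>
#include <thread>
#include <utility>
#include <vector>

// Hypothetical fixed-size pool: the number of threads is set once and does not
// grow with the number of (table, disk) directories that have data to send.
class SendPool
{
public:
    explicit SendPool(std::size_t pool_size)
    {
        assert(pool_size > 0);
        for (std::size_t i = 0; i < pool_size; ++i)
            workers.emplace_back([this] { workerLoop(); });
    }

    // Lets the workers drain the queue, then joins them.
    ~SendPool()
    {
        {
            std::lock_guard<std::mutex> lock(mutex);
            stopping = true;
        }
        wakeup.notify_all();
        for (auto & worker : workers)
            worker.join();
    }

    void schedule(std::function<void()> task)
    {
        {
            std::lock_guard<std::mutex> lock(mutex);
            tasks.push(std::move(task));
        }
        wakeup.notify_one();
    }

private:
    void workerLoop()
    {
        for (;;)
        {
            std::function<void()> task;
            {
                std::unique_lock<std::mutex> lock(mutex);
                wakeup.wait(lock, [this] { return stopping || !tasks.empty(); });
                if (tasks.empty())
                    return; // stopping was requested and the queue is drained
                task = std::move(tasks.front());
                tasks.pop();
            }
            task(); // run outside the lock so other workers can pick up tasks
        }
    }

    std::mutex mutex;
    std::condition_variable wakeup;
    std::queue<std::function<void()>> tasks;
    bool stopping = false;
    std::vector<std::thread> workers;
};

// Schedules one dummy "send" per directory and reports
// {number of distinct threads that did the work, number of tasks executed}.
std::pair<std::size_t, std::size_t> runSends(std::size_t pool_size, std::size_t directories)
{
    std::mutex seen_mutex;
    std::set<std::thread::id> seen;
    std::size_t done = 0;
    {
        SendPool pool(pool_size);
        for (std::size_t i = 0; i < directories; ++i)
            pool.schedule([&] {
                std::lock_guard<std::mutex> lock(seen_mutex);
                seen.insert(std::this_thread::get_id());
                ++done;
            });
    } // the pool destructor waits for all scheduled tasks
    return {seen.size(), done};
}
```

With a sketch like this, `runSends(16, 1000)` would execute all 1000 tasks on at most 16 threads, whereas the thread-per-directory approach would need 1000 threads.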

azat · 2020-04-19T17:39:04Z

Actually, I am not sure that "pr-improvement" will be enough, since it can be pretty tricky to debug problems with distributed sends, i.e. why they became slower (for a "regular" user). Maybe "backward incompatible" is better?

alexey-milovidov · 2020-04-19T17:41:55Z

It's only relevant when using a huge number of Distributed tables (rare case). And it should not become slower as we have 16 background threads. Data is sent almost as is without any processing on our side, so it will either saturate the network or there are slow peers (that will require additional debugging).

CurrentMetrics::Increment adds the amount to the specified metric only for the lifetime of the object, but this is not the intention, since DistributedFilesToInsert is a gauge and after ClickHouse#10263 the monitor can exit from the callback (and enter it again later; for example, after SYSTEM STOP DISTRIBUTED SEND it will always exit from it, until SYSTEM START DISTRIBUTED SEND). So make Increment a member of the class (this will also fix possible issues with subtracting the value on DROP TABLE).
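The lifetime issue described in this commit message can be sketched as follows. This is a simplified model and not the real CurrentMetrics API: `Gauge`, `ScopedIncrement`, `localIncrementCallback` and `Monitor` are hypothetical names used only for illustration.

```cpp
#include <atomic>
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for a global gauge metric.
using Gauge = std::atomic<std::int64_t>;

// RAII helper: whatever it has added to the gauge is subtracted on destruction.
class ScopedIncrement
{
public:
    explicit ScopedIncrement(Gauge & gauge_, std::int64_t amount_ = 0)
        : gauge(gauge_), amount(amount_)
    {
        gauge += amount;
    }

    ~ScopedIncrement() { gauge -= amount; }

    ScopedIncrement(const ScopedIncrement &) = delete;
    ScopedIncrement & operator=(const ScopedIncrement &) = delete;

    void add(std::int64_t delta)
    {
        gauge += delta;
        amount += delta;
    }

    void sub(std::int64_t delta)
    {
        assert(delta <= amount);
        gauge -= delta;
        amount -= delta;
    }

private:
    Gauge & gauge;
    std::int64_t amount;
};

// Problematic shape: the increment lives only as long as one callback run,
// so the gauge falls back as soon as the callback returns, even though the
// files are still waiting to be sent.
std::int64_t localIncrementCallback(Gauge & gauge, std::int64_t pending_files)
{
    ScopedIncrement increment(gauge, pending_files);
    return gauge.load(); // value observed while the callback is running
}

// Fixed shape: the increment is a member, so the gauge keeps its value between
// callback runs and is reduced only when files are sent or the object is
// destroyed (for example when the table is dropped).
class Monitor
{
public:
    explicit Monitor(Gauge & gauge) : files_to_insert(gauge) {}

    void onFilesFound(std::int64_t n) { files_to_insert.add(n); }
    void onFilesSent(std::int64_t n) { files_to_insert.sub(n); }

private:
    ScopedIncrement files_to_insert;
};
```

In the first shape the gauge reads zero between callback runs; in the second it reflects the pending files until they are sent or the owning object goes away.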

blinkov added the pr-improvement label (Pull request with some product improvements) Apr 14, 2020

azat force-pushed the distributed-send-bg-pool branch 2 times, most recently from 33b97f7 to a40b4a1 Compare April 14, 2020 18:37

alexey-milovidov self-requested a review April 14, 2020 18:57

azat force-pushed the distributed-send-bg-pool branch from a40b4a1 to 19ab04b Compare April 14, 2020 20:38

azat force-pushed the distributed-send-bg-pool branch from 19ab04b to c78803b Compare April 15, 2020 07:00

azat force-pushed the distributed-send-bg-pool branch from c78803b to cb818d7 Compare April 15, 2020 08:12

azat force-pushed the distributed-send-bg-pool branch 2 times, most recently from da37e24 to 5c61f0f Compare April 16, 2020 17:15

azat mentioned this pull request Apr 16, 2020

Use background thread pool for background buffer flushes #10315

Merged

azat force-pushed the distributed-send-bg-pool branch from ba91491 to 633f4d6 Compare April 18, 2020 07:37

azat force-pushed the distributed-send-bg-pool branch from 633f4d6 to 5054322 Compare April 18, 2020 17:28

azat added 2 commits April 19, 2020 00:22

Drop superfluous locking for atomic in DirectoryMonitor

673ddc9

Cleanup 01040_distributed_directory_monitor_batch_inserts

5ffd8bd

azat force-pushed the distributed-send-bg-pool branch from 5054322 to fe4be16 Compare April 18, 2020 21:47

azat added 2 commits April 19, 2020 11:20

Update comment for background_schedule_pool_size

201d5d5

Include info about:
- kafka streaming
- dns cache updates

azat force-pushed the distributed-send-bg-pool branch from fe4be16 to 5d11118 Compare April 19, 2020 09:12

alexey-milovidov merged commit 61d33a8 into ClickHouse:master Apr 19, 2020

azat deleted the distributed-send-bg-pool branch April 20, 2020 07:51

azat added a commit to azat/ClickHouse that referenced this pull request Apr 22, 2020

Add tasks/memory metrics for distributed/buffer schedule pools

d854049

Follow-up-for: ClickHouse#10315 Follow-up-for: ClickHouse#10263

This was referenced Apr 22, 2020

Add tasks/memory metrics for distributed/buffer schedule pools #10449

Merged

Fix distributed send that are scheduled by INSERT query #10486

Merged

alesapin pushed a commit that referenced this pull request Apr 26, 2020

Add tasks/memory metrics for distributed/buffer schedule pools

6dc9908

Follow-up-for: #10315 Follow-up-for: #10263

azat mentioned this pull request Apr 27, 2020

DB::Exception: Cannot schedule a task #10504

Closed

azat mentioned this pull request Aug 26, 2020

Fix DistributedFilesToInsert metric (zeroed when it should not) #14095

Merged

alexey-milovidov mentioned this pull request Dec 31, 2020

ClickHouse can hang during startup when nproc soft limit is low #18669

Closed

alexey-milovidov mentioned this pull request Jun 17, 2021

Huge amount of threads #22159

Closed

alexey-milovidov merged 4 commits into ClickHouse:master from azat:distributed-send-bg-pool
