Implement async-signal-safe lock for DwarfFDECache#34

Merged
al13n321 merged 1 commit into ClickHouse:master from amosbird:deadlock-fix on Dec 21, 2024

Conversation

@amosbird

@amosbird amosbird commented Dec 15, 2024

There are at least three types of stack unwinding in ClickHouse:

  1. Signal-triggered unwinding
     - SIGUSR1: used by the CPU Profiler
     - SIGUSR2: used by the Real Profiler
     - SIGTRIM: used by system.stack_trace
  2. Jemalloc profile unwinding
  3. Exception unwinding

During stack unwinding, libunwind manipulates the DwarfFDECache, which acquires a read-write mutex (RWMutex). This mutex is not async-signal-safe, which can lead to serious issues. In particular, a signal handler might try to acquire the mutex while the current thread is already in the process of unwinding and acquiring the same mutex, potentially corrupting the mutex state and making debugging extremely difficult. A deadlock during stack unwinding typically results in a "stop-the-world" situation.

An intuitive way to solve this problem is to block the relevant signals before unwinding the stack and unblock them afterwards. However, it is almost impossible to unblock them reliably after exception unwinding, since a thrown exception transfers control to a distant catch handler and may never return to the code that blocked the signals.
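
The rejected signal-blocking approach would look roughly like the sketch below (a hypothetical illustration, not code from libunwind or this PR; the function name is made up):

```cpp
#include <csignal>
#include <pthread.h>

// Hypothetical sketch of the rejected approach: mask the profiling signals
// around an unwind. This works for a bounded critical section, but if the
// unwind happens because an exception is propagating, control jumps to a
// catch handler and the restore call below may never execute, leaving the
// signals blocked forever.
void unwind_with_signals_blocked() {
    sigset_t block_set, old_set;
    sigemptyset(&block_set);
    sigaddset(&block_set, SIGUSR1);  // CPU profiler signal
    sigaddset(&block_set, SIGUSR2);  // Real profiler signal
    pthread_sigmask(SIG_BLOCK, &block_set, &old_set);

    // ... perform stack unwinding here ...

    // Skipped entirely if an exception unwinds past this frame.
    pthread_sigmask(SIG_SETMASK, &old_set, nullptr);
}
```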

Another way is to remove the use of the pthread rwlock, which is not async-signal-safe. This PR implements a simple lock-free RWMutex that should be good enough for stack-unwinding purposes. It has been verified across a cluster of over 100 nodes: prior to this change, the system experienced node deadlocks every 3-5 days; after applying the PR, the cluster has been stable for two weeks without any reported issues.
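
A minimal sketch of the kind of lock-free RWMutex described above (illustrative only; the names, fields, and back-off policy here are assumptions, not the PR's actual code):

```cpp
#include <atomic>

// Sketch of an async-signal-safe read-write lock built purely on atomics,
// in the spirit of this PR: no pthread primitives, so re-entering from a
// signal handler cannot corrupt internal mutex state.
struct SpinRWMutex {
    // state == 0: free; state > 0: number of active readers;
    // state == -1: one active writer.
    std::atomic<int> state{0};
    std::atomic<int> waiting_writers{0};

    void lock_shared() {
        while (true) {
            // Let pending writers go first to avoid writer starvation.
            if (waiting_writers.load(std::memory_order_relaxed) == 0) {
                int s = state.load(std::memory_order_relaxed);
                if (s >= 0 && state.compare_exchange_weak(
                                  s, s + 1, std::memory_order_acquire))
                    return;
            }
        }
    }

    void unlock_shared() { state.fetch_sub(1, std::memory_order_release); }

    void lock() {
        waiting_writers.fetch_add(1, std::memory_order_relaxed);
        while (true) {
            int expected = 0;
            if (state.compare_exchange_weak(expected, -1,
                                            std::memory_order_acquire)) {
                waiting_writers.fetch_sub(1, std::memory_order_relaxed);
                return;
            }
        }
    }

    void unlock() { state.store(0, std::memory_order_release); }
};
```

Note that this sketch only spins; a real implementation would still deadlock if a signal handler tried to write-lock while the same thread already holds the lock, which is why only the read path is expected on the signal side.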

Similar issue: ClickHouse/ClickHouse#69904

@al13n321 al13n321 self-assigned this Dec 20, 2024
Member

@al13n321 al13n321 left a comment


Neat, thanks for the fix!

(I suspect the implementation would be slightly simpler and faster if the two counters were packed into one 64-bit atomic, but it probably doesn't matter.)
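
The packing idea floated here could look something like the following (a hypothetical sketch of the reviewer's suggestion, not anything that was actually written):

```cpp
#include <atomic>
#include <cstdint>

// Sketch: keep both counters in a single 64-bit atomic so every state
// transition is one CAS. Low 32 bits: reader count, with an all-ones
// sentinel meaning "writer holds the lock"; high 32 bits: waiting writers.
constexpr uint64_t READERS_MASK = 0xffffffffull;
constexpr uint64_t WRITER_HELD  = READERS_MASK;  // sentinel in the low bits

std::atomic<uint64_t> packed{0};

bool try_lock_shared() {
    uint64_t s = packed.load(std::memory_order_relaxed);
    // Refuse if a writer holds the lock or is waiting for it.
    if ((s & READERS_MASK) == WRITER_HELD || (s >> 32) != 0)
        return false;
    // One CAS observes and updates both counters atomically.
    return packed.compare_exchange_strong(s, s + 1, std::memory_order_acquire);
}

void unlock_shared() { packed.fetch_sub(1, std::memory_order_release); }
```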

while (true) {
    int expected = 0;
    if (atomic_compare_exchange_weak(&state, &expected, -1)) {
        atomic_fetch_sub(&waiting_writers, 1);
Member

@al13n321 al13n321 Dec 21, 2024


Nitpick (EDIT: meh, merging without it):

Suggested change
-    atomic_fetch_sub(&waiting_writers, 1);
+    int current_waiting_writers = atomic_fetch_sub(&waiting_writers, 1);
+    if (current_waiting_writers <= 0) {
+        abort();
+    }

@al13n321 al13n321 merged commit 72fb634 into ClickHouse:master Dec 21, 2024
@hanfei1991
Member

@al13n321 why is this PR not merged to master yet?

@al13n321
Member

Oops, ClickHouse/ClickHouse#76107

@nickitat
Member

> Neat, thanks for the fix!
>
> (I suspect the implementation would be slightly simpler and faster if the two counters were packed into one 64-bit atomic, but it probably doesn't matter.)

It'd probably make the correctness of the implementation more plausible if we kept all state (not just the one variable that happens to be called state :)) atomic and always changed it atomically.

@al13n321
Member

Turns out DwarfFDECache is supposed to be disabled in clickhouse (using _LIBUNWIND_NO_HEAP), so we shouldn't be using this mutex at all: #35

@amosbird
Author

> Turns out DwarfFDECache is supposed to be disabled in clickhouse (using _LIBUNWIND_NO_HEAP), so we shouldn't be using this mutex at all: #35

I originally wanted to delete this cache too, but I'm worried that it might have a performance impact on stack unwinding.
