fix(go/adbc/driver/snowflake): fix potential deadlocks in reader #3870

zeroshade · 2026-01-08T16:10:47Z

Fix Critical Deadlocks and Race Conditions in Snowflake Record Reader

This PR fixes several critical concurrency bugs in the Snowflake driver's recordReader that could hang the application completely when goroutines race under ordinary use.

Issues Fixed

1. Critical Deadlock: Release() Blocking Forever

Problem: When Release() was called while producer goroutines were blocked on channel sends, a permanent deadlock occurred:

Release() cancels context and attempts to drain channels
Producer goroutines blocked on ch <- rec cannot see the cancellation
Channels never close because producers never exit
Release() blocks forever on for rec := range ch

Fix: Added a done channel that signals when all producer goroutines have completed. Release() now waits for this signal before attempting to drain channels.
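The shape of this fix can be sketched in isolation. The following is a minimal model, not the driver's code: `record`, `counters`, `newReader`, and `demoRelease` are hypothetical stand-ins (with `record` replacing `arrow.RecordBatch`), and it assumes producers use context-aware sends so they can exit once the context is cancelled.

```go
package main

import (
	"context"
	"sync"
	"sync/atomic"
)

// counters tracks how many stand-in records were created and released.
type counters struct{ created, released atomic.Int64 }

// record is a hypothetical stand-in for arrow.RecordBatch.
type record struct{ c *counters }

func (r record) Release() { r.c.released.Add(1) }

// reader is a simplified, hypothetical model of the driver's reader.
type reader struct {
	chs    []chan record
	cancel context.CancelFunc
	done   chan struct{} // closed once every producer has returned
}

// newReader starts one producer per channel; each tries to send perProducer records.
func newReader(producers, perProducer, bufferSize int, c *counters) *reader {
	ctx, cancel := context.WithCancel(context.Background())
	r := &reader{
		chs:    make([]chan record, producers),
		cancel: cancel,
		done:   make(chan struct{}),
	}
	for i := range r.chs {
		r.chs[i] = make(chan record, bufferSize)
	}
	var wg sync.WaitGroup
	for _, ch := range r.chs {
		wg.Add(1)
		go func(ch chan record) {
			defer wg.Done()
			defer close(ch) // runs before wg.Done, so the drain loop in Release terminates
			for i := 0; i < perProducer; i++ {
				c.created.Add(1)
				rec := record{c: c}
				select {
				case ch <- rec:
				case <-ctx.Done():
					rec.Release() // never handed off, so release it here
					return
				}
			}
		}(ch)
	}
	go func() {
		wg.Wait()
		close(r.done) // signal: all producers have exited
	}()
	return r
}

// Release cancels the producers, waits for all of them to exit, and only
// then drains whatever is still buffered.
func (r *reader) Release() {
	r.cancel()
	<-r.done
	for _, ch := range r.chs {
		for rec := range ch {
			rec.Release()
		}
	}
}

// demoRelease releases a reader without ever consuming from it.
func demoRelease() (created, released int64) {
	var c counters
	r := newReader(3, 2, 1, &c)
	r.Release()
	return c.created.Load(), c.released.Load()
}
```

In this model `demoRelease` returns matching counts: every record that was created is released either by the producer that failed to send it or by the drain loop, and `Release()` cannot start draining while a producer is still running.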

2. Severe Deadlock: Non-Context-Aware Channel Sends

Problem: Channel send operations at lines 694 and 732 checked context before the send but not during:

for rr.Next() && ctx.Err() == nil {  // Context checked here
    // ... 
    ch <- rec  // But send blocks here without checking context
}

Fix: Wrapped all channel sends in select statements with context awareness:

select {
case chs[0] <- rec:
    // Successfully sent
case <-ctx.Done():
    rec.Release()
    return ctx.Err()
}
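The same pattern can be factored into a small helper and exercised on its own. This is only an illustrative sketch: `sendCtx` and `demoBlockedSend` are hypothetical names, not functions in the driver, and the caller remains responsible for releasing the value when the send is abandoned.

```go
package main

import "context"

// sendCtx sends v on ch unless ctx is cancelled first.
// It returns ctx.Err() when the send was abandoned.
func sendCtx[T any](ctx context.Context, ch chan<- T, v T) error {
	select {
	case ch <- v:
		return nil
	case <-ctx.Done():
		return ctx.Err()
	}
}

// demoBlockedSend starts a producer whose send can never succeed and
// shows that cancellation still lets it return.
func demoBlockedSend() error {
	ctx, cancel := context.WithCancel(context.Background())
	ch := make(chan int) // unbuffered and never read: a plain send would block forever
	errc := make(chan error, 1)
	go func() { errc <- sendCtx(ctx, ch, 42) }()
	cancel()
	return <-errc
}
```

Note that when the channel has room and the context is already cancelled, `select` may pick either case, so a consumer-side drain is still needed for values that were sent after cancellation.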

3. Critical Race Condition: Nil Channel Reads

Problem: Channels were created asynchronously in goroutines after newRecordReader returned. If Next() was called soon after creation, it could receive from a channel that was still nil, and a receive from a nil channel blocks forever.

Fix: Initialize all channels upfront before starting any goroutines:

chs := make([]chan arrow.RecordBatch, len(batches))
for i := range chs {
    chs[i] = make(chan arrow.RecordBatch, bufferSize)
}
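Why a nil slot is dangerous can be shown with a small standalone experiment. The helper names below (`blocks`, `demoNilChannel`) are hypothetical and the timeouts are arbitrary; the sketch only relies on the Go rule that a receive from a nil channel never completes.

```go
package main

import "time"

// blocks reports whether a receive from ch is still pending after d.
func blocks(ch <-chan int, d time.Duration) bool {
	select {
	case <-ch:
		return false
	case <-time.After(d):
		return true
	}
}

// demoNilChannel compares a receive from a nil slot with a receive from a
// slot that was initialized before its producer started.
func demoNilChannel() (nilBlocks, readyBlocks bool) {
	chs := make([]chan int, 2) // slots are nil until assigned
	nilBlocks = blocks(chs[1], 50*time.Millisecond)

	for i := range chs { // upfront initialization, as in the fix
		chs[i] = make(chan int, 1)
	}
	go func() { chs[1] <- 7 }() // producer starts only after initialization
	readyBlocks = blocks(chs[1], time.Second)
	return nilBlocks, readyBlocks
}
```

Here the first receive times out because the slot is nil, while the second completes once the producer sends. Creating the channels before any goroutine starts also avoids the data race of one goroutine writing a slice slot while another reads it.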

4. Goroutine Leaks on Initialization Errors

Problem: Error paths only cleaned up the first channel, potentially leaking goroutines if initialization failed after starting concurrent operations.

Fix: Moved all error-prone initialization (GetStream, NewReader) before goroutine creation, and added proper cleanup on errors.
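The ordering idea can be illustrated generically. The sketch below is an assumption-laden model rather than the driver's code: `openFunc` stands in for a fallible step such as GetStream, and `trackedStream`, `startReaders`, and `demoFailedInit` are hypothetical names introduced only for this example.

```go
package main

import (
	"errors"
	"io"
	"strings"
)

// openFunc stands in for a fallible initialization step (hypothetical).
type openFunc func(i int) (io.ReadCloser, error)

// trackedStream counts Close calls so the demo can verify cleanup.
type trackedStream struct {
	io.Reader
	closed *int
}

func (s trackedStream) Close() error {
	*s.closed++
	return nil
}

// startReaders opens every stream before launching any goroutine. If an open
// fails, it closes the streams opened so far and returns without having
// started anything that would later need to be stopped.
func startReaders(n int, open openFunc, consume func(io.ReadCloser)) (int, error) {
	streams := make([]io.ReadCloser, 0, n)
	for i := 0; i < n; i++ {
		s, err := open(i)
		if err != nil {
			for _, opened := range streams {
				opened.Close()
			}
			return 0, err
		}
		streams = append(streams, s)
	}
	for _, s := range streams {
		go consume(s) // goroutines start only after all fallible work succeeded
	}
	return len(streams), nil
}

// demoFailedInit makes the third open fail and reports what was started and cleaned up.
func demoFailedInit() (started, closed int, err error) {
	open := func(i int) (io.ReadCloser, error) {
		if i == 2 {
			return nil, errors.New("open failed")
		}
		return trackedStream{Reader: strings.NewReader("data"), closed: &closed}, nil
	}
	started, err = startReaders(3, open, func(rc io.ReadCloser) { rc.Close() })
	return started, closed, err
}
```

With the failure placed on the third open, no goroutine is started and both previously opened streams are closed, so the error path has nothing left to leak.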

Changes

Added done channel to reader struct to signal goroutine completion
Initialize all channels upfront to eliminate race conditions
Use context-aware sends with select statements for all channel operations
Update Release() to wait on done channel before draining
Reorganize initialization to handle errors before starting goroutines
Signal completion by closing done channel after all producers finish

Reproduction Scenarios Prevented

Deadlock:

bufferSize = 1, producer generates 2 records quickly
Channel becomes full after first record
Producer blocks on send
Consumer calls Release() before Next()
Without fix: permanent deadlock
With fix: producer responds to cancellation, Release() completes

Race Condition:

Query returns 3 batches
First batch processes quickly
Next() advances to second channel
Without fix: reads from nil channel, blocks forever
With fix: channel already initialized, works correctly

See #3730

davidhcoe · 2026-01-08T16:30:41Z

Will this get ported to the Foundry as well, @zeroshade?

zeroshade · 2026-01-08T16:42:44Z

@davidhcoe Yes, this will get ported to the foundry. Once we complete the shift to the foundry, we won't be filing PRs here anymore and all development will move there.

bneijt · 2026-01-09T15:06:19Z

Seems like this branch fixes the deadlock issue I was experiencing in #3730. Thank you for looking into this!

zeroshade · 2026-01-09T21:05:55Z

@davidhcoe see adbc-drivers/snowflake#60 for the foundry port


fix(go/adbc/driver/snowflake): fix potential deadlocks in reader

f5d92da

zeroshade requested review from kou and lidavidm January 8, 2026 16:10

github-actions bot added this to the ADBC Libraries 22 milestone Jan 8, 2026

zeroshade mentioned this pull request Jan 8, 2026

[Python/Snowflake] Connection sometimes hanging #3730

Closed

fix lint

035f728

lidavidm modified the milestones: ADBC Libraries 22, ADBC Libraries 23 Jan 9, 2026

lidavidm approved these changes Jan 9, 2026

zeroshade merged commit e9e5c5c into apache:main Jan 9, 2026
68 of 69 checks passed

lidavidm mentioned this pull request Jan 10, 2026

fix(go): fix potential deadlocks in reader adbc-drivers/snowflake#60

Merged
