[release-0.18] Fix race condition in config-manager when label is unset #1577

Merged: elezar merged 1 commit into release-0.18 from backport-1541-to-release-0.18 on Jan 15, 2026

Conversation

github-actions bot commented Jan 7, 2026

🤖 Automated backport of #1541 to release-0.18

✅ Cherry-pick completed successfully with no conflicts.

Original PR: #1541
Original Author: @uristernik

Cherry-picked commits (1):

  • ab05ace Fix race condition in config-manager when label is unset

This backport was automatically created by the backport bot.

When the node label (nvidia.com/device-plugin.config) is not set, a race
condition could cause the config-manager to hang indefinitely on startup.

The issue occurred when the informer's AddFunc fired before the first Get()
call, setting current="" and broadcasting. When Get() was subsequently called,
it found lastRead == current (both empty strings) and waited forever, as no
future events would wake it up.

This fix adds an 'initialized' flag to SyncableConfig to ensure the first
Get() call never waits, regardless of timing. Subsequent Get() calls still
wait properly when the value hasn't changed.

Signed-off-by: Uri Sternik <[email protected]>
(cherry picked from commit ab05ace)
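
For readers without the diff at hand, the behavior described in the commit message above can be sketched roughly as follows. This is a minimal illustration, not the code from #1541: only the names `SyncableConfig`, `Get`, `current`, `lastRead`, and the `initialized` flag come from the commit message, while the constructor, the `Set` method, the mutex and condition-variable layout, and the use of a `for` loop around `Wait` are assumptions.

```go
package main

import (
	"fmt"
	"sync"
)

// SyncableConfig is a simplified stand-in for the type named in the commit
// message. The field layout is assumed for illustration.
type SyncableConfig struct {
	mu          sync.Mutex
	cond        *sync.Cond
	current     string
	lastRead    string
	initialized bool // set once Get has returned at least once
}

// NewSyncableConfig is a hypothetical constructor that wires the condition
// variable to the mutex.
func NewSyncableConfig() *SyncableConfig {
	c := &SyncableConfig{}
	c.cond = sync.NewCond(&c.mu)
	return c
}

// Set records a new value and wakes any goroutine blocked in Get.
func (c *SyncableConfig) Set(value string) {
	c.mu.Lock()
	defer c.mu.Unlock()
	c.current = value
	c.cond.Broadcast()
}

// Get returns immediately on its first call. Later calls block until the
// current value differs from the value last returned.
func (c *SyncableConfig) Get() string {
	c.mu.Lock()
	defer c.mu.Unlock()
	for c.initialized && c.lastRead == c.current {
		c.cond.Wait()
	}
	c.initialized = true
	c.lastRead = c.current
	return c.lastRead
}

func main() {
	c := NewSyncableConfig()

	// Simulate the informer's AddFunc firing before the first Get
	// while the label is unset: current stays "".
	c.Set("")
	fmt.Printf("first Get returned %q\n", c.Get())

	// A later label change wakes the second, blocking Get.
	go c.Set("config-a")
	fmt.Printf("second Get returned %q\n", c.Get())
}
```

Without the `initialized` check, the first `Get` in this sketch would see `lastRead == current` (both empty) and wait on a broadcast that has already happened, which is the hang the commit message describes.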

copy-pr-bot Bot commented Jan 7, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

cdesiniotis (Contributor) commented

/ok to test 09d5135

elezar merged commit b80634e into release-0.18 on Jan 15, 2026
9 checks passed
elezar deleted the backport-1541-to-release-0.18 branch on January 15, 2026, 07:16

3 participants