handle stale deployment records on redeploy (409 Conflict) by peterj · Pull Request #213 · agentregistry-dev/agentregistry

peterj · 2026-02-25T01:34:27Z

Description

Fix 409 Conflict error when redeploying an agent or MCP server whose
runtime resources were removed externally but whose database record
was not cleaned up
Add cleanupExistingDeployment helper that removes stale DB records
and attempts Kubernetes resource cleanup (non-fatal) before retrying
the deployment insert
Remove stray fmt.Println debug statement from DeployServer

Change Type

/kind fix

Changelog

NONE

Additional Notes

Root cause

The deployments table uses PRIMARY KEY (server_name, version). When
runtime resources (e.g. Kubernetes pods) are deleted externally—via
kubectl, namespace cleanup, or failed reconciliation—the corresponding
database record is not removed. Subsequent deploy attempts hit the
unique constraint, returning a 409 Conflict even though no actual
instance exists.

The error path is:
CreateDeployment INSERT → PG error 23505 → ErrAlreadyExists → huma.Error409Conflict

ReconcileAll cannot fix this because it runs after the INSERT
succeeds—it never gets a chance to execute when the INSERT itself fails.

Fix

In DeployServer and DeployAgent, when CreateDeployment returns
ErrAlreadyExists:

Look up the existing deployment record
Attempt Kubernetes resource cleanup (non-fatal, since resources may
already be gone)
Remove the stale DB record
Retry CreateDeployment
Proceed with ReconcileAll to create fresh runtime resources

Signed-off-by: Peter Jausovec <[email protected]>

…stry-dev#213) # Description - Fix 409 Conflict error when redeploying an agent or MCP server whose runtime resources were removed externally but whose database record was not cleaned up - Add `cleanupExistingDeployment` helper that removes stale DB records and attempts Kubernetes resource cleanup (non-fatal) before retrying the deployment insert - Remove stray `fmt.Println` debug statement from `DeployServer` # Change Type ``` /kind fix ``` # Changelog ```release-note NONE ``` # Additional Notes ## Root cause The `deployments` table uses `PRIMARY KEY (server_name, version)`. When runtime resources (e.g. Kubernetes pods) are deleted externally—via `kubectl`, namespace cleanup, or failed reconciliation—the corresponding database record is not removed. Subsequent deploy attempts hit the unique constraint, returning a 409 Conflict even though no actual instance exists. The error path is: `CreateDeployment` INSERT → PG error 23505 → `ErrAlreadyExists` → `huma.Error409Conflict` `ReconcileAll` cannot fix this because it runs *after* the INSERT succeeds—it never gets a chance to execute when the INSERT itself fails. ## Fix In `DeployServer` and `DeployAgent`, when `CreateDeployment` returns `ErrAlreadyExists`: 1. Look up the existing deployment record 2. Attempt Kubernetes resource cleanup (non-fatal, since resources may already be gone) 3. Remove the stale DB record 4. Retry `CreateDeployment` 5. Proceed with `ReconcileAll` to create fresh runtime resources --------- Signed-off-by: Peter Jausovec <[email protected]>

handle stale deployment records on redeploy (409 Conflict)

3e5e9c5

Signed-off-by: Peter Jausovec <[email protected]>

github-actions bot added kind/fix release-note-none labels Feb 25, 2026

fix lint issues

3877818

Signed-off-by: Peter Jausovec <[email protected]>

timflannagan mentioned this pull request Feb 26, 2026

Deleting an agent leaves behind orphaned runtime resources #228

Closed

timflannagan approved these changes Feb 27, 2026

View reviewed changes

peterj merged commit 0ac477b into main Feb 27, 2026
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

handle stale deployment records on redeploy (409 Conflict)#213

handle stale deployment records on redeploy (409 Conflict)#213
peterj merged 2 commits intomainfrom
peterj/fix409

peterj commented Feb 25, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

peterj commented Feb 25, 2026

Description

Change Type

Changelog

Additional Notes

Root cause

Fix

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants