
improvement: platfrm-184 update bullmq version for jobScheduler #5723

Merged
PrestigePvP merged 7 commits into main from tre/platfrm-184-update-bullmq-scheduler
Mar 19, 2026

Conversation

@PrestigePvP
Contributor

No description provided.

@linear

linear bot commented Mar 16, 2026

@maidul98
Collaborator

maidul98 commented Mar 16, 2026

Snyk checks have passed. No issues have been found so far.

| Status | Scan Engine | Critical | High | Medium | Low | Total |
| --- | --- | --- | --- | --- | --- | --- |
| Passed | Open Source Security | 0 | 0 | 0 | 0 | 0 issues |

💻 Catch issues earlier using the plugins for VS Code, JetBrains IDEs, Visual Studio, and Eclipse.

@greptile-apps
Contributor

greptile-apps bot commented Mar 16, 2026

Greptile Summary

This PR upgrades BullMQ from ^5.4.2 to ^5.67.3 and migrates all recurring queue jobs from the legacy repeatable jobs API (queue.add with repeat option) to the newer upsertJobScheduler API. This addresses uncapped Redis memory usage caused by the old repeatable jobs implementation, which accumulated Redis keys over time. The PR also adds three new methods to the queue service (upsertJobScheduler, removeJobScheduler, getJobSchedulers), marks the old APIs as @deprecated, and improves type safety by using Partial<Record> for queue/worker containers with optional chaining.
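The shape of the API change can be sketched as follows. This is an illustrative sketch, not code from the PR; the queue name, job name, payload, and cron pattern are assumptions.

```typescript
import { Queue } from "bullmq";

const queue = new Queue("telemetry", {
  connection: { host: "localhost", port: 6379 },
});

// Legacy repeatable-jobs API: each distinct repeat configuration creates its
// own repeat key in Redis, so changed options accumulate stale keys over time.
await queue.add(
  "instance-stats",
  { instanceId: "self" },
  { repeat: { pattern: "0 0 * * *" } }
);

// Job-scheduler API (BullMQ >= 5.16): upsert is keyed by the scheduler id, so
// re-running setup with new options replaces the scheduler instead of
// accumulating new repeat keys.
await queue.upsertJobScheduler(
  "instance-stats",                                     // scheduler id
  { pattern: "0 0 * * *" },                             // repeat options
  { name: "instance-stats", data: { instanceId: "self" } } // job template
);
```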

  • Migration gap — legacy repeatable jobs not cleaned up: The old code explicitly called stopRepeatableJob before setting up schedules to remove stale repeatable job configs from Redis. These cleanup calls are removed without replacement. Since BullMQ's job schedulers and legacy repeatable jobs are independent data structures in Redis, any existing legacy repeatable jobs will continue to fire alongside the new schedulers after deployment, causing duplicate job executions for every affected queue (~15 queues). A one-time cleanup step should be added.
  • Dynamic secret lease queue: Correctly migrated stopRepeatableJobByJobId → stopJobById for delayed (non-repeatable) jobs, and removed a redundant duplicate call.
  • Telemetry queue: Previously, stopRepeatableJob was called unconditionally (even when postHog was undefined) to ensure old jobs were always cleaned up. The new code only calls upsertJobScheduler inside the if (postHog) block, so legacy jobs on instances without PostHog will never be cleaned.
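A one-time cleanup step along the lines suggested above might look like this. This is a hypothetical sketch (the helper name is invented, not from the PR); it relies on BullMQ's `getRepeatableJobs` and `removeRepeatableByKey`, which operate on the legacy repeat data structures only.

```typescript
import { Queue } from "bullmq";

// Hypothetical one-time migration: remove every legacy repeatable-job
// configuration so that only the new job schedulers fire after the upgrade.
async function removeLegacyRepeatableJobs(queue: Queue): Promise<void> {
  const repeatableJobs = await queue.getRepeatableJobs();
  for (const job of repeatableJobs) {
    // removeRepeatableByKey deletes the repeat configuration; any job the
    // config already enqueued must be removed separately (see the review
    // discussion in this PR about leftover jobs blocking the new scheduler).
    await queue.removeRepeatableByKey(job.key);
  }
}
```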

Confidence Score: 2/5

  • This PR risks duplicate job executions on any deployment upgrading from the previous version with existing Redis state.
  • The migration from legacy repeatable jobs to job schedulers is well-structured, but the removal of all stopRepeatableJob cleanup calls without a replacement migration mechanism means existing deployments will have both legacy repeatable jobs AND new scheduler jobs firing simultaneously. This affects ~15 queues including critical ones like certificate rotation, secret rotation, telemetry, and resource cleanup. The fix is straightforward (add a legacy cleanup step), but without it the PR introduces a high-impact regression on upgrades.
  • Pay close attention to backend/src/queue/queue-service.ts (missing legacy job cleanup in upsertJobScheduler) and backend/src/services/telemetry/telemetry-queue.ts (conditional cleanup path change).

Important Files Changed

| Filename | Overview |
| --- | --- |
| backend/src/queue/queue-service.ts | Core queue service: adds new upsertJobScheduler, removeJobScheduler, and getJobSchedulers wrappers and converts the internal reconciliation cron to use job schedulers. Changes container types to Partial<Record> with optional chaining. Missing cleanup of legacy repeatable jobs risks duplicate executions on upgrades. |
| backend/package.json | BullMQ version bump from ^5.4.2 to ^5.67.3 to gain job scheduler API support and fix uncapped Redis memory growth from legacy repeatable jobs. |
| backend/src/ee/services/dynamic-secret-lease/dynamic-secret-lease-queue.ts | Correctly changed stopRepeatableJobByJobId to stopJobById for delayed (non-repeatable) revocation jobs. Removed a redundant duplicate stopRepeatableJobByJobId call. |
| backend/src/services/certificate-authority/certificate-authority-queue.ts | Migrated CA CRL rotation from legacy repeatable jobs (with explicit stopRepeatableJob cleanup) to upsertJobScheduler. Old cleanup removed; no replacement for legacy jobs still in Redis. |
| backend/src/services/resource-cleanup/resource-cleanup-queue.ts | Migrated both daily and frequent resource cleanup from legacy repeatable jobs to upsertJobScheduler. Previous stopRepeatableJob cleanup removed without replacement. |
| backend/src/services/telemetry/telemetry-queue.ts | Migrated telemetry instance stats and aggregated events from legacy repeatable jobs to upsertJobScheduler. Previous stopRepeatableJob calls (which ran even when postHog was disabled) removed without replacement. |

Last reviewed commit: 402ece2

@PrestigePvP PrestigePvP force-pushed the tre/platfrm-184-update-bullmq-scheduler branch from c9f14ff to a646e81 on March 17, 2026 16:09
Contributor

@victorvhs017 victorvhs017 left a comment


When I run the app, I get:

[screenshot of the error]

You can replicate by creating a secret rotation on main and then checking out this branch.

The issue here is that even though we stop the repeatable job creator, it may have already created the next job and put it in the queue.

And because our job scheduler uses the same id for its jobs, it won't be able to produce the next job while that leftover job remains unexecuted.

I see two options here:

  • Remove the repeatable jobs (what you've done) AND any job created by it from the queue
  • Change the id for the scheduler jobs. This would fix the issue with one drawback: the next job would be executed twice (the remaining job from the repeatable jobs producer + the new job from the new scheduler).

Looking at the queues, it shouldn't be an issue to execute these jobs twice instead of once. The biggest impact would be one duplicated email and a couple of duplicated notifications. But it's worth testing that the old job id is completely gone from Redis afterwards.
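The first option above (removing the repeatable configs and the jobs they already enqueued) could be sketched like this. This is a hypothetical helper, not code from the PR; the assumption that leftover jobs can be matched by a job-id prefix is illustrative and would need verifying against the actual ids in Redis.

```typescript
import { Queue } from "bullmq";

// Hypothetical sketch: after removing the legacy repeat configs, also drain
// any jobs they already enqueued, so the scheduler's fixed job id is free
// and the new job scheduler can produce its next job.
async function removeLeftoverRepeatableJobs(
  queue: Queue,
  jobIdPrefix: string // assumed prefix shared by the old jobs' ids
): Promise<void> {
  const pending = await queue.getJobs(["delayed", "waiting"]);
  for (const job of pending) {
    if (job.id?.startsWith(jobIdPrefix)) {
      await job.remove();
    }
  }
}
```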

@PrestigePvP PrestigePvP force-pushed the tre/platfrm-184-update-bullmq-scheduler branch from c378b95 to 45a5fd9 on March 18, 2026 22:09
@PrestigePvP PrestigePvP merged commit 3fef3a1 into main Mar 19, 2026
10 checks passed
