Skip to content

[CHORE](k8s) increase dev CPU limits from 100m to 200-300m#6435

Merged
rescrv merged 7 commits intomainfrom
rescrv/more-resources
Feb 13, 2026
Merged

[CHORE](k8s) increase dev CPU limits from 100m to 200-300m#6435
rescrv merged 7 commits intomainfrom
rescrv/more-resources

Conversation

@rescrv
Copy link
Copy Markdown
Contributor

@rescrv rescrv commented Feb 13, 2026

Description of changes

Raise CPU resource limits and requests from 100m to 300m for all
services in both values.dev.yaml and values2.dev.yaml. The previous
100m limits were too restrictive for dev workloads, causing CPU
throttling.

Affected services: sysdb, rustFrontendService, queryService,
compactionService, rustLogService, garbageCollector, rustSysdbService,
and rustSysdbMigration.

Test plan

CI

Migration plan

N/A

Observability plan

CI goes green

Documentation Changes

N/A

Co-authored-by: AI

Raise CPU resource limits and requests from 100m to 500m for all
services in both values.dev.yaml and values2.dev.yaml. The previous
100m limits were too restrictive for dev workloads, causing CPU
throttling.

Affected services: sysdb, rustFrontendService, queryService,
compactionService, rustLogService, garbageCollector, rustSysdbService,
and rustSysdbMigration.

Co-authored-by: AI
@github-actions
Copy link
Copy Markdown

Reviewer Checklist

Please leverage this checklist to ensure your code review is thorough before approving

Testing, Bugs, Errors, Logs, Documentation

  • Can you think of any use case in which the code does not behave as intended? Have they been tested?
  • Can you think of any inputs or external events that could break the code? Is user input validated and safe? Have they been tested?
  • If appropriate, are there adequate property based tests?
  • If appropriate, are there adequate unit tests?
  • Should any logging, debugging, tracing information be added or removed?
  • Are error messages user-friendly?
  • Have all documentation changes needed been made?
  • Have all non-obvious changes been commented?

System Compatibility

  • Are there any potential impacts on other parts of the system or backward compatibility?
  • Does this change intersect with any items on our roadmap, and if so, is there a plan for fitting them together?

Quality

  • Is this code of a unexpectedly high quality (Readability, Modularity, Intuitiveness)

@github-actions
Copy link
Copy Markdown

⚠️ The Helm chart was updated without a version bump. Your changes will only be published if the version field in k8s/distributed-chroma/Chart.yaml is updated.

@rescrv
Copy link
Copy Markdown
Contributor Author

rescrv commented Feb 13, 2026

I believe this change fixes what was responsible for the garbage_collector_component::tests::test_k8s_integration_tenant_mode_override test failures in CI.

@propel-code-bot
Copy link
Copy Markdown
Contributor

propel-code-bot bot commented Feb 13, 2026

Updates the dev Helm values used in CI so each service now requests and limits 200m CPU instead of the previous 100m, ensuring throttled pods receive sufficient headroom.

Possible Issues

• Mismatch between PR description/review feedback (300m) and final values (200m) could leave query workloads under-provisioned.
• Higher CPU reservations might exceed available CI node capacity if multiple components run concurrently.

This summary was automatically generated by @propel-code-bot

@blacksmith-sh

This comment has been minimized.

@rescrv rescrv changed the title [CHORE](k8s) increase dev CPU limits from 100m to 500m [CHORE](k8s) increase dev CPU limits from 100m to 300m Feb 13, 2026
@rescrv rescrv changed the title [CHORE](k8s) increase dev CPU limits from 100m to 300m [CHORE](k8s) increase dev CPU limits from 100m to 200-300m Feb 13, 2026
resources:
limits:
cpu: 100m
cpu: 200m
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Important

[Requirements] QueryService is still capped at 200m here (and in values2.dev.yaml), even though the PR description says every service should move to 300m to eliminate throttling. Leaving this service at 200m keeps the query pods under-provisioned while everything else gets the higher limit/request. Please bump both the limits and requests CPU values for QueryService to 300m in both dev values files, e.g.

  resources:
    limits:
      cpu: 300m
    requests:
      cpu: 300m
Context for Agents
QueryService is still capped at 200m here (and in `values2.dev.yaml`), even though the PR description says every service should move to 300m to eliminate throttling. Leaving this service at 200m keeps the query pods under-provisioned while everything else gets the higher limit/request. Please bump both the `limits` and `requests` CPU values for QueryService to 300m in both dev values files, e.g.

```yaml
  resources:
    limits:
      cpu: 300m
    requests:
      cpu: 300m
```

File: k8s/distributed-chroma/values.dev.yaml
Line: 38

cpu: 200m
requests:
cpu: 100m
cpu: 200m
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Important

[Requirements] The PR description mentions that rustSysdbService and rustSysdbMigration are updated in both values files. However, the updates for these services appear to be missing from this file (they are correctly updated in k8s/distributed-chroma/values2.dev.yaml).

Context for Agents
The PR description mentions that `rustSysdbService` and `rustSysdbMigration` are updated in both values files. However, the updates for these services appear to be missing from this file (they are correctly updated in `k8s/distributed-chroma/values2.dev.yaml`).

File: k8s/distributed-chroma/values.dev.yaml
Line: 67

@rescrv rescrv enabled auto-merge (squash) February 13, 2026 22:12
@rescrv rescrv merged commit 8742d3d into main Feb 13, 2026
67 checks passed
tanujnay112 added a commit that referenced this pull request Feb 18, 2026
- **[ENH]: Cache rust git submodules in mounted volume (#6424)**
- **[CHORE](k8s) increase dev CPU limits from 100m to 200-300m (#6435)**
- **[ENH] replace live cloud tests with k8s integration tests (#6434)**
- **[ENH] Make dirty_log_collections metric mcmr-aware. (#6353)**
- **[ENH] Quantized Spann Segment Writer (#6397)**
- **[ENH] Wire up quantized writer in compaction (#6399)**
- **[ENH] Quantized Spann Segment Reader (#6405)**
- **[ENH] Wire up quantized reader in new orchestrator (#6409)**
- **[ENH] Garbage collect usearch index files (#6416)**
- **[ENH] Trace quantized spann implementation (#6425)**
- **[ENH]: Precompute data chunk len() (#6442)**
- **[BUG]: Compaction version file flush was incomplete on MCMR
(#6423)**
- **[DOC]: Fixed broken links in Readme (#6440)**
- **[DOC] Fix link to Rust documentation (#6443)**
- **[ENH]: Allow users to disable FTS in schema (#6214)**

---------

Co-authored-by: Robert Escriva <[email protected]>
Co-authored-by: Macronova <[email protected]>
Co-authored-by: Nilpotent <[email protected]>
Co-authored-by: anderk222 <[email protected]>
Co-authored-by: Sanket Kedia <[email protected]>
@rescrv rescrv deleted the rescrv/more-resources branch March 2, 2026 18:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants