Low Memory mode by generall · Pull Request #8714 · qdrant/qdrant

generall · 2026-04-18T00:41:28Z

Low memory mode

Motivation

It is a frequent situation in production, when customer just keep pushing more data regardless of the machine capacity.
At some point capacity is exhausted and machine cashes. In worst case, machine goes into crash loop, and there are no
nice way to recover it from this situation, as we can't even change config as API are not available.

We need a way to recovery from this situation.

Proposal

Special configuration option low_memory_mode is added to the config.

Should have 3 options:

disabled (default) - no special handling, all collection modules are loaded as usual
no-resident
When it is set, loading of all collection modules should not force anything to RAM if possible:
- Quantization should be loaded as if always_ram=false and vectors are on disk
- Payload indexes should be loaded as if on_disk=true
no-populate - same as no-resident, but also no population of RAM from disk should be done. This affects loading of orginal vectors, HNSW index, payload storage

Implementation details

Make sure that all components that support loading into RAM or disk have compatible format on disk, so they can be loaded in both modes without any issues.
Decide how to propagate parameter, either use global variable, or propagate it through function parameters. It would depend on how deep we need to propagate.
Implement parameter check and handling in all relevant components

Testing scenario:

load snapshot with all payloads and vector quantizations
check out memory reporitng API with and without option enabled

timvisee · 2026-04-20T08:22:10Z

+        // Low-memory mode `no_populate` suppresses mmap prefault globally.
+        // Pages will be faulted in on demand when queries touch them.
+        if crate::low_memory::low_memory_mode().skip_populate() {
+            return Ok(());
+        }
+


It's my understanding the low memory mode should also suppress population of universal IO disk cache.

@xzfc could you also confirm this from your side?

I would say yes, because otherwise we can crash because of local disk cash is full

timvisee

Tested locally, works as expected 👌

In fact, it clearly shows how slow loading into memory is for some of our storage components. In my test loading into memory takes 9 seconds, while starting with no_resident makes it startup in 0.5 seconds. I'm using a local NVMe disk.

* [AI] implement parameter + cover populate + cover quantized vectors * telemetry OpenAPI schema * [AI] hook immutable payload indexes * fmt * do not populate payload index if we fallback to mmap * Reformat * Also suppress universal IO disk cache population --------- Co-authored-by: timvisee <[email protected]>

generall added 4 commits April 18, 2026 02:18

[AI] implement parameter + cover populate + cover quantized vectors

b7b887a

telemetry OpenAPI schema

609aec7

[AI] hook immutable payload indexes

b6497ac

fmt

3edb395

This comment was marked as resolved.

Sign in to view

do not populate payload index if we fallback to mmap

cd68b2f

generall requested a review from timvisee April 19, 2026 22:12

qdrant deleted a comment from coderabbitai Bot Apr 20, 2026

timvisee added 2 commits April 20, 2026 10:16

Reformat

892911e

Also suppress universal IO disk cache population

074bd8b

This comment was marked as resolved.

Sign in to view

timvisee force-pushed the low-memory-mode branch from caa6170 to 074bd8b Compare April 20, 2026 08:31

timvisee reviewed Apr 20, 2026

View reviewed changes

qdrant deleted a comment from coderabbitai Bot Apr 20, 2026

timvisee approved these changes Apr 20, 2026

View reviewed changes

generall merged commit f321c9f into dev Apr 20, 2026
29 of 30 checks passed

generall deleted the low-memory-mode branch April 20, 2026 09:33

timvisee mentioned this pull request May 8, 2026

Bump version to 1.18.0 #8959

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Low Memory mode#8714

Low Memory mode#8714
generall merged 7 commits into
devfrom
low-memory-mode

generall commented Apr 18, 2026 •

edited by timvisee

Loading

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

Uh oh!

timvisee Apr 20, 2026

Uh oh!

generall Apr 20, 2026

Uh oh!

Uh oh!

timvisee left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

generall commented Apr 18, 2026 • edited by timvisee Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Low memory mode

Motivation

Proposal

Implementation details

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

Uh oh!

timvisee Apr 20, 2026

Choose a reason for hiding this comment

Uh oh!

generall Apr 20, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

timvisee left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

generall commented Apr 18, 2026 •

edited by timvisee

Loading