feat: Support for vector search with Qdrant by Anush008 · Pull Request #381 · MemPalace/mempalace

Anush008 · 2026-04-09T14:00:22Z

NOTE

This PR is superseded by

feat: add Qdrant as alternative storage backend (#700)

It implements the new vector search backend specification using Qdrant.

Description

This PR adds support for using Qdrant as a vector search provider in MemPalace.

Qdrant is an open-source vector search engine for high performance and massive scale.

Changes

Replaced the current inline Chroma usage with a common interface for vector search backends.
Added concrete implementations using Chroma and Qdrant.
The Qdrant client dependency is marked optional, keeping Chroma as the default backend (current behaviour remains unchanged).

Testing

I've unit tested this integration against a local Qdrant instance.

Setup

You can run Qdrant with

docker run -p 6333:6333 qdrant/qdrant

Set

MEMPALACE_VECTOR_BACKEND=qdrant
MEMPALACE_QDRANT_URL=http://localhost:6333/

The dashboard is accessible at http://localhost:6333/dashboard

Checklist

Tests pass (python -m pytest tests/ -v)
No hardcoded paths
Linter passes (ruff check .)

DanyLeeTW · 2026-04-09T16:12:01Z

Comprehensive Code Review Available

A detailed review of this PR has been conducted and is available at: docs/pr-reviews/pr-381-qdrant-vector-search.md

📊 Review Summary

Architecture: ⭐⭐⭐⭐½ (4.5/5)
Code Quality: ⭐⭐⭐⭐ (4/5)
Test Coverage: ⭐⭐ (2/5) - Needs improvement
Overall: ⭐⭐⭐½ (3.5/5) - Good with room for improvement

🎯 Key Findings

Strengths:

✅ Clean Protocol-based architecture
✅ Excellent backward compatibility
✅ Flexible configuration system
✅ Proper optional dependency handling

Critical Issues (Must Fix Before Merge):

Missing Error Handling (HIGH priority)
- No helpful ImportError when qdrant-client not installed
- Users get cryptic errors instead of install guidance
Global State (MEDIUM priority)
- ChromaDB backend uses module-level globals
- Causes thread safety issues and testing difficulties
Missing Test Coverage (MEDIUM priority)
- No tests for Qdrant backend
- No backend switching tests
- Missing configuration validation tests

📋 Recommended Actions

P0 - Must Fix (2-3 hours):

Add ImportError handling with install instructions
Add configuration validation (qdrant_url required)
Add basic backend tests

P1 - Should Fix (4-6 hours):

Refactor to class-based ChromaBackend
Add comprehensive test coverage
Document embedding model compatibility

🔗 Full Analysis

The complete 508-line review includes:

Detailed code analysis with line numbers
Implementation code examples
Test templates
Migration guides
Rollback procedures

Location: docs/pr-reviews/pr-381-qdrant-vector-search.md

Estimated Effort: 7-11 hours to merge-ready state

Review conducted using comprehensive PR analysis methodology. All findings documented with actionable recommendations.

Anush008 · 2026-04-09T19:59:59Z

With regards to the AI recommended actions above,

The Chroma implementation is kept as is. It doesn't need to be wrapped in another class.
Import errors are indeed handled gracefully.
I've not added test code since there isn't a test suite set up for the other components. I've tested the vector search integration locally. Thoroughly.

web3guru888

Nice abstraction layer. The VectorCollection protocol in backends/__init__.py is the right approach — it lets integrators swap backends without touching application code.

One thing I noticed: the Qdrant path returns a proper QdrantCollection class that explicitly implements VectorCollection, but the ChromaDB path returns native ChromaDB collection objects that satisfy the protocol through duck typing. This works today, but it's fragile — if a future ChromaDB release renames or reorders parameters on .query() or .get(), the protocol match breaks silently (no isinstance check, no type error at construction time).

Might be worth wrapping Chroma in a thin adapter class too (like ChromaCollection(VectorCollection)), even if it's just delegation. That way both backends fail the same way if the contract drifts.

Also: The _scroll_cursor state on QdrantCollection makes .get() with pagination stateful at the instance level. If two callers paginate the same collection concurrently (e.g., MCP handler + CLI status), they'll clobber each other's cursor. The ChromaDB path doesn't have this issue because offset/limit are stateless. Consider making scroll state caller-managed or using a separate iterator pattern.

Anush008 · 2026-04-10T09:08:32Z

the ChromaDB path returns native ChromaDB collection objects that satisfy the protocol through duck typing. This works today, but it's fragile — if a future ChromaDB release renames or reorders parameters on .query() or .get(), the protocol match breaks

It's highly unlikely that Chroma would do this without a major release. If they did, even wrapping it in ChromaCollection(VectorCollection) won't save us. It'll break too.

Formalizes the BaseCollection/BaseBackend contract introduced as a seam in #413 into an interchangeability spec that third-party backends can build to. Driven by six in-flight backend PRs (#574, #643, #665, #697, #700, #381) each implementing the interface differently. Key decisions captured: entry-point distribution, typed QueryResult/ GetResult replacing Chroma dict shape, daemon-first multi-palace model via PalaceRef, required where-clause subset (incl. $contains), mandatory embedder injection with model-identity validation, capability tokens, shared pytest conformance suite, and a backend-neutral migrate/verify CLI.

…nd registry (RFC 001 §10) Advances RFC 001 §10 cleanup so backend-author PRs (#574 LanceDB, #665 Postgres, #700 Qdrant, #697 hosted, #643 PalaceStore, #381 Qdrant) have a stable target to align against. Scope (this PR): - Typed QueryResult / GetResult dataclasses replace Chroma's dict shape at the BaseCollection boundary (§1.3). A transitional _DictCompatMixin keeps existing callers working while the attribute-access migration proceeds. - BaseCollection is now kwargs-only across add/upsert/query/get/delete/update with ABC defaults for estimated_count/close/health and a non-atomic default update() (§1.1–1.2). - PalaceRef replaces raw path strings at the backend boundary (§2.2). - BaseBackend ABC with get_collection/close_palace/close/health/detect (§2.3). - mempalace.backends entry-point group + in-tree registry with resolve_backend_for_palace priority order matching §3.2–3.3. - ChromaCollection normalizes chroma returns into typed results; unknown where-clause operators raise UnsupportedFilterError (no silent drop, §1.4). - ChromaBackend absorbs the inode/mtime client-cache freshness check previously duplicated in mcp_server._get_client() (§10 + PR #757). - searcher.py migrated to typed-attribute access as the reference call site; remaining callers land in a follow-up. - pyproject: chroma registered via [project.entry-points."mempalace.backends"]. Out of scope (explicit follow-ups): - Full caller migration off the dict-compat shim across palace.py, mcp_server.py, miner.py, convo_miner.py, dedup.py, repair.py, exporter.py, palace_graph.py, cli.py, closet_llm.py. - Embedder injection + three-state EmbedderIdentityMismatchError check (§1.5). - maintenance_state() / run_maintenance() benchmark hooks (§7.3). - AbstractBackendContractSuite full coverage (§7.1–7.2). - mempalace migrate / mempalace verify CLI rewrites through BaseCollection (§8). Tests: 970 passed (up from 967 on develop); new coverage for typed results, empty-result outer-shape preservation, \$regex rejection, registry lookup, priority resolver, and PalaceRef-kwarg ChromaBackend.get_collection. Refs: #743 (RFC 001), #989 (RFC 002 tracking issue).

@zackchiutw

Scanned all 233 open upstream PRs today against our open PRs and fork-ahead / planned-work items. Findings merged into README: - P2 (decay) and P3 Tier-0 (LLM rerank): both covered by MemPalace#1032 (@zackchiutw, MERGEABLE, 2026-04-19 — Weibull decay + 4-stage rerank pipeline). Older simpler version at MemPalace#337. Dropped as fork work; watching MemPalace#1032. - P7 (alternative storage): formally out of scope. RFC 001 MemPalace#743 (@igorls) defines the plugin contract; four backend PRs already in flight (MemPalace#700, MemPalace#381 Qdrant; MemPalace#574, MemPalace#575 LanceDB). Fork consumes, does not rebuild. - P0 (multi-label tags): still fork/upstream candidate. MemPalace#1033 (@zackchiutw) ships adjacent privacy-tag + progressive disclosure but not the full multi-label scheme. - Merged MemPalace#1023 section acknowledges complementary MemPalace#976 (felipetruman) which adds broader mine_global_lock() + HNSW num_threads pin. Gives future-us a map so we don't re-file MemPalace#1036-style duplicates.

Anush008 · 2026-04-25T07:34:46Z

This PR is superseded by

feat: add Qdrant as alternative storage backend (#700)

It implements the new vector search backend specification using Qdrant.

feat: Support for Qdrant vector search

126ad62

Merge branch 'main' into Anush008/main

358f3a5

web3guru888 reviewed Apr 10, 2026

View reviewed changes

Merge branch 'main' into main

4d4d4f1

bensig changed the base branch from main to develop April 11, 2026 22:22

bensig requested review from bensig, igorls and milla-jovovich as code owners April 11, 2026 22:22

bensig mentioned this pull request Apr 12, 2026

RFC: Storage backend plugin specification #737

Open

igorls mentioned this pull request Apr 12, 2026

docs: RFC 001 — storage backend plugin specification #743

Open

5 tasks

Anush008 added 2 commits April 13, 2026 16:06

Merge branch 'develop'

e83fbfd

chore: ruff check

3d9f984

igorls added area/cli CLI commands area/install pip/uv/pipx/plugin install and packaging area/mcp MCP server and tools area/mining File and conversation mining area/search Search and retrieval enhancement New feature or request storage labels Apr 14, 2026

Anush008 added 2 commits April 15, 2026 15:11

Merge remote-tracking branch 'origin/develop'

15f653d

chore: uvx ruff check

b7fec5e

igorls mentioned this pull request Apr 18, 2026

refactor(backends): RFC 001 §10 cleanup — typed results, PalaceRef, registry #995

Merged

4 tasks

Anush008 added 2 commits April 22, 2026 01:12

Merge branch 'develop'

855c173

Merge branch 'develop' into main

9d8a505

chore: Updated lockfile

550791f

Merge branch 'develop'

0280f8d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Support for vector search with Qdrant#381

feat: Support for vector search with Qdrant#381
Anush008 wants to merge 11 commits intoMemPalace:developfrom
Anush008:main

Anush008 commented Apr 9, 2026 •

edited

Loading

Uh oh!

DanyLeeTW commented Apr 9, 2026

Uh oh!

Anush008 commented Apr 9, 2026

Uh oh!

web3guru888 left a comment

Uh oh!

Anush008 commented Apr 10, 2026

Uh oh!

Anush008 commented Apr 25, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

Anush008 commented Apr 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

NOTE

Description

Changes

Testing

Setup

Checklist

Uh oh!

DanyLeeTW commented Apr 9, 2026

Comprehensive Code Review Available

📊 Review Summary

🎯 Key Findings

📋 Recommended Actions

🔗 Full Analysis

Uh oh!

Anush008 commented Apr 9, 2026

Uh oh!

web3guru888 left a comment

Choose a reason for hiding this comment

Uh oh!

Anush008 commented Apr 10, 2026

Uh oh!

Anush008 commented Apr 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Anush008 commented Apr 9, 2026 •

edited

Loading

Anush008 commented Apr 25, 2026 •

edited

Loading