fix: cache ONNXMiniLM_L6_V2 instance in DefaultEmbeddingFunction by Jah-yee · Pull Request #6960 · chroma-core/chroma

Jah-yee · 2026-04-23T10:44:53Z

DefaultEmbeddingFunction.call was constructing a fresh ONNXMiniLM_L6_V2 on every call, triggering cold lazy-init of the tokenizer (~5ms) and ONNX session each time. This caused a ~10x slowdown on repeated embed calls.

Fix: create the ONNXMiniLM_L6_V2 instance once in call and cache it on self._ef for subsequent calls.

Fixes #6941

Signed-off-by: Jah-yee [email protected]

Thank you for your work on this project. I hope this small fix is helpful. Please let me know if there's anything to adjust.

Warmly, RoomWithOutRoof

DefaultEmbeddingFunction.__call__ was constructing a fresh ONNXMiniLM_L6_V2 on every call, triggering cold lazy-init of the tokenizer (~5ms) and ONNX session each time. This caused a ~10x slowdown on repeated embed calls. Fix: create the ONNXMiniLM_L6_V2 instance once in __call__ and cache it on self._ef for subsequent calls. Fixes chroma-core#6941 Signed-off-by: Jah-yee <[email protected]>

github-actions · 2026-04-23T10:45:11Z

propel-code-bot · 2026-04-23T10:45:20Z

Cache ONNXMiniLM_L6_V2 Instance in DefaultEmbeddingFunction

This PR fixes a performance issue in DefaultEmbeddingFunction by avoiding repeated construction of ONNXMiniLM_L6_V2 on every __call__. The implementation now initializes a cached embedding function instance once and reuses it for subsequent calls.

The change is limited to chromadb/api/types.py and updates DefaultEmbeddingFunction.__init__ and DefaultEmbeddingFunction.__call__ to store and use self._ef. This aligns with the PR intent to remove repeated lazy initialization overhead during repeated embedding operations.

This summary was automatically generated by @propel-code-bot

propel-code-bot

No issues were found; the caching change is sound and should improve embedding call performance safely.

Status: No Issues Found | Risk: Low

Review Details

📁 1 files reviewed | 💬 0 comments

propel-code-bot Bot reviewed Apr 23, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: cache ONNXMiniLM_L6_V2 instance in DefaultEmbeddingFunction#6960

fix: cache ONNXMiniLM_L6_V2 instance in DefaultEmbeddingFunction#6960
Jah-yee wants to merge 1 commit intochroma-core:mainfrom
Jah-yee:fix/6948-cache-onnx-instance

Jah-yee commented Apr 23, 2026

Uh oh!

github-actions Bot commented Apr 23, 2026

Uh oh!

propel-code-bot Bot commented Apr 23, 2026

Uh oh!

propel-code-bot Bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Jah-yee commented Apr 23, 2026

Uh oh!

github-actions Bot commented Apr 23, 2026

Reviewer Checklist

Testing, Bugs, Errors, Logs, Documentation

System Compatibility

Quality

Uh oh!

propel-code-bot Bot commented Apr 23, 2026

Uh oh!

propel-code-bot Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant