feat(reranker): add Jina v3 reranker support by yes999zc · Pull Request #331 · jundot/omlx

yes999zc · 2026-03-21T00:54:07Z

What

Add support for Jina v3 reranker models (jinaai/jina-reranker-v3-mlx) to omlx's reranker module.

Why

Issue #326 requested Jina reranker support. The Jina v3 reranker uses <|score_token|> logits for scoring instead of the yes/no token approach used by Qwen3-Reranker, and requires a separate implementation path.

Changes

Add JinaForRanking to CAUSAL_LM_RERANKER_ARCHITECTURES in model_discovery.py
Add MLXRerankerModel._load_jina_reranker() for loading Jina models via mlx-lm
Add MLXRerankerModel._rerank_jina() for reranking using <|score_token|> logits
Dispatch to the Jina path when architecture is JinaForRanking
Fix token masking and BOS/EOS handling in the Jina rerank implementation
Fix token ID resolution to handle different tokenizer formats (str vs Token objects in added_tokens_decoder)

Implementation Notes

Jina v3 reranker is based on Qwen3 architecture and uses <|score_token|> and <|rerank_token|> special tokens
The scoring extracts logits at the <|score_token|> position at the last token
The implementation follows the same pattern as the existing _rerank_causal_lm() but with Jina-specific token handling
Falls back to existing reranker implementations for other architectures (no breaking changes)

Validation (Local testing PASSED ✅)

This implementation has been validated with a real jinaai/jina-reranker-v3-mlx model:

✅ Syntax checked with omlx's bundled Python environment
✅ End-to-end tested with actual model (downloaded via Hugging Face Hub)
✅ Functional verification: Model loads successfully, token IDs resolved correctly (<|score_token|> = 151669, <|rerank_token|> = 151671)
✅ Reranking works: Produces sensible scores with correct sorting
✅ Code patterns: Follows existing patterns in reranker.py

Testing Results

The implementation was tested locally with:

Model: jinaai/jina-reranker-v3-mlx
Query: "What is artificial intelligence?"
Documents: 3 sample texts about AI, ML, and Python
Result: Successful loading and reranking with correct token resolution and score sorting

Notes

Token ID resolution now handles multiple tokenizer formats (string tokens, Token objects, dict entries)
Fallback mechanisms: added_tokens_decoder → convert_tokens_to_ids → get_added_vocab

Closes: #326

- Add JinaForRanking to CAUSAL_LM_RERANKER_ARCHITECTURES in model_discovery - Add MLXRerankerModel._load_jina_reranker() for loading jinaai/jina-reranker-v3-mlx via mlx-lm with <|score_token|> and <|rerank_token|> token resolution - Add MLXRerankerModel._rerank_jina() for reranking using score token logits - Dispatch to _rerank_jina() when model is JinaForRanking architecture Closes: jundot#326

…en logic for Jina reranker

- Fix token ID lookup in added_tokens_decoder (handle both str and Token objects) - Add fallback to convert_tokens_to_ids and get_added_vocab - Test passes with real jina-reranker-v3-mlx model

jundot

Reviewed the code. Clean scope, follows the existing _rerank_causal_lm pattern well.

Two bare except: in _load_jina_reranker() should be except Exception: to avoid catching KeyboardInterrupt/SystemExit. I'll fix those in a follow-up commit after merge.

LGTM otherwise.

FocusFlow Dev added 3 commits March 21, 2026 08:48

fix(reranker): add missing _rerank_token_id field and fix bos/eos tok…

b18c3bc

…en logic for Jina reranker

fix(reranker): improve token ID resolution for Jina reranker

8da8a5b

- Fix token ID lookup in added_tokens_decoder (handle both str and Token objects) - Add fallback to convert_tokens_to_ids and get_added_vocab - Test passes with real jina-reranker-v3-mlx model

jundot force-pushed the main branch 7 times, most recently from f6faf2f to c2beead Compare March 21, 2026 05:58

jundot approved these changes Mar 21, 2026

View reviewed changes

jundot merged commit 32be618 into jundot:main Mar 21, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(reranker): add Jina v3 reranker support#331

feat(reranker): add Jina v3 reranker support#331
jundot merged 3 commits intojundot:mainfrom
yes999zc:feat/jina-reranker-support

yes999zc commented Mar 21, 2026 •

edited

Loading

Uh oh!

jundot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

yes999zc commented Mar 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What

Why

Changes

Implementation Notes

Validation (Local testing PASSED ✅)

Testing Results

Notes

Uh oh!

jundot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

yes999zc commented Mar 21, 2026 •

edited

Loading