Add support for embedding models by cmbrose · Pull Request #21 · tonybaloney/llm-github-models

cmbrose · 2025-05-17T22:50:30Z

Fixes #20

Exposes the already-defined embedding models for use with llm embed. Additionally exposes smaller dimension versions of text-embedding-3-large (256, 1024) and text-embedding-3-small (512).

I took the approach from the openai embedding models from llm itself.

$ llm embed -m github/text-embedding-3-large -c "Hello world"
[ ... embedding floats ... ]

$ llm embed -m github/text-embedding-3-small-512 -c "Hello world"
[ ... embedding floats ... ]

$ llm embed -m github/text-embedding-3-small -c "Hello world" | jq 'length'
1536

$ llm embed -m github/text-embedding-3-small-512 -c "Hello world" | jq 'length'
512

llm_github_models.py

tonybaloney · 2025-05-26T03:49:38Z

llm_github_models.py

+        )
+
+        kwargs = {
+            "input": texts,


input in EmbeddingModel accepts a iterable of str or bytes, whereas client.embed takes str or Embedding Token. For now, I think bytes will probably fail. We should add a test for that too

cmbrose added 3 commits May 17, 2025 15:19

Add support for embedding models

07a887f

Add dimensions support

7e3b164

batch size

b208f68

cmbrose force-pushed the cmbrose/embeddings-support branch 2 times, most recently from 94d8a27 to b208f68 Compare May 19, 2025 17:50

tonybaloney added 3 commits May 26, 2025 13:38

reformat

b9f08ed

Merge branch 'main' into pr/cmbrose/21

f84848b

Ruff format and lint

53d2ced

tonybaloney reviewed May 26, 2025

View reviewed changes

llm_github_models.py Outdated Show resolved Hide resolved

tonybaloney reviewed May 26, 2025

View reviewed changes

tonybaloney added 3 commits May 26, 2025 13:53

Ruff updates

9de9ab1

Merge branch 'main' into pr/cmbrose/21

795f662

Formatter updates

9550823

tonybaloney approved these changes May 26, 2025

View reviewed changes

Update failing test semantics

adf8e31

tonybaloney merged commit 3261a4d into tonybaloney:main May 26, 2025
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Add support for embedding models#21

Add support for embedding models#21
tonybaloney merged 10 commits intotonybaloney:mainfrom
cmbrose:cmbrose/embeddings-support

cmbrose commented May 17, 2025

Uh oh!

Uh oh!

tonybaloney May 26, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

cmbrose commented May 17, 2025

Uh oh!

Uh oh!

tonybaloney May 26, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants