@sfallah sfallah commented Nov 20, 2025

Feature Request: #16676


GGUF Models

sabafallah/DeepSeek-OCR-GGUF

deepseek-ocr-f32.gguf

mmproj-deepseek-ocr-f32.gguf

Running the Model

Build llama.cpp (Mac)

cmake -B build -G Ninja -DCMAKE_BUILD_TYPE=Release
cmake --build build -j --config Release

Running llama-mtmd-cli

DeepSeekOCR Paper (First page)

build/bin/llama-mtmd-cli \
  -m gguf_models/deepseek-ai/deepseek-ocr-f16.gguf \
  --mmproj gguf_models/deepseek-ai/mmproj-deepseek-ocr-f16.gguf \
  --image tmp/mtmd_test_data/Deepseek-OCR-2510.18234v1_page1.png \
  -p "<|grounding|>Convert the document to markdown." \
  --chat-template deepseek-ocr --temp 0

Hard Test (Old Newspaper Image)

build/bin/llama-mtmd-cli \
  -m gguf_models/deepseek-ai/deepseek-ocr-f16.gguf \
  --mmproj gguf_models/deepseek-ai/mmproj-deepseek-ocr-f16.gguf \
  --image tools/mtmd/test-1.jpeg \
  -p "<|grounding|>Convert the document to markdown." \
  --chat-template deepseek-ocr --temp 0
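
The two invocations above differ only in the image path, so they could be driven from a small helper. The following is a minimal sketch, not part of this PR: the function name `build_ocr_command` and its default arguments are hypothetical, and only the flags already shown in the commands above (`-m`, `--mmproj`, `--image`, `-p`, `--chat-template`, `--temp`) are used.

```python
import shlex

# Defaults copied from the commands above; adjust them to your local layout.
DEFAULT_PROMPT = "<|grounding|>Convert the document to markdown."


def build_ocr_command(image,
                      prompt=DEFAULT_PROMPT,
                      model_dir="gguf_models/deepseek-ai",
                      binary="build/bin/llama-mtmd-cli"):
    """Return the argv list for one llama-mtmd-cli OCR run (hypothetical helper)."""
    return [
        binary,
        "-m", f"{model_dir}/deepseek-ocr-f16.gguf",
        "--mmproj", f"{model_dir}/mmproj-deepseek-ocr-f16.gguf",
        "--image", str(image),
        "-p", prompt,
        "--chat-template", "deepseek-ocr",
        "--temp", "0",
    ]


if __name__ == "__main__":
    # Print both test commands in copy-pasteable, shell-quoted form.
    for image in ("tmp/mtmd_test_data/Deepseek-OCR-2510.18234v1_page1.png",
                  "tools/mtmd/test-1.jpeg"):
        print(shlex.join(build_ocr_command(image)))
```

The returned list can be passed directly to `subprocess.run(cmd, capture_output=True, text=True)` once the binary and GGUF files exist locally.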

@github-actions github-actions bot added the model, examples, and python labels Nov 20, 2025
@sfallah sfallah marked this pull request as draft November 20, 2025 09:12

ngxson commented Dec 12, 2025

Heads up: sorry for the breaking change, but there will be a refactoring (just moving stuff around) in #17965.

After finishing with this refactoring (and after you're done testing on your side), I'll go back to deepseek-ocr.


sfallah commented Dec 13, 2025

@ngxson

> Heads up: sorry for the breaking change, but there will be a refactoring (just moving stuff around) in #17965.
>
> After finishing with this refactoring (and after you're done testing on your side), I'll go back to deepseek-ocr.

The merge with #17965 is done.
I have also added deepseek-ocr to tests.sh.
As far as my tests go, it works, but the Python test script is not done yet.
I will finish the test script tomorrow.

python test script for deepseek-ocr
testing OCR on test-1.jpeg newspaper image
checking against expected reference model output for Free-OCR and Markdown
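
Checking OCR output against a reference, as these commits describe, could be sketched as follows. This is an illustration only, not the PR's test script: it assumes a whitespace-normalized character-level similarity from Python's standard `difflib`, and the names `normalize`, `similarity`, `check_ocr_output`, and the 0.9 threshold are all hypothetical.

```python
import difflib


def normalize(text):
    """Collapse runs of whitespace so layout differences do not affect the score."""
    return " ".join(text.split())


def similarity(actual, expected):
    """Return a ratio in [0, 1]; 1.0 means identical after normalization."""
    matcher = difflib.SequenceMatcher(
        None, normalize(actual), normalize(expected), autojunk=False
    )
    return matcher.ratio()


def check_ocr_output(actual, expected, threshold=0.9):
    """Return (passed, score) for one OCR result against its reference text.

    The threshold is an assumed value, not one taken from the PR.
    """
    score = similarity(actual, expected)
    return score >= threshold, score
```

A threshold below 1.0 leaves room for small differences between backends or quantizations. Exact string equality would be the stricter alternative.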
@sfallah sfallah requested review from CISC and ngxson December 16, 2025 06:20
# Conflicts:
#	gguf-py/gguf/constants.py
#	gguf-py/gguf/tensor_mapping.py
#	tools/mtmd/clip-impl.h
#	tools/mtmd/clip.cpp
#	tools/mtmd/models/models.h
- added GLM-4.6V to big tests
- added missing deps for python test