Skip to content

Conversation

@ikawrakow
Copy link
Owner

DeepSeek-Lite on M2-Max CPU:

model threads test t/s (main) t/s (PR) Speedup
deepseek2 16B IQ1_S_R4 2 tg128 22.76 ± 0.15 24.07 ± 0.19 1.058
deepseek2 16B IQ1_S_R4 4 tg128 37.83 ± 0.00 39.58 ± 0.02 1.046
deepseek2 16B IQ1_S_R4 8 tg128 62.01 ± 0.02 65.26 ± 0.82 1.052
deepseek2 16B IQ1_S_R4 8 pp512 251.97 ± 0.09 283.20 ± 0.54 1.124

@ikawrakow ikawrakow merged commit a6f9f2e into main Feb 5, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants