Comparison of Open Source Models

A comparison and analysis of open source AI models across key metrics, including quality, inference speed, context window, parameter count and licensing details. Models are considered open source (also commonly referred to as open weights) when their weights are available to download. This allows self-hosting on your own infrastructure and customizing the model, for example through fine-tuning. For details of our methodology, see our FAQs.
Kimi K2.6 and MiMo-V2.5-Pro are the highest intelligence open source models, followed by DeepSeek V4 Pro (Max) and GLM-5.1.

Highlights

Artificial Analysis Openness Index · Higher is better
Artificial Analysis Intelligence Index · Higher is better
Trainable parameters in billions

Openness

Artificial Analysis Openness Index: Results

The Openness Index assesses model openness on a normalized scale from 0 to 100 (higher is more open)
Reasoning models are indicated by a lightbulb icon

Open Source Progress

Progress in Open Weights vs. Proprietary Intelligence

Artificial Analysis Intelligence Index v4.0 incorporates 10 evaluations: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt

Artificial Analysis Intelligence Index v4.0 includes: GDPval-AA, 𝜏²-Bench Telecom, Terminal-Bench Hard, SciCode, AA-LCR, AA-Omniscience, IFBench, Humanity's Last Exam, GPQA Diamond, CritPt. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

Indicates whether the model weights are available. Models are labelled as 'Commercial Use Restricted' if the weights are available but commercial use is limited (typically requires obtaining a paid license).

Open Source Language Models Intelligence By Lab Over Time

Open Source Models Intelligence By Size Over Time

  • Tiny: 4B parameters or fewer. These models usually have the lowest resource demands.
  • Small: More than 4B and fewer than 40B parameters.
  • Medium: Between 40B and 150B parameters.
  • Large: More than 150B parameters.
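
The size classes above can be expressed as a small lookup. The Python sketch below is illustrative only: the function name `size_class` is hypothetical, it assumes the classes apply to total parameters, and it assumes that exactly 40B and exactly 150B fall in the Medium class, since the list does not say how those boundaries are treated.

```python
def size_class(total_params_b: float) -> str:
    """Map a parameter count (in billions) to a size class.

    Assumptions not stated in the source: the count is total parameters,
    and the 40B and 150B boundaries belong to the Medium class.
    """
    if total_params_b <= 0:
        raise ValueError("parameter count must be positive")
    if total_params_b <= 4:
        return "Tiny"
    if total_params_b < 40:
        return "Small"
    if total_params_b <= 150:
        return "Medium"
    return "Large"
```

Under these assumptions, a 27.8B model is classed as Small and a 744B model as Large.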

Intelligence

Artificial Analysis Intelligence Index

Estimate (independent evaluation forthcoming)

Intelligence Evaluations

Intelligence evaluations measured independently by Artificial Analysis · Higher is better
  • GDPval-AA: Agentic real-world work tasks, (Elo-500)/2000
  • Terminal-Bench Hard: Agentic coding & terminal use
  • 𝜏²-Bench Telecom: Agentic tool use
  • AA-LCR: Long context reasoning
  • Humanity's Last Exam: Reasoning & knowledge
  • GPQA Diamond: Scientific reasoning
  • SciCode: Coding
  • IFBench: Instruction following
  • CritPt: Physics reasoning
  • APEX-Agents-AA: Long-horizon agentic tasks
  • ITBench-AA: Kubernetes incident root-cause analysis
  • MMMU-Pro: Visual reasoning
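
The GDPval-AA entry above is labelled (Elo-500)/2000, which reads as a linear rescaling of an Elo rating. The Python sketch below applies that expression as written. The function names and the example rating are hypothetical, and the source does not say whether the result is clamped or scaled further before it is charted.

```python
def normalize_elo(elo: float) -> float:
    """Apply the (Elo - 500) / 2000 rescaling quoted for GDPval-AA."""
    return (elo - 500) / 2000


def to_elo(score: float) -> float:
    """Invert the rescaling to recover the Elo rating."""
    return score * 2000 + 500
```

For a hypothetical rating of 1500, `normalize_elo(1500)` returns 0.5, and `to_elo(0.5)` recovers 1500.0.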

While model intelligence generally translates across use cases, specific evaluations may be more relevant for certain use cases.

Size

Intelligence Index By Model Size

Large Models (>150B)
Medium Models (40B-150B)
Small Models (4B-40B)

Model Size: Total and Active Parameters

Comparison between total model parameters and parameters active during inference

Total parameters: the total number of trainable weights and biases in the model, expressed in billions. These parameters are learned during training and determine the model's ability to process and generate responses.

Active parameters: the number of parameters actually executed during each inference forward pass, expressed in billions. For Mixture of Experts (MoE) models, a routing mechanism selects a subset of experts per token, so fewer parameters are active than the total. Dense models use all parameters, so active equals total.
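
To make the relationship between total and active parameters concrete, the Python sketch below computes the share of parameters used per forward pass. The function name `active_fraction` is hypothetical, and the convention of omitting the active count for a dense model is an assumption made for this example.

```python
from typing import Optional


def active_fraction(total_b: float, active_b: Optional[float] = None) -> float:
    """Return the share of parameters executed per forward pass.

    Both counts are in billions. A dense model has no separate active
    count, so pass only total_b and the result is 1.0.
    """
    if total_b <= 0:
        raise ValueError("total_b must be positive")
    if active_b is None:
        active_b = total_b  # dense model: active equals total
    if not 0 < active_b <= total_b:
        raise ValueError("active_b must be in (0, total_b]")
    return active_b / total_b
```

Using figures listed on this page, a 744B MoE model with 40B active parameters uses roughly 5.4% of its weights per token, while a dense 27.8B model uses all of them.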

Intelligence vs. Active Parameters

[Chart: Artificial Analysis Intelligence Index vs. active parameters at inference time, with the most attractive quadrant highlighted. Labs shown: Alibaba, DeepSeek, Google, Kimi, MBZUAI Institute of Foundation Models, MiniMax, Mistral, NVIDIA, OpenAI, Xiaomi, Z AI.]

Intelligence vs. Total Parameters

[Chart: Artificial Analysis Intelligence Index vs. size in parameters (billions), with the most attractive quadrant highlighted. Labs shown: Alibaba, DeepSeek, Google, Kimi, MBZUAI Institute of Foundation Models, MiniMax, Mistral, NVIDIA, OpenAI, Xiaomi, Z AI.]

Context Window

Context Window

Context window: token limit · Higher is better

Larger context windows are relevant to RAG (Retrieval-Augmented Generation) workflows, which typically involve retrieving and reasoning over large amounts of data.

Context window: the maximum number of combined input and output tokens. Output tokens commonly have a significantly lower limit, which varies by model.
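
Because the limit covers input and output together, a request has to budget for both. The Python sketch below checks whether a request fits. The function name, the optional `output_limit` parameter and the token counts in the usage note are hypothetical, and how tokens are counted depends on each model's tokenizer.

```python
from typing import Optional


def fits_context(
    input_tokens: int,
    output_tokens: int,
    context_window: int,
    output_limit: Optional[int] = None,
) -> bool:
    """Return True if a request fits the combined input + output budget.

    output_limit models the separate, usually lower, cap on output
    tokens; leave it as None when no such cap is known.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if output_limit is not None and output_tokens > output_limit:
        return False
    return input_tokens + output_tokens <= context_window
```

With a 256,000-token window, a 250,000-token prompt leaves room for 4,000 output tokens but not for 8,000.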

Further details

Weights
Provider Benchmarks
Kimi K2.6
Kimi logoKimi
54
1.0KB
32B active at inference time
256k
$0.7
57
NovitaClarifaiGMI
+12
MiMo-V2.5-Pro
Xiaomi logoXiaomi
54
1.0KB
42B active at inference time
1.00M
$0.2
41
XiaomiNovitaGMIDeepInfra
DeepSeek V4 Pro (Reasoning, Max Effort)
DeepSeek logoDeepSeek
52
1.6KB
49B active at inference time
1.00M
$0.2
61
Together.aiNovitaFireworks
+8
GLM-5.1 (Reasoning)
Z AI logoZ AI
51
744B
40B active at inference time
200k
$0.9
83
FriendliAITogether.aiDeepInfra
+9
DeepSeek V4 Pro (Reasoning, High Effort)
DeepSeek logoDeepSeek
50
1.6KB
49B active at inference time
1.00M
$0.2
61
DeepInfraSiliconFlowNebius
+8
GLM-5 (Reasoning)
Z AI logoZ AI
50
744B
40B active at inference time
200k
$0.7
73
SiliconFlowParasailCoreWeave
+9
MiniMax-M2.7
MiniMax logoMiniMax
50
230B
10B active at inference time
205k
$0.2
59
FireworksMiniMaxSambaNova
+3
MiMo-V2.5
Xiaomi logoXiaomi
49
310B
15B active at inference time
1.00M
$0.1
76
NovitaParasailXiaomi
+2
Nemotron 3 Ultra 550B A55B (Reasoning)
NVIDIA logoNVIDIA
48
550B
55B active at inference time
262k
$0.5
163
Not available
CoreWeaveGMIBlackbox AI
+4
Kimi K2.5 (Reasoning)
Kimi logoKimi
47
1.0KB
32B active at inference time
256k
$0.6
44
NebiusDeepInfraFriendliAI
+12
DeepSeek V4 Flash (Reasoning, Max Effort)
DeepSeek logoDeepSeek
47
284B
13B active at inference time
1.00M
$0.1
100
SiliconFlowParasailNovita
+4
DeepSeek V4 Flash (Reasoning, High Effort)
DeepSeek logoDeepSeek
46
284B
13B active at inference time
1.00M
$0.1
-
NovitaDeepInfraGMI
+4
Qwen3.6 27B (Reasoning)
Alibaba logoAlibaba
46
27.8B
262k
$0.9
61
SiliconFlowDeepInfraNovita
+2
Qwen3.5 397B A17B (Reasoning)
Alibaba logoAlibaba
45
397B
17B active at inference time
262k
$0.9
52
Together.aiParasailGMI
+9
GLM-5.1 (Non-reasoning)
Z AI logoZ AI
44
744B
40B active at inference time
200k
$0.9
80
NovitaSiliconFlowNebius
+5
Qwen3.6 35B A3B (Reasoning)
Alibaba logoAlibaba
43
36B
3B active at inference time
262k
$0.4
179
SiliconFlowClarifaiAlibaba Cloud
+6
Kimi K2.6 (Non-reasoning)
Kimi logoKimi
43
1.0KB
32B active at inference time
256k
$0.7
45
SiliconFlowDeepInfraNovita
+9
Step 3.7 Flash
StepFun logoStepFun
43
198B
11B active at inference time
256k
$0.2
186
StepFun
GLM-4.7 (Reasoning)
Z AI logoZ AI
42
357B
32B active at inference time
200k
$0.7
76
SiliconFlowParasailNovita
+7
Qwen3.5 27B (Reasoning)
Alibaba logoAlibaba
42
27.8B
262k
$0.5
83
DeepInfraCoreWeaveNovita
+3
MiniMax-M2.5
MiniMax logoMiniMax
42
230B
10B active at inference time
205k
$0.3
215
SiliconFlowClarifaiEigen AI
+13
Hy3-preview (Reasoning)
Tencent logoTencent
42
295B
21B active at inference time
256k
$0.1
93
SiliconFlowGMI
DeepSeek V3.2 (Reasoning)
DeepSeek logoDeepSeek
42
685B
37B active at inference time
128k
$0.2
-
SambaNovaDeepSeekParasail
+12
Qwen3.5 122B A10B (Reasoning)
Alibaba logoAlibaba
42
125B
10B active at inference time
262k
$0.7
137
DeepInfraNovitaSiliconFlow
+2
MiMo-V2-Flash (Feb 2026)
Xiaomi logoXiaomi
41
309B
15B active at inference time
256k
$0.1
143
Xiaomi
Kimi K2 Thinking
Kimi logoKimi
41
1.0KB
32B active at inference time
256k
$0.8
124
Microsoft AzureKimiGoogle
+3
GLM-5 (Non-reasoning)
Z AI logoZ AI
41
744B
40B active at inference time
200k
$0.7
71
FireworksDeepInfraSiliconFlow
+3
Qwen3.5 397B A17B (Non-reasoning)
Alibaba logoAlibaba
40
397B
17B active at inference time
262k
$0.9
53
Together.aiNebiusAlibaba Cloud
+6
MiniMax-M2.1
MiniMax logoMiniMax
39
230B
10B active at inference time
205k
$0.4
232
FriendliAINovitaMiniMax
DeepSeek V4 Pro (Non-reasoning)
DeepSeek logoDeepSeek
39
1.6KB
49B active at inference time
1.00M
$0.2
60
DeepSeekLightning AIMicrosoft Azure
+2
MiMo-V2-Flash (Reasoning)
Xiaomi logoXiaomi
39
309B
15B active at inference time
256k
$0.1
150
Xiaomi
Mistral Medium 3.5
Mistral logoMistral
39
128B
256k
$2.1
147
Mistral
Gemma 4 31B (Reasoning)
Google logoGoogle
39
30.7B
256k
-
35
GMIGoogleDeepInfra
+8
Ring-2.6-1T
InclusionAI logoInclusionAI
38
1.0KB
63B active at inference time
262k
$0.5
127
InclusionAI
Step 3.5 Flash
StepFun logoStepFun
38
196B
11B active at inference time
256k
$0.1
175
StepFunSiliconFlow
Kimi K2.5 (Non-reasoning)
Kimi logoKimi
37
1.0KB
32B active at inference time
256k
$0.8
46
FriendliAIMicrosoft AzureNebius
+6
Qwen3.5 27B (Non-reasoning)
Alibaba logoAlibaba
37
27.8B
262k
$0.5
91
DeepInfraCoreWeaveAlibaba Cloud
Command A+
Cohere logoCohere
37
218B
25B active at inference time
192k
-
200
Cohere
Qwen3.6 27B (Non-reasoning)
Alibaba logoAlibaba
37
27.8B
262k
$0.9
59
NovitaMakoraDeepInfraAlibaba Cloud
Qwen3.5 35B A3B (Reasoning)
Alibaba logoAlibaba
37
36B
3B active at inference time
262k
$0.4
154
GMIAlibaba CloudSiliconFlow
+2
DeepSeek V4 Flash (Non-reasoning)
DeepSeek logoDeepSeek
36
284B
13B active at inference time
1.00M
$0.1
97
CoreWeaveGMIMakoraDeepSeek
MiniMax-M2
MiniMax logoMiniMax
36
230B
10B active at inference time
205k
$0.4
113
Amazon BedrockNovitaMiniMaxGoogle
NVIDIA Nemotron 3 Super 120B A12B (Reasoning)
NVIDIA logoNVIDIA
36
120.6B
12.7B active at inference time
1.00M
$0.3
154
DeepInfraLightning AINebius
+2
Qwen3.5 122B A10B (Non-reasoning)
Alibaba logoAlibaba
36
125B
10B active at inference time
262k
$0.7
163
DeepInfraAlibaba Cloud
MiMo-V2.5-Pro (Non-reasoning)
Xiaomi logoXiaomi
36
1.0KB
41.7B active at inference time
1.00M
$0.6
56
NovitaXiaomiGMIDeepInfra
GLM-4.7 (Non-reasoning)
Z AI logoZ AI
34
357B
32B active at inference time
200k
$0.7
76
NovitaSiliconFlowDeepInfra
+6
DeepSeek V3.1 Terminus (Reasoning)
DeepSeek logoDeepSeek
34
685B
37B active at inference time
128k
$1.7
-
SambaNovaNovita
Hy3-preview (Non-reasoning)
Tencent logoTencent
34
295B
21B active at inference time
256k
$0.1
93
SiliconFlowGMI
Ling-2.6-1T
InclusionAI logoInclusionAI
34
1.0KB
63B active at inference time
262k
$0.5
-
InclusionAI
gpt-oss-120b (high)
OpenAI logoOpenAI
33
117B
5.1B active at inference time
131k
$0.2
353
NovitaSnowflakeBaseten
+23
DeepSeek V3.2 Exp (Reasoning)
DeepSeek logoDeepSeek
33
685B
37B active at inference time
128k
$0.2
-
NovitaDeepSeek
GLM-4.6 (Reasoning)
Z AI logoZ AI
33
357B
32B active at inference time
200k
$0.7
50
DeepInfraTogether.aiNovita
Qwen3.5 9B (Reasoning)
Alibaba logoAlibaba
32
9.65B
262k
$0.1
66
Together.aiSiliconFlow
Gemma 4 31B (Non-reasoning)
Google logoGoogle
32
30.7B
256k
$0.2
44
FriendliAINovitaParasail
+4
K-EXAONE (Reasoning)
LG AI Research logoLG AI Research
32
236B
23B active at inference time
256k
-
-
-
DeepSeek V3.2 (Non-reasoning)
DeepSeek logoDeepSeek
32
685B
37B active at inference time
128k
$0.5
-
NebiusNebiusSambaNova
+12
Trinity Large Thinking
Arcee AI logoArcee AI
32
399B
13B active at inference time
512k
$0.2
211
ParasailArcee AI
Qwen3.6 35B A3B (Non-reasoning)
Alibaba logoAlibaba
32
36B
3B active at inference time
262k
$0.6
192
ClarifaiMakoraScaleway
+5
Gemma 4 26B A4B (Reasoning)
Google logoGoogle
31
25.2B
3.8B active at inference time
256k
$0.1
-
DeepInfraCloudflareParasail
+4
Kimi K2 0905
Kimi logoKimi
31
1.0KB
32B active at inference time
256k
$0.8
25
Novita
Qwen3.5 35B A3B (Non-reasoning)
Alibaba logoAlibaba
31
36B
3B active at inference time
262k
$0.4
184
DeepInfraAlibaba Cloud
MiMo-V2-Flash (Non-reasoning)
Xiaomi logoXiaomi
30
309B
15B active at inference time
256k
$0.1
149
Xiaomi
GLM-4.6 (Non-reasoning)
Z AI logoZ AI
30
357B
32B active at inference time
200k
$0.8
53
Together.aiNovita
EXAONE 4.5 33B
LG AI Research logoLG AI Research
30
34.4B
262k
-
-
-
GLM-4.7-Flash (Reasoning)
Z AI logoZ AI
30
31.2B
3B active at inference time
200k
$0.1
73
Amazon BedrockNovitaDeepInfra
Qwen3 235B A22B 2507 (Reasoning)
Alibaba logoAlibaba
30
235B
22B active at inference time
256k
$0.6
60
CoreWeaveNovitaEigen AI
+3
DeepSeek V3.2 Speciale
DeepSeek logoDeepSeek
29
685B
37B active at inference time
128k
-
-
-
Gemma 4 12B (Reasoning)
Google logoGoogle
29
12B
256k
-
-
-
DeepSeek V3.1 Terminus (Non-reasoning)
DeepSeek logoDeepSeek
29
685B
37B active at inference time
128k
$0.3
-
SambaNovaDeepInfraNovita
DeepSeek V3.2 Exp (Non-reasoning)
DeepSeek logoDeepSeek
28
685B
37B active at inference time
128k
$0.2
-
DeepSeekNovita
Nemotron Cascade 2 30B A3B
NVIDIA logoNVIDIA
28
31.6B
3B active at inference time
1.00M
-
-
-
Apriel-v1.5-15B-Thinker
ServiceNow logoServiceNow
28
15B
128k
-
-
Together.ai
Qwen3 Coder Next
Alibaba logoAlibaba
28
79.7B
3B active at inference time
256k
$0.4
113
ParasailTogether.aiNovitaAmazon Bedrock
DeepSeek V3.1 (Non-reasoning)
DeepSeek logoDeepSeek
28
685B
37B active at inference time
128k
$0.7
-
Lightning AINovitaFireworks
+7
Mistral Small 4 (Reasoning)
Mistral logoMistral
28
119B
6.5B active at inference time
256k
$0.2
174
Mistral
DeepSeek V3.1 (Reasoning)
DeepSeek logoDeepSeek
28
685B
37B active at inference time
128k
$0.7
-
Amazon BedrockSambaNovaNovitaGoogle
Qwen3 VL 235B A22B (Reasoning)
Alibaba logoAlibaba
28
235B
22B active at inference time
262k
$1.4
39
Alibaba CloudNovita
Apriel-v1.6-15B-Thinker
ServiceNow logoServiceNow
28
15B
128k
-
-
Together.ai
Qwen3.5 9B (Non-reasoning)
Alibaba logoAlibaba
27
9.65B
262k
-
-
-
Gemma 4 26B A4B (Non-reasoning)
Google logoGoogle
27
25.2B
3.8B active at inference time
256k
$0.2
60
ScalewayDeepInfraNovita
+4
Qwen3.5 4B (Reasoning)
Alibaba logoAlibaba
27
4.66B
262k
$0.0
200
DeepInfra
DeepSeek R1 0528 (May '25)
DeepSeek logoDeepSeek
27
685B
37B active at inference time
128k
$1.6
-
Microsoft AzureDeepInfraGoogle
+3
Qwen3 Next 80B A3B (Reasoning)
Alibaba logoAlibaba
27
80B
3B active at inference time
262k
$1.1
169
GMINovitaHyperbolic
+5
GLM-4.5 (Reasoning)
Z AI logoZ AI
26
355B
32B active at inference time
128k
$0.8
52
Novita
Kimi K2
Kimi logoKimi
26
1.0KB
32B active at inference time
128k
$0.6
24
KimiNovita
Ling 2.6 Flash
InclusionAI logoInclusionAI
26
107B
7.4B active at inference time
262k
$0.1
-
Novita
Seed-OSS-36B-Instruct
ByteDance Seed logoByteDance Seed
25
36.2B
512k
$0.2
36
SiliconFlow
Qwen3 235B A22B 2507 Instruct
Alibaba logoAlibaba
25
235B
22B active at inference time
256k
$0.3
53
NebiusAmazon BedrockFriendliAI
+9
Qwen3 Coder 480B A35B Instruct
Alibaba logoAlibaba
25
480B
35B active at inference time
262k
$0.5
59
GoogleAmazon BedrockAlibaba Cloud
+6
Qwen3 VL 32B (Reasoning)
Alibaba logoAlibaba
25
33.4B
256k
$1.5
90
Alibaba Cloud
gpt-oss-20B (high)
OpenAI logoOpenAI
24
21B
3.6B active at inference time
131k
$0.1
240
Together.aiCloudflareDatabricks
+10
gpt-oss-120b (low)
OpenAI logoOpenAI
24
117B
5.1B active at inference time
131k
$0.2
357
Amazon BedrockSambaNovaCerebras
+19
MiniMax M1 80k
MiniMax logoMiniMax
24
456B
45.9B active at inference time
1.00M
$0.7
-
Novita
NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)
NVIDIA logoNVIDIA
24
31.6B
3.6B active at inference time
1.00M
$0.1
149
DeepInfraNebius
K2 Think V2
MBZUAI Institute of Foundation Models logoMBZUAI Institute of Foundation Models
24
70B
262k
-
-
-
LongCat Flash Lite
LongCat logoLongCat
24
68.5B
3B active at inference time
256k
-
-
LongCat
HyperCLOVA X SEED Think (32B)
Naver logoNaver
24
32B
128k
-
-
-
GLM-4.6V (Reasoning)
Z AI logoZ AI
23
108B
12B active at inference time
128k
$0.4
90
NovitaSiliconFlow
K-EXAONE (Non-reasoning)
LG AI Research logoLG AI Research
23
236B
23B active at inference time
256k
-
-
-
GLM-4.5-Air
Z AI logoZ AI
23
106B
12B active at inference time
128k
$0.3
80
Together.aiSiliconFlow
Mistral Large 3
Mistral logoMistral
23
675B
41B active at inference time
256k
$0.6
53
Microsoft AzureAmazon BedrockMistral
Ring-1T
InclusionAI logoInclusionAI
23
1.0KB
50B active at inference time
128k
-
-
-
Qwen3.5 4B (Non-reasoning)
Alibaba logoAlibaba
23
4.66B
262k
$0.0
208
DeepInfra
Qwen3 30B A3B 2507 (Reasoning)
Alibaba logoAlibaba
22
30.5B
3.3B active at inference time
262k
$0.4
131
Alibaba CloudClarifai
DeepSeek V3 0324
DeepSeek logoDeepSeek
22
671B
37B active at inference time
128k
$1.2
-
DeepInfraMicrosoft AzureReplicate
+3
INTELLECT-3
Prime Intellect logoPrime Intellect
22
107B
12B active at inference time
131k
-
-
-
GLM-4.7-Flash (Non-reasoning)
Z AI logoZ AI
22
31.2B
3B active at inference time
200k
$0.1
102
Amazon BedrockNovita
Devstral 2
Mistral logoMistral
22
125B
256k
-
61
Mistral
Solar Open 100B (Reasoning)
Upstage logoUpstage
22
102B
12B active at inference time
128k
-
-
-
Nemotron 3 Nano Omni 30B A3B Reasoning
NVIDIA logoNVIDIA
21
30B
3B active at inference time
256k
$0.1
296
ClarifaiNebius
MiniMax M1 40k
MiniMax logoMiniMax
21
456B
45.9B active at inference time
1.00M
-
-
-
gpt-oss-20B (low)
OpenAI logoOpenAI
21
21B
3.6B active at inference time
131k
$0.1
243
Amazon BedrockCompactifAINovita
+9
Qwen3 VL 235B A22B Instruct
Alibaba logoAlibaba
21
235B
22B active at inference time
262k
$0.5
51
Alibaba CloudEigen AINovita
+2
K2-V2 (high)
MBZUAI Institute of Foundation Models logoMBZUAI Institute of Foundation Models
21
70B
512k
-
-
-
Qwen3 Next 80B A3B Instruct
Alibaba logoAlibaba
20
80B
3B active at inference time
262k
$0.7
149
HyperbolicGMIAlibaba Cloud
+4
Tri-21B-think Preview
Trillion Labs logoTrillion Labs
20
21B
32.0k
-
-
-
Qwen3 Coder 30B A3B Instruct
Alibaba logoAlibaba
20
30.5B
3.3B active at inference time
262k
$0.3
98
ClarifaiAmazon BedrockScalewayAlibaba Cloud
Qwen3 235B A22B (Reasoning)
Alibaba logoAlibaba
20
235B
22B active at inference time
32.8k
$1.5
58
Alibaba Cloud
QwQ 32B
Alibaba logoAlibaba
20
32.8B
131k
$0.7
30
Cloudflare
Qwen3 VL 30B A3B (Reasoning)
Alibaba logoAlibaba
20
30B
3B active at inference time
256k
$0.3
111
Alibaba CloudNovitaFireworksEigen AI
Devstral Small 2
Mistral logoMistral
19
24B
256k
-
51
Mistral
Ling-1T
InclusionAI logoInclusionAI
19
1.0KB
50B active at inference time
128k
-
-
-
DeepSeek R1 (Jan '25)
DeepSeek logoDeepSeek
19
685B
37B active at inference time
128k
$2.0
-
NovitaAmazon BedrockNovita
+3
Gemma 4 E4B (Reasoning)
Google logoGoogle
19
8B
4.5B active at inference time
128k
-
-
-
K2-V2 (medium)
MBZUAI Institute of Foundation Models logoMBZUAI Institute of Foundation Models
19
70B
512k
-
-
-
Llama Nemotron Super 49B v1.5 (Reasoning)
NVIDIA logoNVIDIA
19
49B
128k
$0.1
48
DeepInfra
Mistral Small 4 (Non-reasoning)
Mistral logoMistral
19
119B
6.5B active at inference time
256k
$0.2
164
Mistral
Tri-21B-Think
Trillion Labs logoTrillion Labs
19
21B
32.0k
-
-
-
Hermes 4 - Llama-3.1 405B (Reasoning)
Nous Research logoNous Research
19
406B
128k
$1.2
40
Nebius
Llama 3.3 Nemotron Super 49B v1 (Reasoning)
NVIDIA logoNVIDIA
18
49B
128k
-
-
-
Llama 4 Maverick
Meta logoMeta
18
402B
17B active at inference time
1.00M
$0.3
111
DatabricksSambaNovaMicrosoft Azure
+6
Qwen3 4B 2507 (Reasoning)
Alibaba logoAlibaba
18
4.02B
262k
-
-
-
MiniCPM5-1B (Reasoning)
OpenBMB logoOpenBMB
18
1B
128k
-
-
-
Magistral Small 1.2
Mistral logoMistral
18
24B
128k
$0.6
109
Amazon BedrockMistral
Sarvam 105B (high)
Sarvam logoSarvam
18
106B
10.3B active at inference time
128k
$0.0
109
Sarvam
Devstral Small (May '25)
Mistral logoMistral
18
23.6B
256k
-
-
-
MiniCPM5-1B (Non-reasoning)
OpenBMB logoOpenBMB
18
1B
128k
-
-
-
Hermes 4 - Llama-3.1 405B (Non-reasoning)
Nous Research logoNous Research
18
406B
128k
$1.2
41
Nebius
Llama 3.1 Instruct 405B
Meta logoMeta
17
405B
128k
$3.1
65
DatabricksAmazon BedrockAmazon BedrockMicrosoft Azure
Qwen3 VL 32B Instruct
Alibaba logoAlibaba
17
33.4B
256k
$0.9
72
Alibaba Cloud
DeepSeek R1 Distill Qwen 32B
DeepSeek logoDeepSeek
17
32B
128k
-
-
-
GLM-4.6V (Non-reasoning)
Z AI logoZ AI
17
108B
12B active at inference time
128k
$0.4
93
NovitaSiliconFlow
Qwen3 235B A22B (Non-reasoning)
Alibaba logoAlibaba
17
235B
22B active at inference time
32.8k
$0.6
57
NovitaAlibaba Cloud
Magistral Small 1
Mistral logoMistral
17
23.6B
40.0k
-
-
-
EXAONE 4.0 32B (Reasoning)
LG AI Research logoLG AI Research
17
32B
131k
-
-
-
Qwen3 VL 8B (Reasoning)
Alibaba logoAlibaba
17
8.77B
256k
$0.4
113
Alibaba Cloud
Qwen3 32B (Reasoning)
Alibaba logoAlibaba
17
32.8B
32.8k
$0.2
81
DeepInfraNebiusAlibaba Cloud
+3
DeepSeek V3 (Dec '24)
DeepSeek logoDeepSeek
16
671B
37B active at inference time
128k
$0.4
-
HyperbolicTogether.aiNovita
+2
DeepSeek R1 0528 Qwen3 8B
DeepSeek logoDeepSeek
16
8.19B
32.8k
-
-
-
Qwen3.5 2B (Reasoning)
Alibaba logoAlibaba
16
2.27B
262k
$0.0
-
DeepInfra
Qwen3 14B (Reasoning)
Alibaba logoAlibaba
16
14.8B
32.8k
$0.4
63
DeepInfraAlibaba Cloud
Nanbeige4.1-3B
Nanbeige logoNanbeige
16
3.93B
256k
-
-
-
Qwen3 VL 30B A3B Instruct
Alibaba logoAlibaba
16
30B
3B active at inference time
256k
$0.2
104
Eigen AIAlibaba CloudFireworksNovita
Hermes 4 - Llama-3.1 70B (Reasoning)
Nous Research logoNous Research
16
70.6B
128k
$0.2
88
Nebius
Ministral 3 14B
Mistral logoMistral
16
14B
256k
$0.2
81
Amazon BedrockMistral
DeepSeek R1 Distill Llama 70B
DeepSeek logoDeepSeek
16
70B
128k
$0.7
41
DeepInfraScalewaySambaNova
DeepSeek R1 Distill Qwen 14B
DeepSeek logoDeepSeek
16
14B
128k
-
-
-
Falcon-H1R-7B
TII UAE logoTII UAE
16
7B
256k
-
-
-
Ling-flash-2.0
InclusionAI logoInclusionAI
16
103B
6.1B active at inference time
128k
$0.2
80
SiliconFlow
Qwen3 Omni 30B A3B (Reasoning)
Alibaba logoAlibaba
16
35.3B
3B active at inference time
65.5k
$0.3
86
Alibaba Cloud
Qwen2.5 Instruct 72B
Alibaba logoAlibaba
16
72B
131k
$0.2
-
SiliconFlowDeepInfraAlibaba Cloud
Step3 VL 10B
StepFun logoStepFun
15
10.2B
65.5k
-
-
-
Qwen3 30B A3B (Reasoning)
Alibaba logoAlibaba
15
30.5B
3.3B active at inference time
32.8k
$0.1
75
Alibaba CloudNovitaEigen AI
+2
Devstral Small (Jul '25)
Mistral logoMistral
15
24B
256k
$0.1
42
Mistral
Gemma 4 E2B (Reasoning)
Google logoGoogle
15
5.1B
2.3B active at inference time
128k
-
-
-
QwQ 32B-Preview
Alibaba logoAlibaba
15
32.8B
32.8k
-
-
-
GLM-4.5V (Reasoning)
Z AI logoZ AI
15
108B
12B active at inference time
64.0k
$0.7
26
Novita
Mistral Large 2 (Nov '24)
Mistral logoMistral
15
123B
128k
$2.4
52
Mistral
Mistral Small 3.2
Mistral logoMistral
15
24B
128k
$0.1
139
MistralDeepInfra
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)
NVIDIA logoNVIDIA
15
253B
128k
$0.7
52
Nebius
Qwen3 30B A3B 2507 Instruct
Alibaba logoAlibaba
15
30.5B
3.3B active at inference time
262k
$0.2
132
ClarifaiNebiusAlibaba CloudCoreWeave
ERNIE 4.5 300B A47B
Baidu logoBaidu
15
300B
47B active at inference time
131k
$0.4
-
SiliconFlowNovita
NVIDIA Nemotron Nano 12B v2 VL (Reasoning)
NVIDIA logoNVIDIA
15
13.2B
128k
$0.2
286
DeepInfra
Ministral 3 8B
Mistral logoMistral
15
8B
256k
$0.1
106
MistralAmazon Bedrock
Gemma 4 E4B (Non-reasoning)
Google logoGoogle
15
8B
4.5B active at inference time
128k
-
-
-
NVIDIA Nemotron Nano 9B V2 (Reasoning)
NVIDIA logoNVIDIA
15
9B
131k
$0.1
122
DeepInfra
Granite 4.1 30B
IBM logoIBM
15
30B
131k
-
-
-
NVIDIA Nemotron 3 Nano 4B
NVIDIA logoNVIDIA
15
3.97B
262k
-
-
-
Qwen3.5 2B (Non-reasoning)
Alibaba logoAlibaba
15
2.27B
262k
$0.0
344
DeepInfra
Llama Nemotron Super 49B v1.5 (Non-reasoning)
NVIDIA logoNVIDIA
15
49B
128k
$0.1
49
DeepInfra
Qwen3 32B (Non-reasoning)
Alibaba logoAlibaba
15
32.8B
32.8k
$0.2
88
Amazon BedrockGroqAlibaba Cloud
+4
Llama 3.3 Instruct 70B
Meta logoMeta
14
70B
128k
$0.6
86
DeepInfraNebiusGoogle
+18
Mistral Small 3.1
Mistral logoMistral
14
24B
128k
$0.1
173
DeepInfraMistralCompactifAICloudflare
K2-V2 (low)
MBZUAI Institute of Foundation Models logoMBZUAI Institute of Foundation Models
14
70B
512k
-
-
-
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)
NVIDIA logoNVIDIA
14
4.51B
128k
-
-
-
Kimi Linear 48B A3B Instruct
Kimi logoKimi
14
49.1B
3B active at inference time
1.00M
-
-
-
Llama 3.3 Nemotron Super 49B v1 (Non-reasoning)
NVIDIA logoNVIDIA
14
49B
128k
-
-
-
Qwen3 VL 8B Instruct
Alibaba logoAlibaba
14
8.77B
256k
$0.2
123
Alibaba Cloud
Qwen3 4B (Reasoning)
Alibaba logoAlibaba
14
4.02B
32.0k
$0.2
-
Alibaba Cloud
Llama 3.1 Tulu3 405B
Allen Institute for AI logoAllen Institute for AI
14
405B
128k
-
-
-
Ring-flash-2.0
InclusionAI logoInclusionAI
14
103B
6.1B active at inference time
128k
$0.2
-
SiliconFlow
Pixtral Large
Mistral logoMistral
14
124B
128k
$2.4
52
Mistral
Olmo 3.1 32B Think
Allen Institute for AI logoAllen Institute for AI
14
32.2B
65.5k
-
-
Parasail
Grok 2 (Dec '24)
xAI logoxAI
14
270B
131k
-
-
-
Qwen3 VL 4B (Reasoning)
Alibaba logoAlibaba
14
4.44B
256k
-
-
-
Llama 4 Scout
Meta logoMeta
14
109B
17B active at inference time
10.0M
$0.2
101
GoogleAmazon BedrockCloudflare
+6
Command A
Cohere logoCohere
13
111B
256k
$3.3
65
CohereMicrosoft Azure
Llama 3.1 Nemotron Instruct 70B
NVIDIA logoNVIDIA
13
70B
128k
$1.2
300
DeepInfra
Qwen2.5 Instruct 32B
Alibaba logoAlibaba
13
32B
128k
-
-
-
Qwen3 8B (Reasoning)
Alibaba logoAlibaba
13
8.19B
131k
$0.2
40
Alibaba CloudEigen AI
NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)
NVIDIA logoNVIDIA
13
31.6B
3.6B active at inference time
1.00M
$0.1
93
DeepInfra
NVIDIA Nemotron Nano 9B V2 (Non-reasoning)
NVIDIA logoNVIDIA
13
9B
131k
$0.1
130
DeepInfraAmazon Bedrock
Mistral Large 2 (Jul '24)
Mistral logoMistral
13
123B
128k
$2.4
-
Amazon Bedrock
Qwen3 4B 2507 Instruct
Alibaba logoAlibaba
13
4.02B
262k
-
-
-
Qwen2.5 Coder Instruct 32B
Alibaba logoAlibaba
13
32B
131k
-
-
-
Qwen3 14B (Non-reasoning)
Alibaba logoAlibaba
13
14.8B
32.8k
$0.3
62
DeepInfraAlibaba Cloud
GLM-4.5V (Non-reasoning)
Z AI logoZ AI
13
108B
12B active at inference time
64.0k
$0.7
38
Novita
Mistral Small 3
Mistral logoMistral
13
24B
32.0k
$0.1
167
MistralDeepInfra
MiniCPM-V 4.6 1.3B
OpenBMB logoOpenBMB
13
1.3B
262k
-
-
-
Hermes 4 - Llama-3.1 70B (Non-reasoning)
Nous Research logoNous Research
13
70.6B
128k
$0.2
88
Nebius
Qwen3 30B A3B (Non-reasoning)
Alibaba logoAlibaba
13
30.5B
3.3B active at inference time
32.8k
$0.1
76
DeepInfraEigen AIAlibaba Cloud
DeepSeek-V2.5 (Dec '24)
DeepSeek logoDeepSeek
13
236B
21B active at inference time
128k
-
-
-
Qwen3 4B (Non-reasoning)
Alibaba logoAlibaba
12
4.02B
32.0k
$0.1
-
Alibaba Cloud
Llama 3.1 Instruct 70B
Meta logoMeta
12
70B
128k
$0.6
34
Amazon BedrockAmazon BedrockDeepInfraDeepInfra
Granite 4.1 8B
IBM logoIBM
12
8B
131k
$0.1
122
CoreWeave
Sarvam 30B (high)
Sarvam logoSarvam
12
32.2B
2.4B active at inference time
65.5k
$0.0
166
Sarvam
DeepSeek-V2.5
DeepSeek logoDeepSeek
12
236B
21B active at inference time
128k
-
-
-
Olmo 3.1 32B Instruct
Allen Institute for AI logoAllen Institute for AI
12
32.2B
65.5k
-
-
-
DeepSeek R1 Distill Llama 8B
DeepSeek logoDeepSeek
12
8B
128k
-
-
-
Gemma 4 E2B (Non-reasoning)
Google logoGoogle
12
5.1B
2.3B active at inference time
128k
-
-
-
Olmo 3 32B Think
Allen Institute for AI logoAllen Institute for AI
12
32.2B
65.5k
-
-
-
R1 1776
Perplexity logoPerplexity
12
671B
37B active at inference time
128k
-
-
-
Llama 3.2 Instruct 90B (Vision)
Meta logoMeta
12
90B
128k
$1.4
59
Microsoft AzureAmazon Bedrock
Solar Mini
Upstage logoUpstage
12
10.7B
4.10k
$0.1
-
Upstage
Llama 3.1 Instruct 8B
Meta logoMeta
12
8B
128k
$0.1
155
Microsoft AzureCoreWeaveCerebras
+12
Grok-1
xAI logoxAI
12
314B
78B active at inference time
8.19k
-
-
-
Qwen2 Instruct 72B
Alibaba logoAlibaba
12
72B
131k
-
-
-
EXAONE 4.0 32B (Non-reasoning)
LG AI Research logoLG AI Research
12
32B
131k
-
-
-
Ministral 3 3B
Mistral logoMistral
11
3B
256k
$0.1
176
MistralAmazon Bedrock
DeepHermes 3 - Mistral 24B Preview (Non-reasoning)
Nous Research logoNous Research
11
24B
32.0k
-
-
-
Jamba 1.7 Large
AI21 Labs logoAI21 Labs
11
398B
94B active at inference time
256k
$2.6
60
AI21 Labs
Granite 4.0 H Small
IBM logoIBM
11
32B
9B active at inference time
128k
$0.1
357
Replicate
Jamba 1.5 Large
AI21 Labs logoAI21 Labs
11
398B
94B active at inference time
256k
$2.6
-
Amazon Bedrock
Qwen3 Omni 30B A3B Instruct
Alibaba
11
35.3B
3B active at inference time
65.5k
$0.3
94
Alibaba Cloud
Hermes 3 - Llama-3.1 70B
Nous Research
11
70.6B
128k
$0.3
29
DeepInfra
Qwen3 8B (Non-reasoning)
Alibaba
11
8.19B
32.8k
$0.2
40
Alibaba Cloud, Eigen AI, Fireworks
DeepSeek-Coder-V2
DeepSeek
11
236B
21B active at inference time
128k
-
-
-
OLMo 2 32B
Allen Institute for AI
11
32.2B
4.10k
-
-
-
Jamba 1.6 Large
AI21 Labs
11
398B
94B active at inference time
256k
$2.6
61
AI21 Labs
Qwen3.5 0.8B (Reasoning)
Alibaba
11
0.873B
262k
$0.0
-
DeepInfra
LFM2 24B A2B
Liquid AI
10
23.8B
2.3B active at inference time
32.8k
$0.0
118
Together.ai
Phi-4
Microsoft
10
14B
16.0k
$0.2
33
Microsoft Azure, DeepInfra
Gemma 3 27B Instruct
Google
10
27.4B
128k
$0.1
-
DeepInfra, Amazon Bedrock, Nebius, and 3 more
Mistral Small (Sep '24)
Mistral
10
22B
32.8k
$0.2
172
Mistral
Phi-3 Mini Instruct 3.8B
Microsoft
10
3.8B
4.10k
-
-
-
NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning)
NVIDIA
10
13.2B
128k
$0.2
221
DeepInfra, Amazon Bedrock
Gemma 3n E4B Instruct Preview (May '25)
Google
10
8.39B
4B active at inference time
32.0k
-
-
-
Phi-4 Multimodal Instruct
Microsoft
10
5.6B
128k
-
17
Microsoft Azure
Qwen2.5 Coder Instruct 7B
Alibaba
10
7.62B
131k
-
-
-
Qwen3.5 0.8B (Non-reasoning)
Alibaba
10
0.873B
262k
$0.0
91
DeepInfra
Mixtral 8x22B Instruct
Mistral
10
141B
39B active at inference time
65.4k
-
-
-
Llama 2 Chat 7B
Meta
10
7B
4.10k
$0.1
-
Replicate
Llama 3.2 Instruct 3B
Meta
10
3B
128k
$0.1
51
Amazon Bedrock
Jamba Reasoning 3B
AI21 Labs
10
3B
262k
-
-
-
Qwen3 VL 4B Instruct
Alibaba
10
4.44B
256k
-
-
-
Qwen1.5 Chat 110B
Alibaba
10
110B
32.0k
-
-
-
Reka Flash 3
Reka AI
10
21B
128k
$0.3
-
Reka AI
Olmo 3 7B Think
Allen Institute for AI
9
7B
65.5k
-
-
-
OLMo 2 7B
Allen Institute for AI
9
7.3B
4.10k
-
-
-
Molmo 7B-D
Allen Institute for AI
9
8.02B
4.10k
-
-
-
Ling-mini-2.0
InclusionAI
9
16.3B
1.4B active at inference time
131k
-
-
-
DeepSeek R1 Distill Qwen 1.5B
DeepSeek
9
1.5B
128k
-
-
-
DeepSeek-V2-Chat
DeepSeek
9
236B
21B active at inference time
128k
-
-
-
Llama 3 Instruct 70B
Meta
9
70B
8.19k
$0.9
-
Amazon Bedrock, Novita, Replicate
Arctic Instruct
Snowflake
9
480B
17B active at inference time
4.00k
-
-
-
Qwen Chat 72B
Alibaba
9
72B
33.8k
-
-
-
Gemma 3 12B Instruct
Google
9
12.2B
128k
$0.1
-
DeepInfra, Google, Amazon Bedrock, and 2 more
Llama 3.2 Instruct 11B (Vision)
Meta
9
11B
128k
$0.2
51
DeepInfra, Amazon Bedrock, Microsoft Azure
Granite 4.1 3B
IBM
9
3B
131k
-
-
-
DeepSeek Coder V2 Lite Instruct
DeepSeek
8
16B
2.4B active at inference time
128k
-
-
-
Sarvam M (Reasoning)
Sarvam
8
23.6B
32.8k
-
-
Sarvam
Phi-4 Mini Instruct
Microsoft
8
3.84B
128k
-
22
CoreWeave, Microsoft Azure
Llama 2 Chat 70B
Meta
8
70B
4.10k
-
-
-
DeepSeek LLM 67B Chat (V1)
DeepSeek
8
67B
4.10k
-
-
-
Llama 2 Chat 13B
Meta
8
13B
4.10k
-
-
-
Command-R+ (Apr '24)
Cohere
8
104B
128k
$4.2
-
Amazon Bedrock
OpenChat 3.5 (1210)
OpenChat
8
7B
8.19k
-
-
-
DBRX Instruct
Databricks
8
132B
36B active at inference time
32.8k
-
-
-
EXAONE 4.0 1.2B (Reasoning)
LG AI Research
8
1.28B
64.0k
-
-
-
Olmo 3 7B Instruct
Allen Institute for AI
8
7B
65.5k
$0.1
-
Parasail
EXAONE 4.0 1.2B (Non-reasoning)
LG AI Research
8
1.28B
64.0k
-
-
-
LFM2.5-1.2B-Thinking
Liquid AI
8
1.17B
32.0k
-
-
-
Jamba 1.7 Mini
AI21 Labs
8
52B
12B active at inference time
258k
-
-
-
LFM2 2.6B
Liquid AI
8
2.57B
32.8k
-
-
?
LFM2.5-1.2B-Instruct
Liquid AI
8
1.17B
32.0k
-
-
?
Jamba 1.5 Mini
AI21 Labs
8
52B
12B active at inference time
256k
$0.2
-
Amazon Bedrock
Granite 4.0 H 1B
IBM
8
1.5B
128k
-
-
-
Qwen3 1.7B (Reasoning)
Alibaba
8
2.03B
32.0k
$0.2
-
Alibaba Cloud
Jamba 1.6 Mini
AI21 Labs
8
52B
12B active at inference time
256k
$0.2
184
AI21 Labs
Mixtral 8x7B Instruct
Mistral
8
46.7B
12.9B active at inference time
32.8k
$0.5
-
Amazon Bedrock
Gemma 3 270M
Google
8
0.268B
32.0k
-
-
-
Apertus 70B Instruct
Swiss AI Initiative
8
70B
65.5k
$1.0
-
Public AI
Granite 4.0 Micro
IBM
8
3B
128k
-
-
-
DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning)
Nous Research
8
8B
128k
-
-
-
Llama 65B
Meta
7
65B
2.05k
-
-
-
Qwen Chat 14B
Alibaba
7
14B
8.19k
-
-
-
Mistral 7B Instruct
Mistral
7
7B
8.19k
$0.2
98
Amazon Bedrock, Mistral
Command-R (Mar '24)
Cohere
7
35B
128k
$0.6
-
Amazon Bedrock
Granite 4.0 1B
IBM
7
1.6B
128k
-
-
-
Molmo2-8B
Allen Institute for AI
7
8.66B
36.9k
-
-
-
LFM2 8B A1B
Liquid AI
7
8.34B
1.5B active at inference time
32.8k
-
-
?
Granite 3.3 8B (Non-reasoning)
IBM
7
8.17B
128k
$0.1
408
Replicate
Qwen3 1.7B (Non-reasoning)
Alibaba
7
2.03B
32.0k
$0.1
-
Alibaba Cloud
Qwen3 0.6B (Reasoning)
Alibaba
6
0.752B
32.0k
$0.2
-
Alibaba Cloud
Llama 3 Instruct 8B
Meta
6
8B
8.19k
$0.1
-
Novita, DeepInfra, Amazon Bedrock, Replicate
Gemma 3n E4B Instruct
Google
6
8.39B
4B active at inference time
32.0k
$0.0
53
Together.ai
LFM2 1.2B
Liquid AI
6
1.17B
32.8k
-
-
?
Gemma 3 4B Instruct
Google
6
4.3B
128k
$0.0
-
Google, Amazon Bedrock, DeepInfra
Llama 3.2 Instruct 1B
Meta
6
1B
128k
$0.1
84
Amazon Bedrock, Novita
LFM2.5-VL-1.6B
Liquid AI
6
1.6B
32.0k
-
-
?
Granite 4.0 350M
IBM
6
0.35B
32.8k
-
-
-
Apertus 8B Instruct
Swiss AI Initiative
6
8B
65.5k
$0.1
-
Public AI
Qwen3 0.6B (Non-reasoning)
Alibaba
6
0.752B
32.0k
$0.1
-
Alibaba Cloud
Gemma 3 1B Instruct
Google
6
1B
32.0k
-
-
Google
Granite 4.0 H 350M
IBM
5
0.34B
32.8k
-
-
-
Gemma 3n E2B Instruct
Google
5
5.98B
2B active at inference time
32.0k
-
-
Google
Tiny Aya Global
Cohere
5
3.35B
8.19k
-
-
Cohere
EXAONE 4.5 33B (Non-reasoning)
LG AI Research
-
34.4B
262k
-
-
-
Cogito v2.1 (Reasoning)
Deep Cogito
-
671B
37B active at inference time
128k
$1.3
69
Together.ai