Available Models
Pay with USDC on Base. No account needed.
43 models
GPT-5.4
openaiMost capable and efficient frontier model with 1M context, native computer use, and thinking mode
GPT-5.4 Pro
openaiPremium GPT-5.4 with maximum compute for the hardest problems
GPT-5.3
openaiHigh intelligence with medium speed. Multimodal with vision, function calling, and structured outputs
GPT-5.2
openaiFrontier model with 400K context and adaptive reasoning
GPT-5.4 Mini
openaiStrongest mini model for coding, computer use, and subagents with GPT-5.4 capabilities
GPT-5 Mini
openaiCost-optimized reasoning and chat
GPT-5.4 Nano
openaiFastest and most affordable GPT-5.4 model for high-throughput tasks
GPT-5.2 Pro
openaiUses more compute for consistently better answers
GPT-5.3 Codex
openaiIndustry-leading agentic coding model. 400K context, reasoning, tool use, and complex execution
o1
openaiAdvanced reasoning model for complex tasks
o1-mini
openaiFast reasoning model optimized for STEM
o3
openaiLatest reasoning model with improved performance
o3-mini
openaiEfficient reasoning model for STEM tasks
GPT-OSS 20B
openaiTestnetOpen-weight 20B model (Apache 2.0), similar performance to o3-mini. Available on testnet for developer testing.
GPT-OSS 120B
openaiTestnetOpen-weight 120B model (Apache 2.0), flagship open model. Available on testnet for developer testing.
Claude Haiku 4.5
anthropicFastest and most efficient Claude, near-frontier intelligence
Claude Sonnet 4.6
anthropicBest balance of intelligence, speed, and cost
Claude Opus 4.5
anthropicLatest Anthropic flagship with enhanced reasoning and creativity
Claude Opus 4.6
anthropicLatest flagship Claude with extended 64k output, vision, and advanced reasoning
Gemini 3.1 Pro
googleLatest Gemini with improved thinking, token efficiency, and agentic capabilities. Optimized for software engineering (requires new SDK)
Gemini 3 Pro Preview
googleFlagship frontier model for high-precision multimodal reasoning
Gemini 3 Flash Preview
googleFrontier-class performance with Pro-level intelligence at Flash speed and pricing. Includes thinking mode (requires new SDK)
Gemini 2.5 Pro
googleState-of-the-art for reasoning, coding, and mathematics
Gemini 2.5 Flash
googleFast and efficient Gemini model with vision support
Gemini 3.1 Flash Lite
googleUltra-fast and lightweight Gemini 3.1 model with thinking mode for high-throughput tasks
Gemini 2.5 Flash Lite
googleMost economical Gemini model - ultra-fast and lightweight (requires new SDK)
DeepSeek V3.2 Chat
deepseekDeepSeek V3.2 non-thinking mode, excellent for chat and coding
DeepSeek V3.2 Reasoner
deepseekDeepSeek V3.2 thinking mode for complex reasoning tasks
GLM-5
zaiZ.AI's flagship foundation model with 200K context. Strong reasoning and agentic capabilities
GLM-5 Turbo
zaiOptimized GLM-5 variant with faster inference
MiniMax M2.7
minimaxMiniMax's flagship reasoning model with recursive self-improvement. Great value for complex tasks (~60 tps)
GPT-OSS 120B (Free)
nvidiaFreeOpenAI's open-weight 120B model hosted free by NVIDIA. Apache 2.0 license, great for experimentation
GPT-OSS 20B (Free)
nvidiaFreeOpenAI's open-weight 20B model hosted free by NVIDIA. Fast and efficient for simpler tasks
Kimi K2.5
nvidiaMoonshot's flagship MoE model (1T params) hosted by NVIDIA. Vision and agentic capabilities
Nemotron Ultra 253B (Free)
nvidiaFreeNVIDIA's flagship 253B reasoning model. Strong on math, coding, and instruction following
Nemotron 3 Super 120B (Free)
nvidiaFreeNVIDIA MoE model (12B active params) with thinking mode. Fast and capable reasoning
Nemotron Super 49B (Free)
nvidiaFreeNVIDIA Nemotron 49B with thinking mode. Good balance of speed and reasoning quality
DeepSeek V3.2 (Free)
nvidiaFreeDeepSeek's latest V3.2 MoE model hosted free by NVIDIA. Same quality, zero cost
Mistral Large 3 675B (Free)
nvidiaFreeMistral's flagship 675B model hosted free by NVIDIA. Largest Mistral model ever released
Qwen3 Coder 480B (Free)
nvidiaFreeQwen's 480B MoE coding model (35B active) hosted by NVIDIA. Optimized for code generation
Devstral 2 123B (Free)
nvidiaFreeMistral's 123B coding-focused model hosted free by NVIDIA. Strong code and instruction following
GLM-4.7 (Free)
nvidiaFreeZhipu AI's GLM-4.7 with thinking mode hosted by NVIDIA. Unique Chinese AI lab model
Llama 4 Maverick (Free)
nvidiaFreeMeta's Llama 4 Maverick MoE (17B x 128 experts) hosted free by NVIDIA