Access frontier AI models, open-source inference, image generation, and video models through Relay.
Change base_url. Nothing else. Every OpenAI-compatible SDK works out of the box.
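As a rough illustration of the base_url swap, the sketch below builds the same OpenAI-style chat request for two hosts. `RELAY_BASE_URL` is a placeholder rather than a documented endpoint, `buildChatRequest` is a hypothetical helper, and the bearer-token header is assumed from the standard OpenAI convention. Check the Relay docs for the real values.

```javascript
// Placeholder only: replace with the OpenAI-compatible base URL from the Relay docs.
const RELAY_BASE_URL = 'https://relaygpu.com/v2/REPLACE_WITH_DOCUMENTED_PATH'

// Hypothetical helper: builds the URL and fetch options for an OpenAI-style
// chat completion. Only baseUrl varies between providers.
function buildChatRequest(baseUrl, apiKey, model, userPrompt) {
  return {
    // Strip trailing slashes so the joined path has exactly one separator.
    url: `${baseUrl.replace(/\/+$/, '')}/chat/completions`,
    options: {
      method: 'POST',
      headers: {
        'Content-Type': 'application/json',
        Authorization: `Bearer ${apiKey}` // assumed: OpenAI-style bearer auth
      },
      body: JSON.stringify({
        model,
        messages: [{ role: 'user', content: userPrompt }]
      })
    }
  }
}

// The model ID string is illustrative; use the ID listed for your account.
const direct = buildChatRequest('https://api.openai.com/v1', 'sk-direct', 'gpt-5', 'Hello')
const viaRelay = buildChatRequest(RELAY_BASE_URL, 'relay-key', 'gpt-5', 'Hello')

// The request body is identical; only the host and the key differ.
console.info(direct.options.body === viaRelay.options.body)
```

To send the request, pass the result to `fetch(viaRelay.url, viaRelay.options)`.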
Multi-provider routing. One provider goes down, requests reroute automatically. Your code never changes.
Pay per token. No monthly minimums, no seat fees. Top up once and every token bills at our discounted rate.
Claude, GPT-5, DeepSeek, Llama, Qwen, plus image, video, and audio. One API key, zero juggling.
Enterprise security from day one. EU Sovereign tier adds GDPR, zero retention, and no CLOUD Act exposure.
Every price at /v2/pricing. Build cost dashboards, alerts, and budget controls into your infra.
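A budget control built on that pricing data might look like the sketch below. The response shape of /v2/pricing is not shown here, so `samplePricing` uses an assumed structure with made-up prices, and `estimateCostUsd` and `isOverBudget` are hypothetical helpers.

```javascript
// Assumed shape; the real /v2/pricing response may differ.
// Prices are made-up illustration values in USD per million tokens.
const samplePricing = {
  'example-model': { inputPerMillion: 0.5, outputPerMillion: 1.5 }
}

// Estimate the cost of one request from its token counts.
function estimateCostUsd(pricing, model, inputTokens, outputTokens) {
  const price = pricing[model]
  if (!price) throw new Error(`No price listed for model: ${model}`)
  return (
    (inputTokens * price.inputPerMillion + outputTokens * price.outputPerMillion) /
    1_000_000
  )
}

// True when the next request would push total spend past the budget.
function isOverBudget(spentUsd, nextCostUsd, budgetUsd) {
  return spentUsd + nextCostUsd > budgetUsd
}

const cost = estimateCostUsd(samplePricing, 'example-model', 2_000_000, 1_000_000)
console.info(cost) // 2.5
console.info(isOverBudget(9, cost, 10)) // true
```

The same two functions can feed an alert or a dashboard: log `cost` per request, and block or warn when `isOverBudget` returns true.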
import axios from 'axios'

const { data } = await axios.post(
  'https://relaygpu.com/v2/ollama/api/chat',
  {
    model: 'gpt-oss:20b',
    messages: [
      {
        role: 'user',
        content: 'Break down the pros and cons of decentralized GPU compute.'
      }
    ],
    stream: false,
    think: 'low'
  },
  { headers: { 'X-API-Key': process.env.RELAY_API_KEY } }
)

console.info(data.message.content)
See how the same workload compares across direct APIs, aggregators, and Relay
Every top-up returns base credits plus a bonus. Top-ups made with the native token earn twice the bonus.
FLUX, Kling, Wan, Sora, Whisper. The same frontier generative models, routed through RelayGPU at lower cost.
Explore all models
Image To Video
Text To Image
Text To Video