【开源】智能，安全的 KEY 轮询网关 (减少被 ban & 429 的概率，快速部署到 Cloudflare)

ajd · 2025 年7 月 12 日 07:12

背景

NotebookLM 播客太惊艳了，一直想集成到 zenfeed，来一波 RSS+播客 (已实现【RSS + 播客】基于 L 站热帖生成播客)
但所需的 TTS 价格居高不下，当时还发帖问了波有无便宜的大模型语音合成 API ，遂搁置
天无绝人之路，大善人推出 gemini-flash-tts，音质还行，主要可以嫖免费额度 Gemini TTS 的速率限制是多少
但问题又来了，一直用的 Gemini Balance 居然只支持单人语音（貌似是 OpenAI 格式，非 gemini 多人格式），结合自己另外的一些小需求，干脆自己撸了一个…（ps 如果这里以及下面提到的差异不是你的痛点，更推荐坛里的 gb）

介绍

!!!降低封禁风险!!!: 通过 Cloudflare AI Gateway 路由请求，有效降低 API 密钥（尤其是 Gemini）被封禁的概率。大家用 cf ai gateway 能减少 gemini key G 掉的概率吗
!!!智能的错误处理（同时你只管加 KEY，不需要担心存量 KEY 的死活）!!!:
- 模型级限流: 精准识别并暂时屏蔽达到速率限制的特定模型。特别地，针对 Google AI Studio，能智能区分分钟级和天级配额，进行差异化冷却（例如，触发天级配额后冷却 24 小时）。冷却后自动换 key 重试
- 自动熔断: 永久禁用被提供商封禁（403 错误）的密钥，减少无效重试。
广泛的兼容性: 支持 Cloudflare AI Gateway 兼容的所有 API 提供商，不止轮询 Gemini，轮询 OpenAI 也是支持的

部署

CF + 部署脚本，就很快

0. 检查环境

安装 Node.js 和 pnpm。
有一个 Cloudflare 账户。

1. 创建 AI Gateway

创建一个新的 AI Gateway，并将其命名为 one-balance

2. 一键部署！

git clone https://github.com/glidea/one-balance.git
cd one-balance
pnpm install

# Mac/Linux
AUTH_KEY=your-super-secret-auth-key pnpm run deploycf

# Windows (PowerShell)
$env:AUTH_KEY = "your-super-secret-auth-key"; pnpm run deploycf

脚本将引导你登录 wrangler (如果尚未登录)，自动创建所需的 D1 数据库(AI Gateway 不支持通过wrangler创建所以得手动)，并部署 Worker。部署成功后，会得到一个 Worker 的 URL，例如 https://one-balance.<your-subdomain>.workers.dev

使用

1. 配置待轮询 KEYS

访问 https://<your-worker-url>

最佳实践：尽量避免和他人共享 Key，这样系统无法感知全局的调用信息，可能会增加 429 概率

2. 访问 API

curl "https://<your-worker-url>/api/compat/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer your-super-secret-auth-key" \
  -d '{
    "model": "google-ai-studio/gemini-2.5-pro",
    "messages": [
      {
        "role": "user",
        "content": "Hello!"
      }
    ]
  }'

注意model需要加provider前缀，详细参见：

以上方式在流式回复时，存在中文乱码的问题（AI Gateway 侧），通过调用具体 Provider 接口绕过，如 google-ai-studio

curl "https://<your-worker-url>/api/google-ai-studio/v1/models/gemini-2.5-flash:streamGenerateContent?alt=sse" \
 -H 'content-type: application/json' \
 -H 'x-goog-api-key: your-super-secret-auth-key' \
 -d '{
      "contents": [
          {
            "role":"user",
            "parts": [
              {"text":"你是谁?"}
            ]
          }
        ]
      }'

其它 provider 参考：Provider Native · Cloudflare AI Gateway docs

Cherry Studio

Gemini 格式

CleanShot 2025-07-21 at 16.21.06504×458 21.5 KB

koast18 · 2025 年7 月 12 日 07:21

应该支持的吧虽然我没用过

ajd · 2025 年7 月 12 日 07:27

这个貌似是 openai 格式的，不是可以支持多人语音的 gemini 格式

佬严谨我改下

koast18 · 2025 年7 月 12 日 07:29

高端没用过tts这些

ajd · 2025 年7 月 12 日 07:31

但应该听过 NotebookLM 的双人播客应该就是用的 gemini-2.5pro-tts

lop · 2025 年7 月 12 日 07:39

谷歌的tts怎么用呀

handsome · 2025 年7 月 12 日 07:43

太强了，大佬

ccxkai · 2025 年7 月 12 日 08:54

太强啦佬

Andyy · 2025 年7 月 12 日 08:58

@ajd 佬搭好了，也放了几个可用的key进去但是服务访问不通，500错误

ufoo · 2025 年7 月 12 日 09:04

大佬，请问这个项目是纯部署在cloudflare的吗还是要有自己的vps

inlook · 2025 年7 月 12 日 09:24

感谢分享

ajd · 2025 年7 月 12 日 09:26

单纯curl呢，上面有例子。另外可以登陆cf看worker日志

ajd · 2025 年7 月 12 日 09:27

纯cf，嘎嘎方便

ufoo · 2025 年7 月 12 日 09:41

感谢大佬，star+fork已操作。并期待后续继续努力优化

Andyy · 2025 年7 月 12 日 09:55

@ajd 真有bug，日志里看到了请求/api/compact，竟然去用了openai的key，然后去站点里看gemini的key全无了
重试了确实会自动删除gemini的key…

ajd · 2025 年7 月 12 日 10:51

Model加前缀了没，根据前缀确定provider的，可以看cf ai gateway文档

ajd · 2025 年7 月 12 日 10:51

返回403就会被ban

passerby064857 · 2025 年7 月 12 日 11:50

我也遇到了，我后来发现URL调用的后缀是/api/compat/chat/completions

在cherry studio里要填：/api/compat/chat/completions#

Andyy · 2025 年7 月 12 日 12:39

好人呐，地址没错，cherry srudio里可以再简化下/api/compat/就行了

但是调用成功一次后所有gemini key会被立即block，你遇到过吗

ajd · 2025 年7 月 12 日 13:29

感谢佬反馈，文档又写错了已修正

话题		回复	浏览量
又一款实现 Claude Code 自由的神器（Cloudflare Worker 云端转换，本地直接用)【附 Gemini 版部署教程】开发调优人工智能	63	2492	2025 年9 月 2 日
[别用简单密码] [全功能+Cloudflare Workers] 最简方法免费部署GCP Claude3.5 Sonnet Vertex无损转Anthropic官方版本API + 全套教程 + 可用NextChat、酒馆等资源荟萃人工智能	369	19978	2024 年12 月 15 日
[OneBalance] 基于 Cloudflare Worker 的轮询项目更新日志开发调优人工智能 , 纯水	28	436	2025 年8 月 10 日
【自荐】也许是最好用的 Gemini 客户端，一键部署、内置代理，麻雀虽小，五脏俱全开发调优人工智能	277	7553	2025 年4 月 12 日
api Proxy 支持Cloudflare Workers 一键部署【开源】资源荟萃 Proxy , 人工智能	71	1522	2025 年6 月 6 日