SiliconCloud Model Usage Guide

Reno · June 15, 2024, 01:53

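Introduction

SiliconCloud provides cost-effective GenAI services built on strong open-source foundation models.

Unlike most LLM cloud platforms, which only offer APIs for their own models, SiliconCloud hosts a range of open-source large language models and image generation models, including Qwen, DeepSeek, GLM, Yi, Mistral, LLaMA 3, SDXL, and InstantID, so users can freely switch to whichever model suits their use case.

More importantly, SiliconCloud offers out-of-the-box LLM inference acceleration, giving your GenAI application a more responsive user experience.

Note: during the testing period (before June 18), a credit of 42 yuan is given away.

Update 2024-06-25: new users now receive a 14 yuan credit, and small models are free.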

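Free models

Getting started

In the current public beta, the rate limits are 100 RPM (requests per minute) and 3 RPS (requests per second).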
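
To stay under both limits from the client side, a small sliding-window check can run before each request. The sketch below is only an illustration: `createWindow` and `createLimiter` are hypothetical helper names, not part of any SiliconCloud SDK, and the defaults simply mirror the 3 RPS / 100 RPM figures quoted above.

```javascript
// Track accepted calls inside one time window (hypothetical helper).
function createWindow(limit, windowMs) {
  const stamps = []; // timestamps (ms) of calls accepted within the window
  return {
    hasRoom(now) {
      // Discard timestamps that have aged out of the window.
      while (stamps.length > 0 && now - stamps[0] >= windowMs) stamps.shift();
      return stamps.length < limit;
    },
    record(now) {
      stamps.push(now);
    }
  };
}

// Combine a per-second and a per-minute window.
// Defaults are taken from the public-beta limits mentioned in the post.
function createLimiter(rps = 3, rpm = 100) {
  const second = createWindow(rps, 1000);
  const minute = createWindow(rpm, 60000);
  return function tryAcquire(now = Date.now()) {
    // Check both windows first so a refusal never consumes a slot.
    const secondOk = second.hasRoom(now);
    const minuteOk = minute.hasRoom(now);
    if (!(secondOk && minuteOk)) return false;
    second.record(now);
    minute.record(now);
    return true;
  };
}
```

Call `tryAcquire()` before each request; when it returns `false`, wait briefly and try again rather than sending the request.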

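Register an account

Visit the registration page to create an account.

Get an API key

Visit the API keys page and click "Create new API key" to get a key.

one-api configuration

Chat models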

base_url: https://api.siliconflow.cn
key: your API key
model: deepseek-v2-chat, deepseek-llm-67b-chat, Qwen2-72B-Instruct, Qwen2-7B-Instruct, Qwen2-57B-A14B-Instruct, Qwen1.5-110B-Chat, Qwen1.5-32B-Chat, Qwen1.5-14B-Chat, Qwen1.5-7B-Chat, Yi-1.5-34B-Chat, Yi-1.5-9B-Chat, Yi-1.5-6B-Chat, chatglm3-6B, glm4-9B-chat, Mixtral-8x7B-Instruct-v0.1, Mistral-7B-Instruct-v0.2, gemma-7b-it, gemma-2b-it
Model redirects:

{
  "deepseek-v2-chat": "deepseek-ai/DeepSeek-V2-Chat",
  "deepseek-llm-67b-chat": "deepseek-ai/deepseek-llm-67b-chat",
  "deepseek-coder-v2-instruct": "deepseek-ai/DeepSeek-Coder-V2-Instruct",
  "Qwen2-72B-Instruct": "Qwen/Qwen2-72B-Instruct",
  "Qwen2-7B-Instruct": "Qwen/Qwen2-7B-Instruct",
  "Qwen2-57B-A14B-Instruct": "Qwen/Qwen2-57B-A14B-Instruct",
  "Qwen1.5-110B-Chat": "Qwen/Qwen1.5-110B-Chat",
  "Qwen1.5-32B-Chat": "Qwen/Qwen1.5-32B-Chat",
  "Qwen1.5-14B-Chat": "Qwen/Qwen1.5-14B-Chat",
  "Qwen1.5-7B-Chat": "Qwen/Qwen1.5-7B-Chat",
  "Yi-1.5-34B-Chat": "01-ai/Yi-1.5-34B-Chat-16K",
  "Yi-1.5-9B-Chat": "01-ai/Yi-1.5-9B-Chat-16K",
  "Yi-1.5-6B-Chat": "01-ai/Yi-1.5-6B-Chat",
  "chatglm3-6B": "THUDM/chatglm3-6b",
  "glm4-9B-chat": "THUDM/glm-4-9b-chat",
  "Mixtral-8x7B-Instruct-v0.1": "mistralai/Mixtral-8x7B-Instruct-v0.1",
  "Mistral-7B-Instruct-v0.2": "mistralai/Mistral-7B-Instruct-v0.2",
  "gemma-7b-it": "meta-llama/Meta-Llama-3-8B-Instruct",
  "gemma-2b-it": "meta-llama/Meta-Llama-3-8B-Instruct"
}
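
For anyone calling the API directly instead of going through one-api, the redirect table amounts to a lookup from a short alias to the full upstream model ID. The sketch below shows that lookup and the request it would feed. It assumes the service exposes an OpenAI-compatible `/v1/chat/completions` path with a `Bearer` key, which the post does not state explicitly; `buildChatRequest` is a hypothetical helper, and only three of the aliases are copied in to keep it short.

```javascript
// A few aliases copied from the redirect table above; extend as needed.
const MODEL_REDIRECTS = {
  "deepseek-v2-chat": "deepseek-ai/DeepSeek-V2-Chat",
  "Qwen2-7B-Instruct": "Qwen/Qwen2-7B-Instruct",
  "glm4-9B-chat": "THUDM/glm-4-9b-chat"
};

// Build the URL and fetch() options for one chat request (hypothetical helper).
// Assumption: an OpenAI-compatible endpoint at /v1/chat/completions.
function buildChatRequest(alias, messages, apiKey) {
  // Resolve the alias; unknown names are passed through unchanged.
  const known = Object.prototype.hasOwnProperty.call(MODEL_REDIRECTS, alias);
  const model = known ? MODEL_REDIRECTS[alias] : alias;
  return {
    url: "https://api.siliconflow.cn/v1/chat/completions",
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        "Authorization": `Bearer ${apiKey}`
      },
      body: JSON.stringify({ model, messages })
    }
  };
}
```

The result can be handed straight to `fetch(url, init)`; nothing is sent until that call is made.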

Text-to-image models

base_url: the URL of your worker
key: your API key
model: stable-diffusion-3-medium, stable-diffusion-xl-base-1.0, stable-diffusion-2-1, sd-turbo, sdxl-turbo, SDXL-Lightning
Model redirects:

{
  "stable-diffusion-3-medium": "stabilityai/stable-diffusion-3-medium",
  "stable-diffusion-xl-base-1.0": "stabilityai/stable-diffusion-xl-base-1.0",
  "stable-diffusion-2-1": "stabilityai/stable-diffusion-2-1",
  "sd-turbo": "stabilityai/sd-turbo",
  "sdxl-turbo": "stabilityai/sdxl-turbo",
  "SDXL-Lightning": "ByteDance/SDXL-Lightning"
}

worker.js

// Cloudflare Worker that exposes the SiliconCloud text-to-image endpoint as an
// OpenAI-style streaming chat completion, so one-api can route to it like a chat model.
addEventListener('fetch', event => {
  event.respondWith(handleRequest(event.request))
})

// CORS headers attached to every response.
const CORS_HEADERS = {
  'Access-Control-Allow-Origin': '*',
  'Access-Control-Allow-Headers': '*'
};

// Return errors as JSON so the body matches the declared Content-Type.
function errorResponse(status, message) {
  return new Response(JSON.stringify({ error: { message } }), {
    status,
    headers: { 'Content-Type': 'application/json', ...CORS_HEADERS }
  });
}

async function handleRequest(request) {
  try {
    const body = await request.json();
    const { model, messages } = body;

    if (!model || !Array.isArray(messages) || messages.length === 0) {
      return errorResponse(400, 'Bad Request: Missing required fields');
    }

    // The last message in the conversation is used as the image prompt.
    const prompt = messages[messages.length - 1].content;
    const newUrl = `https://api.siliconflow.cn/v1/${model}/text-to-image`;

    const newRequestBody = {
      prompt: prompt,
      image_size: "1024x1024",
      batch_size: 1,
      num_inference_steps: 4,
      guidance_scale: 1
    };

    // Forward the caller's Authorization header (the SiliconCloud key) upstream.
    const modifiedRequest = new Request(newUrl, {
      method: 'POST',
      headers: {
        'accept': 'application/json',
        'content-type': 'application/json',
        'Authorization': request.headers.get('Authorization')
      },
      body: JSON.stringify(newRequestBody)
    });

    const response = await fetch(modifiedRequest);
    if (!response.ok) {
      // Surface upstream failures instead of failing later on a missing field.
      return errorResponse(response.status, `Upstream error: HTTP ${response.status}`);
    }
    const responseBody = await response.json();

    const imageUrl = responseBody.images[0].url;
    const uniqueId = Date.now();
    const currentTimestamp = Math.floor(uniqueId / 1000);

    // Wrap the image URL in a single chat.completion.chunk as a Markdown image.
    const responsePayload = {
      id: String(uniqueId), // OpenAI-style ids are strings
      object: "chat.completion.chunk",
      created: currentTimestamp,
      model: model,
      choices: [
        {
          index: 0,
          delta: {
            content: `![](${imageUrl})`
          },
          finish_reason: "stop"
        }
      ]
    };

    const dataString = JSON.stringify(responsePayload);

    // Send the chunk as one server-sent event, followed by the end-of-stream marker.
    return new Response(`data: ${dataString}\n\ndata: [DONE]\n\n`, {
      status: 200,
      headers: {
        "Content-Type": "text/event-stream",
        ...CORS_HEADERS
      }
    });
  } catch (error) {
    return errorResponse(500, "Internal Server Error: " + error.message);
  }
}
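
On the client side, the worker's reply is a server-sent-event body whose `delta.content` holds the image as a Markdown link. The following sketch pulls the URL back out of such a body. `extractImageUrls` is a hypothetical helper written for illustration, and it assumes the payload shape produced by the worker above (a `choices[0].delta.content` string containing `![](url)`).

```javascript
// Extract image URLs from an SSE body like the one the worker returns.
// Hypothetical helper; assumes OpenAI-style "data: {json}" lines.
function extractImageUrls(sseText) {
  const urls = [];
  for (const line of sseText.split("\n")) {
    if (!line.startsWith("data:")) continue;
    const payload = line.slice(5).trim();
    // Skip blank payloads and an end-of-stream marker, if one is present.
    if (payload === "" || payload === "[DONE]") continue;
    let chunk;
    try {
      chunk = JSON.parse(payload);
    } catch {
      continue; // ignore lines that are not valid JSON
    }
    const content = chunk?.choices?.[0]?.delta?.content ?? "";
    // Match Markdown image syntax: ![alt](url)
    const pattern = /!\[[^\]]*\]\(([^)\s]+)\)/g;
    let match;
    while ((match = pattern.exec(content)) !== null) {
      urls.push(match[1]);
    }
  }
  return urls;
}
```

A chat front end that renders Markdown does not need this step, since it will display the image directly; the helper is only useful when a script needs the raw URL.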

References

Official documentation

binaryYuki · June 15, 2024, 01:54

mark~

delph1s · June 15, 2024, 01:59

Reno is amazing!

bbb · June 15, 2024, 02:01

Practically spoon-fed to us~

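kingtest · June 15, 2024, 02:52

It works in NextChat, but not when connected through new-api; it reports that the request to the upstream address failed.

suixin6 · June 15, 2024, 03:22

No real-name verification, and it works out of the box? How could I not grab some of this? Does it end on June 18?

Reno · June 15, 2024, 03:24

Hurry!

bbb · June 15, 2024, 03:25

This works some of the time and fails at other times; I don't know the exact cause.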

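Leon01 · June 15, 2024, 03:43

SiliconFlow's DeepSeek streams tokens even faster than DeepSeek's own official API…

OICQ · June 15, 2024, 03:55

Here, DeepSeek-V2 costs more than glm-4-9b-chat.

kkk2 · June 15, 2024, 03:56

Indeed. I've already got 3 accounts, but it seems a conversation gets disconnected once it runs past 10 seconds.

handsome · June 15, 2024, 03:57

Reno is amazing!

Reno · June 15, 2024, 04:01

Wait, that sounds like it could get you in trouble.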

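lueluelue · June 15, 2024, 04:16

Thank you so much.

OICQ · June 15, 2024, 04:20

This is SiliconLLM, right?

Meatbuns · June 15, 2024, 04:21

Marking this.

dannyng · June 15, 2024, 04:22

Useful, thanks, haha.

yizhgood001 · June 15, 2024, 04:22

Impressive, thanks.

ljoker · June 15, 2024, 04:49

On my end, email registration almost never receives the verification code. Yaya, did you manage to claim yours?

suixin6 · June 15, 2024, 04:57

Haven't tried.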
