Day 6 of #OpenSourceWeek: One More Thing – DeepSeek-V3/R1 Inference System Overview

DeepSeek just posted this:
Optimized throughput and latency via:
- Cross-node EP-powered batch scaling
- Computation-communication overlap
- Load balancing

Statistics of DeepSeek's Online Service:
- 73.7k/14.8k input/output tokens per second per H800 node
- Cost profit margin of 545%

We hope this week's insights offer value to the community and contribute to our shared AGI goals.

Deep Dive: bit.ly/4ihZUiO


Over the past 24 hours (UTC+8 02/27/2025 12:00 PM to 02/28/2025 12:00 PM), the combined peak node occupancy for V3 and R1 inference services reached 278, with an average occupancy of 226.75 nodes (each node contains 8 H800 GPUs). Assuming the leasing cost of one H800 GPU is $2 per hour, the total daily cost amounts to $87,072.
If all tokens were billed at DeepSeek-R1’s pricing (*), the total daily revenue would be $562,027, with a cost profit margin of 545%.
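The cost and margin figures above follow directly from the stated numbers. A minimal sketch that reproduces the arithmetic, assuming the post's inputs (226.75 average nodes, 8 H800 GPUs per node, $2 per GPU-hour, and $562,027 theoretical daily revenue at R1 pricing):

```python
# Reproduce the daily cost and cost-profit-margin figures from the post.
# All constants are taken from the post itself; nothing else is assumed.

NODES_AVG = 226.75        # average node occupancy over 24 hours
GPUS_PER_NODE = 8         # H800 GPUs per node
USD_PER_GPU_HOUR = 2      # assumed H800 leasing rate
HOURS = 24
DAILY_REVENUE = 562_027   # theoretical revenue at DeepSeek-R1 pricing

daily_cost = NODES_AVG * GPUS_PER_NODE * USD_PER_GPU_HOUR * HOURS
margin_on_cost = (DAILY_REVENUE - daily_cost) / daily_cost

print(f"daily cost: ${daily_cost:,.0f}")            # $87,072
print(f"cost profit margin: {margin_on_cost:.0%}")  # 545%
```

The "545%" is thus profit measured against cost, not against revenue.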
The cost is lower than I imagined; bullish on NVIDIA in the long run.


TL;DR

One more thing :laughing:

That 545% seems to be because the GPUs are their own. Someone calculated with rented H800s at $2/hour, and the profit margin still comes out to around 85%.

Thumbs up to DeepSeek once again.

The article actually calculates the cost based on rented GPUs.

:innocent: Is that so? Let me take another look. Someone on Twitter said the figure he computed is 84.5%.
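The 545% and 84.5% figures are not in conflict; they describe the same profit with different denominators. A quick check using the revenue and cost values from the post:

```python
# Same numbers, two denominators: profit over cost vs. profit over revenue.
revenue, cost = 562_027, 87_072  # daily figures from the post
profit = revenue - cost

margin_on_cost = profit / cost        # the "cost profit margin" in the post
margin_on_revenue = profit / revenue  # the figure quoted from Twitter

print(f"{margin_on_cost:.1%}")     # ~545.5%
print(f"{margin_on_revenue:.1%}")  # ~84.5%
```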

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.