DeepSeek-R1 at 3,872 tokens / second on a single Nvidia HGX H200 (blogs.nvidia.com)
13 points by moondistance 11 hours ago
billconan 11 hours ago
https://news.ycombinator.com/item?id=42879864
This is Cerebras' 70B number: 1,600 tokens / sec. Not sure about the costs.