Chinese AI darling DeepSeek's now-infamous R1 research report was published in the journal Nature this week, alongside new information on the compute resources required to train the model. Unfortunately, some people got the wrong idea about just how expensive it was to create.

The disclosures led some to believe the company had actually managed to train the model at a cost of just $294,000, a figure much lower than previously reported. In reality, the true cost to train the model was roughly 20x that. At least.

The confusion stemmed from the supplementary information released alongside the original January paper, in which the AI model dev revealed it had used just 64 eight-way H800 boxes, totaling 512 GPUs, running at full tilt for 198 hours to train the preliminary R1-Zero r
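
For readers who want to check the arithmetic, here is a minimal Python sketch using only the figures quoted above. The variable and function names are ours, for illustration only, and the "20x" multiplier is the article's rough lower bound rather than a precise accounting, so treat the dollar result as a floor and not as the actual bill.

```python
# Back-of-the-envelope check of the figures quoted in the article.
# All names here are illustrative; only the numbers come from the text.

NODES = 64                  # eight-way H800 boxes
GPUS_PER_NODE = 8
TRAINING_HOURS = 198        # wall-clock hours for the preliminary R1-Zero run
REPORTED_COST_USD = 294_000
UNDERCOUNT_FACTOR = 20      # the article's "roughly 20x ... at least"


def gpu_hours(nodes: int, gpus_per_node: int, hours: int) -> int:
    """Total GPU-hours: every GPU in the cluster busy for the whole run."""
    return nodes * gpus_per_node * hours


total_gpus = NODES * GPUS_PER_NODE
r1_zero_gpu_hours = gpu_hours(NODES, GPUS_PER_NODE, TRAINING_HOURS)
implied_floor_usd = REPORTED_COST_USD * UNDERCOUNT_FACTOR

print(f"GPUs in the cluster:   {total_gpus}")
print(f"R1-Zero GPU-hours:     {r1_zero_gpu_hours:,}")
print(f"Implied cost floor:    ${implied_floor_usd:,}")
```

Under those assumptions, the run works out to 101,376 GPU-hours on 512 GPUs, and 20 times the $294,000 figure puts the floor at about $5.88 million.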
