Microsoft · Blackwell · DeepSeek · Nvidia · Google · Llama · NVIDIA Blog
Fastest, Largest, Strongest: NVIDIA Blackwell Sweeps MLPerf Teaching 6.0
Compiled by KHAO Editorial — aggregated from 1 source. See llms.txt for citation guidance.
★ Tier-1 Source
Every breakthrough AI model starts the same way: with a training run.
Key facts
- Nebius, running NVIDIA Blackwell and Blackwell Ultra infrastructure on its AI cloud, enabled Higgsfield to reduce model training time by 30%, supporting a platform that now serves 22 million users
- NVIDIA also submitted results at 5,120 GPUs with NVIDIA GB200 NVL72 systems on Llama 3.1 405B, one of the largest dense LLMs in the suite
- NVIDIA GB300 NVL72 Delivered up to 1.6x Performance Over GB200 NVL72: In this round, GB300 NVL72 delivered up to 1.6x faster training than GB200 NVL72 at the same scale
- MLPerf Training 6.0 added two new mixture-of-experts (MoE) pretraining workloads to the suite: DeepSeek-V3 671B and GPT-OSS-20B, reflecting the growing centrality of MoE architectures
Summary
As models grow in size, complexity and intelligence, the demands on training infrastructure are also rising. In MLPerf Training 6.0, the latest of a series of rigorous, peer-reviewed industry benchmarks for evaluating AI training performance, the NVIDIA Blackwell platform led across every category, demonstrating:. Largest-scale training across 8,192 GPUs using NVIDIA Blackwell NVL72 systems. The only platform with submissions across all seven benchmarks in the suite.