Fastest, Largest, Strongest: NVIDIA Blackwell Sweeps MLPerf Training 6.0

Tue, Jun 16 · 3:00 PM UTC · 2 min read

Compiled by KHAO Editorial — aggregated from 1 source. See llms.txt for citation guidance.



Every breakthrough AI model starts the same way: with a training run.

Key facts

Nebius, running NVIDIA Blackwell and Blackwell Ultra infrastructure on its AI cloud, enabled Higgsfield to reduce model training time by 30%, supporting a platform that now serves 22 million users
NVIDIA also submitted results at 5,120 GPUs with NVIDIA GB200 NVL72 systems on Llama 3.1 405B, one of the largest dense LLMs in the suite
In this round, NVIDIA GB300 NVL72 delivered up to 1.6x faster training than GB200 NVL72 at the same scale
MLPerf Training 6.0 added two new mixture-of-experts (MoE) pretraining workloads to the suite: DeepSeek-V3 671B and GPT-OSS-20B, reflecting the growing centrality of MoE architectures
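
The key facts quote one figure as a reduction in training time (30%) and another as a speedup factor (up to 1.6x). These come from different comparisons and are not directly comparable, but the two units convert into each other with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes "1.6x faster" means the same work finishes in 1/1.6 of the time.

```python
def speedup_to_time_reduction(speedup: float) -> float:
    """Fraction of wall-clock time saved if the same job runs `speedup` times faster."""
    if speedup <= 0:
        raise ValueError("speedup must be positive")
    return 1.0 - 1.0 / speedup


def time_reduction_to_speedup(reduction: float) -> float:
    """Equivalent speedup factor for a fractional cut in wall-clock time."""
    if not 0.0 <= reduction < 1.0:
        raise ValueError("reduction must be in [0, 1)")
    return 1.0 / (1.0 - reduction)


# Assumption: "up to 1.6x faster" is a peak figure, so this is a best case.
print(f"{speedup_to_time_reduction(1.6):.1%}")   # 37.5%

# A 30% shorter training time corresponds to roughly a 1.43x speedup.
print(f"{time_reduction_to_speedup(0.30):.2f}x")  # 1.43x
```

Read this way, a 1.6x speedup shortens a run by at most 37.5%, while a 30% reduction in training time is equivalent to about a 1.43x speedup.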

Summary

As models grow in size, complexity and intelligence, the demands on training infrastructure are also rising. In MLPerf Training 6.0, the latest in a series of rigorous, peer-reviewed industry benchmarks for evaluating AI training performance, the NVIDIA Blackwell platform led across every category. It delivered the largest-scale training run, across 8,192 GPUs using NVIDIA Blackwell NVL72 systems, and was the only platform with submissions across all seven benchmarks in the suite.

Read full article at NVIDIA Blog →
