Train your models to hallucinate in less than half the time. Great!
Nvidia's MLPerf submission shows B200 offers up to 2.2x training performance of H100
Nvidia offered the first look at how its upcoming Blackwell accelerators stack up against the venerable H100 in real-world training workloads, claiming up to 2.2x higher performance. The benchmarks, released as part of this week's MLPerf results, are in line with what we expected from Blackwell at this stage. The DGX B200 …
COMMENTS
-
-
Friday 15th November 2024 09:34 GMT Korev
Re: what a big huge massive humongous elephant in the room.....
On paper, the B200 is capable of churning out 9 petaFLOPS of sparse FP8 performance, and is rated for a kilowatt of power and heat. The 1.2 kW GPUs found in Nvidia's flagship GB200, on the other hand, are each capable of churning out 10 petaFLOPS at the same precision.
-