Home » Tag Archives: inferencing

Tag Archives: inferencing

Nvidia compares Blackwell and Hopper GPUs on LLM inferencing

Nvidia GB200 NVL72 cluster

Nvidia has announced results for its forthcoming Blackwell GPU in the latest round of MLPerf industry benchmarks, Inference v4.1. “The Blackwell platform revealed up to 4x more performance than the Hopper architecture on MLPerf’s biggest LLM [large language model] workload, Llama 2 70B, thanks to its use of a second-generation transformer engine and FP4 tensor cores,” according to the company. ...