CoreWeave Achieves New Record-Breaking AI Inferencing Benchmark with NVIDIA GB200 Grace Blackwell Superchips

CoreWeave set a new AI inference record using NVIDIA GB200 Superchips, achieving 800 TPS on Llama 3.1 405B and boosting Llama 2 70B throughput by 40% with H200 GPUs.

CoreWeave is the first cloud service provider to submit MLPerf Inference v5.0 results for NVIDIA GB200 Superchips

LIVINGSTON, N.J., April 2, 2025 /PRNewswire/ -- CoreWeave, the AI Hyperscaler™, today announced its MLPerf v5.0 results, setting a new industry benchmark in AI inference with NVIDIA GB200 Grace Blackwell Superchips. Using a CoreWeave instance with NVIDIA GB200, featuring two NVIDIA Grace CPUs and four NVIDIA Blackwell GPUs, CoreWeave delivered 800 tokens per second (TPS) on the Llama 3.1 405B model [1], one of the largest open-source models.

"CoreWeave is committed to delivering cutting-edge infrastructure optimized for large-model inference through our purpose-built cloud platform," said Peter Salanki, Chief Technology Officer at CoreWeave. "These benchmark MLPerf results reinforce CoreWeave's position as a preferred cloud provider for leading AI labs and enterprises."

CoreWeave also submitted new results for NVIDIA H200 GPU instances, achieving 33,000 TPS on the Llama 2 70B model, a 40 percent throughput improvement over NVIDIA H100 instances. [2]

These results further establish CoreWeave as an industry-leading cloud infrastructure provider. This year, the company became the first to offer general availability of NVIDIA GB200 NVL72-based instances. Last year, it was among the first to offer NVIDIA H100 and H200 GPUs, and one of the first to demo NVIDIA GB200 NVL72.

MLPerf Inference is an industry-standard suite for measuring machine learning performance across realistic deployment scenarios. How quickly systems can process inputs and produce results using a trained model has a direct impact on user experience.
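For readers unfamiliar with the headline metric, tokens per second (TPS) is simply the number of tokens a system generates divided by the wall-clock time it takes. A minimal illustrative sketch (not CoreWeave's benchmark harness, and the numbers below are hypothetical):

```python
def tokens_per_second(total_tokens: int, elapsed_seconds: float) -> float:
    """Return inference throughput in tokens per second (TPS)."""
    return total_tokens / elapsed_seconds

# Hypothetical example: 48,000 tokens generated in 60 seconds
# works out to 800 TPS, the same figure as the GB200 headline result.
print(tokens_per_second(48_000, 60.0))  # 800.0
```

In practice, MLPerf Inference scenarios (offline, server) constrain how requests arrive and what latency targets must be met, so published scores reflect far more than this raw division.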

About CoreWeave

CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to move at the pace of innovation, building and scaling AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave serves as a force multiplier by combining superior infrastructure performance with deep technical expertise to accelerate breakthroughs. Established in 2017, CoreWeave completed its public listing on Nasdaq (CRWV) in March 2025. Learn more at www.coreweave.com.

Media Contact: Gurion Kastenberg, [email protected]

[1] Verified MLPerf® score of v5.0 Inference Closed Llama 3.1 405B offline. Retrieved from https://mlcommons.org/benchmarks/inference, 2 April 2025, entry 5.0-0076. The MLPerf name and logo are registered and unregistered trademarks of MLCommons Association in the United States and other countries. All rights reserved. Unauthorized use strictly prohibited. See www.mlcommons.org for more information.

[2] Verified MLPerf® score of v5.0 Inference Closed Llama 2 70B server. Retrieved from https://mlcommons.org/benchmarks/inference, 2 April 2025, entry 5.0-0077.

SOURCE CoreWeave


