There’s a gap in the cloud market for customers running large-scale AI/ML workloads on legacy hyperscalers. Customers spend precious time building and retrofitting their stack on these general-purpose clouds just to get started. Alternative clouds are typically focused on smaller projects and are therefore unable to provide the scale of compute needed to deploy enterprise-grade large language models (LLMs). That’s why CoreWeave began building the first-ever AI Hyperscaler.
We’ve homed in on delivering high-performance, scalable, secure, fully managed infrastructure for AI/ML customers. Our teams built first-of-their-kind tools for automated node health-checking, fast model copying and loading, and streamlined workload management. CoreWeave customers have unprecedented visibility into their clusters, and we work hand-in-hand with engineering teams to resolve issues quickly and seamlessly.
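Our health-checking stack is proprietary, but because CoreWeave is Kubernetes-native, the core idea can be sketched with the standard Kubernetes Python client. The snippet below is a simplified illustration of the concept, not our production tooling:

```python
# A minimal sketch of automated node health-checking on a Kubernetes-native
# cloud, using the official Kubernetes Python client. Illustration only;
# real-world tooling covers far more signals (GPU, fabric, thermals, etc.).
from kubernetes import client, config

def unhealthy_nodes():
    config.load_kube_config()  # or load_incluster_config() when run inside a pod
    v1 = client.CoreV1Api()
    bad = []
    for node in v1.list_node().items:
        for cond in node.status.conditions or []:
            # "Ready" must be True; pressure/unavailable conditions must be False.
            healthy = (cond.status == "True") if cond.type == "Ready" else (cond.status == "False")
            if not healthy:
                bad.append((node.metadata.name, cond.type, cond.status))
    return bad

if __name__ == "__main__":
    for name, cond, status in unhealthy_nodes():
        print(f"node {name}: condition {cond}={status}")
```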
We’re proud of our ability to innovate. But that doesn’t mean we’re done making progress. After all, AI shows no signs of slowing down.
As the scale of LLMs grows, our AI infrastructure continues to evolve at breakneck speeds to power a new era of computing.
CoreWeave is proud to be among the first providers to deliver large-scale NVIDIA Blackwell clusters to market, offering two configurations: the NVIDIA HGX B200 and the NVIDIA GB200 NVL72, both interconnected with ultra-fast NVIDIA Quantum-2 InfiniBand networking.
- The NVIDIA HGX B200 is designed for the most demanding AI, data processing, and high-performance computing workloads. Powered by NVIDIA Blackwell GPUs, the HGX B200 x86 platform delivers up to 15X faster real-time inference on trillion-parameter models than the previous NVIDIA Hopper generation.
- The NVIDIA GB200 NVL72 is a liquid-cooled, rack-scale solution that connects 36 NVIDIA Grace CPUs and 72 NVIDIA Blackwell GPUs and delivers up to 30X faster real-time trillion-parameter LLM inference than the same number of NVIDIA H100 GPUs.
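On a Kubernetes-native cloud, landing a workload on one of these configurations is a matter of requesting the right resources in a pod spec. The sketch below uses the official Kubernetes Python client; the node-selector label and container image are illustrative placeholders, not CoreWeave's actual scheduling labels:

```python
# A hedged sketch: scheduling a pod onto an 8-GPU Blackwell node with the
# official Kubernetes Python client. "example.com/gpu-class" is a
# hypothetical label chosen for illustration.
from kubernetes import client, config

config.load_kube_config()
pod = client.V1Pod(
    metadata=client.V1ObjectMeta(name="bench-b200"),
    spec=client.V1PodSpec(
        restart_policy="Never",
        node_selector={"example.com/gpu-class": "B200"},  # hypothetical label
        containers=[
            client.V1Container(
                name="bench",
                image="nvcr.io/nvidia/pytorch:24.10-py3",  # example NGC image
                command=["nvidia-smi"],
                resources=client.V1ResourceRequirements(
                    limits={"nvidia.com/gpu": "8"}  # all eight GPUs on an HGX B200 node
                ),
            )
        ],
    ),
)
client.CoreV1Api().create_namespaced_pod(namespace="default", body=pod)
```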
NVIDIA introduced its Blackwell architecture at NVIDIA GTC earlier this year. With the ability to link up to 72 GPUs in a single NVIDIA NVLink domain, Blackwell innovations enable greater GPU-GPU communication bandwidth to train and deploy massive models. This opens up possibilities for new use cases far beyond AI training and inference, such as quantum computing, drug discovery, and fusion energy.
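The workloads that benefit most from a large NVLink domain are collective operations, like the all-reduce at the heart of data-parallel training, which can stay on NVLink instead of crossing slower interconnects. As a minimal sketch of that kind of collective, here is a standard PyTorch all-reduce over the NCCL backend (script name and tensor size are arbitrary):

```python
# Minimal sketch of an NCCL all-reduce, the collective that wide NVLink
# domains accelerate. Launch: torchrun --nproc_per_node=<num_gpus> allreduce_sketch.py
import os
import torch
import torch.distributed as dist

def main():
    # torchrun sets RANK, WORLD_SIZE, and LOCAL_RANK for each process.
    dist.init_process_group(backend="nccl")
    torch.cuda.set_device(int(os.environ["LOCAL_RANK"]))

    # Each rank contributes a gradient-sized tensor; all_reduce sums it
    # across every GPU, riding NVLink when the GPUs share a domain.
    grads = torch.full((1 << 20,), float(dist.get_rank()), device="cuda")
    dist.all_reduce(grads, op=dist.ReduceOp.SUM)

    if dist.get_rank() == 0:
        # Every element now holds the sum of ranks 0..N-1, i.e. N*(N-1)/2.
        print(f"world={dist.get_world_size()} grads[0]={grads[0].item()}")
    dist.destroy_process_group()

if __name__ == "__main__":
    main()
```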
Scientific computing and physics-based simulations are positioned to make large strides as AI powers the world’s next breakthroughs.
With CoreWeave’s fully managed infrastructure, you can spend less time on DevOps overhead and more time on what matters: deploying workloads with confidence and going to market with your innovations faster. CoreWeave customers enjoy more control and visibility over their clusters, and our suite of proprietary software, purpose-built for AI, ensures maximum resource performance and efficiency.
If you’re ready to partner with us, reach out here.