NVIDIAGB200 NVL72

GB200 NVL72 connects 36 Grace CPUs and 72 Blackwell GPUs in a rack-scale design. The GB200 NVL72 is a liquid-cooled, rack-scale solution that boasts a 72-GPU NVLink domain that acts as a single massive GPU and delivers 30X faster real-time trillion-parameter LLM inference.

Get pricing Contact sales
+44 20 8050 7646

Available Mon-Fri 9am-5pm UK time

Available at the most cost-effective pricing

Launch your AI products faster with on-demand GPUs and a global network of data center partners

Bare metal

Complete control over a physical machine for more control.

  1. Powered by renewable energy
  2. No noisy neighbours
  3. SpectrumX local networking
  4. 300Gbps external connectivity
  5. NVMe SSD storage

Pricing available on request

Enterprise

We offer a range of solutions for enterprise customers.

  1. Powerful GPU clusters
  2. Scalable data center colocation
  3. Large quantities of GPUs and hardware
  4. Optimise to your requirements
  5. Expert installation
  6. Scale as your demand grows

Specifications

Browse specifications for the NVIDIA GB200 NVL72 GPU

Starting from Contact us for pricing
Architecture NVIDIA Blackwell
Configuration36 Grace CPU : 72 Blackwell GPUs
FP4 Tensor Core1,440 PFLOPS
FP8/FP6 Tensor Core720 PFLOPS
INT8 Tensor Core720 POPS
FP16/BF16 Tensor Core360 PFLOPS
TF32 Tensor Core180 PFLOPS
FP326,480 TFLOPS
FP643,240 TFLOPS
FP64 Tensor Core3,240 TFLOPS
GPU Memory | BandwidthUp to 13.5 TB HBM3e | 576 TB/s
NVLink Bandwidth130TB/s
CPU Core Count2,592 Arm® Neoverse V2 cores
CPU Memory | BandwidthUp to 17 TB LPDDR5X | Up to 18.4 TB/s

Ideal uses cases for the NVIDIA GB200 NVL72 GPU

Explore uses cases for the NVIDIA GB200 NVL72 including Supercharging Next-Generation AI and Accelerated Computing, Energy-Efficient Infrastructure, Massive-Scale Training.

Supercharging Next-Generation AI and Accelerated Computing

GB200 NVL72 introduces cutting-edge capabilities and a second-generation Transformer Engine which enables FP4 AI and when coupled with fifth-generation NVIDIA NVLink, delivers 30X faster real-time LLM inference performance for trillion-parameter language models.

Energy-Efficient Infrastructure

Liquid-cooled GB200 NVL72 racks reduce a data center’s carbon footprint and energy consumption. Liquid cooling increases compute density, reduces the amount of floor space used, and facilitates high-bandwidth, low-latency GPU communication with large NVLink domain architectures.

Massive-Scale Training

GB200 NVL72 includes a faster second-generation Transformer Engine featuring FP8 precision, enabling a remarkable 4X faster training for large language models at scale.

Browse alternative GPU solutions for your workloads

Access a wide range of performant NVIDIA and AMD GPUs to accelerate your AI, ML & HPC workloads

NVIDIA H100 SXM

NVIDIA H100 SXM

from $2.45 /hr

Deploy performant H100s on-demand with CUDO Compute.

NVIDIA H100 PCIe

NVIDIA H100 PCIe

from $2.45 /hr

Deploy performant H100s on-demand with CUDO Compute.

NVIDIA HGX B200

NVIDIA HGX B200

Pricing on request.

Scale with high performance HGX B200 GPUs on our reserved cloud.

NVIDIA A800 PCIe

NVIDIA A800 PCIe

from $0.80 /hr

Deploy performant A800s on-demand with CUDO Compute.

NVIDIA H200 SXM

NVIDIA H200 SXM

Pricing on request.

Deploy performant H200s on-demand with CUDO Compute.

NVIDIA B100

NVIDIA B100

Pricing on request.

Scale with high performance B100 GPUs on our reserved cloud.

NVIDIA A40

NVIDIA A40

from $0.39 /hr

Deploy performant A40s on-demand with CUDO Compute.

NVIDIA L40S

NVIDIA L40S

from $1.42 /hr

Deploy performant L40Ss on-demand with CUDO Compute.

NVIDIA A100 PCIe

NVIDIA A100 PCIe

from $1.50 /hr

Deploy performant A100s on-demand with CUDO Compute.

NVIDIA V100

NVIDIA V100

from $0.39 /hr

Deploy performant V100s on-demand with CUDO Compute.

NVIDIA RTX 4000 SFF Ada

NVIDIA RTX 4000 SFF Ada

Pricing on request.

Deploy performant RTX 4000 SFF Adas on-demand with CUDO Compute.

NVIDIA RTX A4000

NVIDIA RTX A4000

Pricing on request.

Scale with high performance RTX A4000 GPUs on our reserved cloud.

NVIDIA RTX A5000

NVIDIA RTX A5000

from $0.35 /hr

Deploy performant RTX A5000s on-demand with CUDO Compute.

NVIDIA RTX A6000

NVIDIA RTX A6000

from $0.45 /hr

Deploy performant RTX A6000s on-demand with CUDO Compute.

AMD MI250/300

AMD MI250/300

Pricing on request.

Scale with high performance MI250/300 GPUs on our reserved cloud.

An NVIDIA preferred partner for compute

We're proud to be an NVIDIA preferred partner for compute, offering the latest GPUs and high-performance computing solutions.

Also trusted by our other key partners:

Talk to sales

  • Reserve GPUs. Access a GB200 NVL72 GPU Cloud alongside other high performance models for as long as you need it.

  • Deployment & scaling. Seamless deployment alongside expert installation, ready to scale as your demands grow.

"CUDO Compute is a true pioneer in aggregating the world's cloud in a sustainable way, enabling service providers like us to integrate with ease"

VPS AI logo

Get started today or speak with an expert...