NVIDIAGB200 NVL72

GB200 NVL72 connects 36 Grace CPUs and 72 Blackwell GPUs in a rack-scale design. The GB200 NVL72 is a liquid-cooled, rack-scale solution that boasts a 72-GPU NVLink domain that acts as a single massive GPU and delivers 30X faster real-time trillion-parameter LLM inference. Starting from $3.99/hr

Available at the most cost-effective pricing

Launch your AI products faster with on-demand GPUs and a global network of data center partners

Bare metal

Complete control over a physical machine providing absolute performance & control

Powered by renewable energy
  1. No noisy neighbours
  2. SpectrumX local networking
  3. 300Gbps external connectivity
  4. NVMe SSD storage

Looking to scale? Please contact us for enterprise solutions.

Speak with an expert

The NVIDIA GB200 NVL72 is perfect for a wide range of workloads

Deploying AI based workloads on CUDO Compute is easy and cost-effective. Follow our AI related tutorials.

Specifications

Starting from $3.99/hr
Architecture NVIDIA Blackwell
Configuration36 Grace CPU : 72 Blackwell GPUs
FP4 Tensor Core1,440 PFLOPS
FP8/FP6 Tensor Core720 PFLOPS
INT8 Tensor Core720 POPS
FP16/BF16 Tensor Core360 PFLOPS
TF32 Tensor Core180 PFLOPS
FP326,480 TFLOPS
FP643,240 TFLOPS
FP64 Tensor Core3,240 TFLOPS
GPU Memory | BandwidthUp to 13.5 TB HBM3e | 576 TB/s
NVLink Bandwidth130TB/s
CPU Core Count2,592 Arm® Neoverse V2 cores
CPU Memory | BandwidthUp to 17 TB LPDDR5X | Up to 18.4 TB/s

Use cases

Supercharging Next-Generation AI and Accelerated Computing

GB200 NVL72 introduces cutting-edge capabilities and a second-generation Transformer Engine which enables FP4 AI and when coupled with fifth-generation NVIDIA NVLink, delivers 30X faster real-time LLM inference performance for trillion-parameter language models.

Energy-Efficient Infrastructure

Liquid-cooled GB200 NVL72 racks reduce a data center’s carbon footprint and energy consumption. Liquid cooling increases compute density, reduces the amount of floor space used, and facilitates high-bandwidth, low-latency GPU communication with large NVLink domain architectures.

Massive-Scale Training

GB200 NVL72 includes a faster second-generation Transformer Engine featuring FP8 precision, enabling a remarkable 4X faster training for large language models at scale.

Browse alternative GPU solutions for your workloads

Access a wide range of performant NVIDIA and AMD GPUs to accelerate your AI, ML & HPC workloads

NVIDIA H100 SXM

NVIDIA H100 SXM

Quickly deploy H100 GPUs on our on-demand cloud.

NVIDIA H100 PCIe

NVIDIA H100 PCIe

Quickly deploy H100 GPUs on our on-demand cloud.

NVIDIA DGX B200

NVIDIA DGX B200

Get the highest performing DGX B200 GPUs at scale on our reserved cloud.

NVIDIA H200

NVIDIA H200

Get the highest performing H200 GPUs at scale on our reserved cloud.

NVIDIA B100

NVIDIA B100

Get the highest performing B100 GPUs at scale on our reserved cloud.

NVIDIA A40

NVIDIA A40

Quickly deploy A40 GPUs on our on-demand cloud.

NVIDIA L40S

NVIDIA L40S

Quickly deploy L40S GPUs on our on-demand cloud.

NVIDIA A100 PCIe

NVIDIA A100 PCIe

Quickly deploy A100 GPUs on our on-demand cloud.

NVIDIA V100

NVIDIA V100

Quickly deploy V100 GPUs on our on-demand cloud.

NVIDIA RTX A4000 SFF Ada

NVIDIA RTX A4000 SFF Ada

Quickly deploy RTX A4000 SFF Ada GPUs on our on-demand cloud.

NVIDIA RTX A4000

NVIDIA RTX A4000

Quickly deploy RTX A4000 GPUs on our on-demand cloud.

NVIDIA RTX A5000

NVIDIA RTX A5000

Quickly deploy RTX A5000 GPUs on our on-demand cloud.

NVIDIA RTX A6000

NVIDIA RTX A6000

Quickly deploy RTX A6000 GPUs on our on-demand cloud.

AMD MI250/300

AMD MI250/300

Get the highest performing MI250/300 GPUs at scale on our reserved cloud.

An NVIDIA preferred partner for compute

We're proud to be an NVIDIA preferred partner for compute, offering the latest GPUs and high-performance computing solutions.

Also trusted by our other key partners:

  • AMD logo
  • blendergrid logo
  • nucocloud logo
  • dpp logo

Pricing & reservation enquiry

Enquire about access today to test the GB200 NVL72 GPU Cloud, or reserve your GB200 NVL72 Cloud on CUDO Compute for as long as you want it, with unique contracts tailored to suit your needs.

Get started today or speak with an expert...