NVIDIAHGX B200

Eight NVIDIA Blackwell GPUs interconnected with fifth-generation NVIDIA® NVLink®, B200 delivers leading-edge performance, offering 3X the training performance and 15X the inference performance of previous generations.

Get pricing Contact sales
+44 20 8050 7646

Available Mon-Fri 9am-5pm UK time

Available at the most cost-effective pricing

Launch your AI products faster with on-demand GPUs and a global network of data center partners

Enterprise

We offer a range of solutions for enterprise customers.

  1. Powerful GPU clusters
  2. Scalable data center colocation
  3. Large quantities of GPUs and hardware
  4. Optimise to your requirements
  5. Expert installation
  6. Scale as your demand grows

Specifications

Browse specifications for the NVIDIA HGX B200 GPU

Starting from Contact us for pricing
Architecture NVIDIA Blackwell
GPU8x NVIDIA Blackwell GPUs
GPU Memory1,440GB total GPU memory
Performance72 petaFLOPS training and 144 petaFLOPS inference
CPU2 Intel® Xeon® Platinum 8570 Processors - 112 Cores total, 2.1 GHz(Base), 4 GHz(Max Boost)
System MemoryUp to 4TB
Networking4x OSFP ports serving 8x single-port NVIDIA ConnectX-7 VPI | 2x dual-port QSFP112 NVIDIA BlueField-3 DPU
Management Network10Gb/s onboard NIC with RJ45, 100Gb/s dual-port ethernet NIC, Host baseboard management controller (BMC) with RJ45
StorageOS: 2x 1.9TB NVMe M.2, Internal storage: 8x 3.84TB NVMe U.2
SoftwareNVIDIA AI Enterprise: Optimized AI Software, NVIDIA Base Command™: Orchestration, Scheduling, and Cluster Management, DGX OS / Ubuntu: Operating system

Ideal uses cases for the NVIDIA HGX B200 GPU

Explore uses cases for the NVIDIA HGX B200 including Powerhouse of AI Performance, Real Time Large Language Model Inference, High-performance computing.

Powerhouse of AI Performance

Powered by the NVIDIA Blackwell architecture’s advancements in computing, B200 delivers 3X the training performance and 15X the inference performance of H100.

Real Time Large Language Model Inference

Token-to-token latency (TTL) = 50ms real time, first token latency (FTL) = 5s, input sequence length = 32,768, output sequence length = 1,028, 8x eight-way H100 GPUs air-cooled vs. 1x eight-way B200 air-cooled, per GPU performance comparison​.

High-performance computing

From complex scientific simulations to weather forecasting and intricate financial modelling, the B200 will empower diverse organizations to accelerate high-performance computing tasks. Its unmatched memory bandwidth and processing capabilities ensure smooth operation for workloads of any scale, allowing you to achieve unmatched results faster than ever.

Browse alternative GPU solutions for your workloads

Access a wide range of performant NVIDIA and AMD GPUs to accelerate your AI, ML & HPC workloads

NVIDIA H100 SXM

NVIDIA H100 SXM

from $2.45 /hr

Deploy performant H100s on-demand with CUDO Compute.

NVIDIA H100 PCIe

NVIDIA H100 PCIe

from $2.45 /hr

Deploy performant H100s on-demand with CUDO Compute.

NVIDIA GB200 NVL72

NVIDIA GB200 NVL72

Pricing on request.

Scale with high performance GB200 NVL72 GPUs on our reserved cloud.

NVIDIA A800 PCIe

NVIDIA A800 PCIe

from $0.80 /hr

Deploy performant A800s on-demand with CUDO Compute.

NVIDIA H200 SXM

NVIDIA H200 SXM

Pricing on request.

Deploy performant H200s on-demand with CUDO Compute.

NVIDIA B100

NVIDIA B100

Pricing on request.

Scale with high performance B100 GPUs on our reserved cloud.

NVIDIA A40

NVIDIA A40

from $0.39 /hr

Deploy performant A40s on-demand with CUDO Compute.

NVIDIA L40S

NVIDIA L40S

from $1.42 /hr

Deploy performant L40Ss on-demand with CUDO Compute.

NVIDIA A100 PCIe

NVIDIA A100 PCIe

from $1.50 /hr

Deploy performant A100s on-demand with CUDO Compute.

NVIDIA V100

NVIDIA V100

from $0.39 /hr

Deploy performant V100s on-demand with CUDO Compute.

NVIDIA RTX 4000 SFF Ada

NVIDIA RTX 4000 SFF Ada

Pricing on request.

Deploy performant RTX 4000 SFF Adas on-demand with CUDO Compute.

NVIDIA RTX A4000

NVIDIA RTX A4000

Pricing on request.

Scale with high performance RTX A4000 GPUs on our reserved cloud.

NVIDIA RTX A5000

NVIDIA RTX A5000

from $0.35 /hr

Deploy performant RTX A5000s on-demand with CUDO Compute.

NVIDIA RTX A6000

NVIDIA RTX A6000

from $0.45 /hr

Deploy performant RTX A6000s on-demand with CUDO Compute.

AMD MI250/300

AMD MI250/300

Pricing on request.

Scale with high performance MI250/300 GPUs on our reserved cloud.

An NVIDIA preferred partner for compute

We're proud to be an NVIDIA preferred partner for compute, offering the latest GPUs and high-performance computing solutions.

Also trusted by our other key partners:

Talk to sales

  • Reserve GPUs. Access a HGX B200 GPU Cloud alongside other high performance models for as long as you need it.

  • Deployment & scaling. Seamless deployment alongside expert installation, ready to scale as your demands grow.

"CUDO Compute is a true pioneer in aggregating the world's cloud in a sustainable way, enabling service providers like us to integrate with ease"

VPS AI logo

Get started today or speak with an expert...