NVIDIA H200
The NVIDIA H200 is an ideal choice for large-scale AI applications. Built on the NVIDIA Hopper architecture, it combines expanded memory capacity and bandwidth to accelerate AI training and inference on larger models.
Starting from $2.49/hr
Available at the most cost-effective pricing
Launch your AI products faster with on-demand GPUs and a global network of data center partners
Virtual machines
The ideal deployment strategy for AI workloads with an H200.
from $2.49/hr
Enquire now
- Up to 8 GPUs / virtual machine
- Flexible
- Network attached storage
- Private networks
- Security groups
- Images
Bare metal
Complete control over a dedicated physical machine for maximum performance
Pricing available on request
Get pricing
- Up to 8 GPUs / host
- No noisy neighbours
- NVIDIA Spectrum-X local networking
- 300Gbps external connectivity
- NVMe SSD storage
Looking to scale? Please contact us for enterprise solutions.
Speak with an expert
The NVIDIA H200 is perfect for a wide range of workloads
Deploying AI-based workloads on CUDO Compute is easy and cost-effective. Follow our AI-related tutorials.
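As a minimal sketch, assuming a CUDA-enabled PyTorch build is already installed on the newly launched instance (for example from an NGC PyTorch image), the following check confirms the H200 is visible and responsive; the device index and printed values are illustrative:

```python
# Sanity check after launching a VM or bare-metal host with an H200.
# Assumes the NVIDIA driver and a CUDA-enabled PyTorch build are present.
import torch

if not torch.cuda.is_available():
    raise SystemExit("No CUDA device visible - check the NVIDIA driver installation.")

device = torch.device("cuda:0")
props = torch.cuda.get_device_properties(device)
print(f"GPU:          {props.name}")                          # expect "NVIDIA H200"
print(f"Memory:       {props.total_memory / 1024**3:.0f} GiB")  # roughly 141GB HBM3e
print(f"Compute cap.: {props.major}.{props.minor}")            # 9.0 for Hopper

# Quick smoke test: one large bfloat16 matmul on the GPU.
x = torch.randn(8192, 8192, device=device, dtype=torch.bfloat16)
y = x @ x
torch.cuda.synchronize()
print("Matmul OK:", y.shape)
```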
Specifications
| Specification | Value |
| --- | --- |
| Starting from | $2.49/hr |
| Architecture | NVIDIA Hopper |
| Form factor | SXM |
| FP64 | 34 TFLOPS |
| FP64 Tensor Core | 67 TFLOPS |
| FP32 | 67 TFLOPS |
| TF32 Tensor Core | 989 TFLOPS |
| BFLOAT16 Tensor Core | 1,979 TFLOPS |
| FP16 Tensor Core | 1,979 TFLOPS |
| FP8 Tensor Core | 3,958 TFLOPS |
| INT8 Tensor Core | 3,958 TOPS |
| GPU memory | 141GB |
| GPU memory bandwidth | 4.8 TB/s |
| Decoders | 7x NVDEC, 7x NVJPEG |
| Max thermal design power (TDP) | Up to 700W |
| Multi-instance GPUs (MIG) | Up to 7 MIGs @16.5GB each |
| Interconnect | NVLink: 900GB/s; PCIe Gen5: 128GB/s |
Use cases
AI inference
AI developers can use the NVIDIA H200 to accelerate AI inference workloads such as image and speech recognition. The H200's Tensor Cores process large volumes of data quickly, making it well suited to real-time inference applications.
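As an illustrative, non-CUDO-specific sketch, the snippet below runs a batch through a stand-in convolutional model under PyTorch autocast with bfloat16, the low-precision path that exercises the H200's Tensor Cores; the model and batch size are placeholders for your own image or speech workload:

```python
# Low-precision inference sketch using PyTorch autocast on the GPU.
# The tiny model below is a stand-in; substitute your own network.
import torch
import torch.nn as nn

model = nn.Sequential(
    nn.Conv2d(3, 64, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.AdaptiveAvgPool2d(1),
    nn.Flatten(),
    nn.Linear(64, 1000),
).cuda().eval()

batch = torch.randn(32, 3, 224, 224, device="cuda")

# inference_mode disables autograd; autocast runs eligible ops in bfloat16.
with torch.inference_mode(), torch.autocast(device_type="cuda", dtype=torch.bfloat16):
    logits = model(batch)

print(logits.shape)  # torch.Size([32, 1000])
```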
Deep learning
The NVIDIA H200 empowers data scientists and researchers to achieve groundbreaking milestones in deep learning. Its large memory capacity and processing power significantly reduce training and deployment times for complex, large-scale models and enable training on much larger datasets.
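A minimal sketch of a bfloat16 mixed-precision training step in PyTorch follows; the model, optimizer settings, and synthetic data are assumptions for illustration, not a prescribed training recipe:

```python
# Mixed-precision training step sketch (bfloat16 autocast).
# bfloat16 keeps FP32's exponent range, so no gradient scaler is needed here;
# parameters and their gradients remain FP32 under autocast.
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(4096, 4096), nn.GELU(), nn.Linear(4096, 10)).cuda()
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
loss_fn = nn.CrossEntropyLoss()

# Synthetic data purely for illustration.
inputs = torch.randn(256, 4096, device="cuda")
targets = torch.randint(0, 10, (256,), device="cuda")

for step in range(10):
    optimizer.zero_grad(set_to_none=True)
    with torch.autocast(device_type="cuda", dtype=torch.bfloat16):
        loss = loss_fn(model(inputs), targets)
    loss.backward()
    optimizer.step()

print(f"final loss: {loss.item():.4f}")
```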
High-performance computing
From complex scientific simulations to weather forecasting and intricate financial modelling, the H200 helps organizations accelerate high-performance computing tasks. Its high memory bandwidth and FP64 compute keep large workloads running smoothly, delivering results faster.
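For a rough sense of double-precision throughput, the hedged sketch below times a single FP64 matrix multiplication in PyTorch; the matrix size and timing methodology are simplified, so treat the reported figure as indicative only:

```python
# Double-precision (FP64) throughput sketch for HPC-style workloads.
import time
import torch

n = 8192
a = torch.randn(n, n, device="cuda", dtype=torch.float64)
b = torch.randn(n, n, device="cuda", dtype=torch.float64)

torch.cuda.synchronize()
start = time.perf_counter()
c = a @ b                       # FP64 GEMM dispatched to cuBLAS
torch.cuda.synchronize()
elapsed = time.perf_counter() - start

flops = 2 * n ** 3              # multiply-adds in an n x n x n matmul
print(f"{flops / elapsed / 1e12:.1f} TFLOPS sustained (FP64)")
```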
Browse alternative GPU solutions for your workloads
Access a wide range of performant NVIDIA and AMD GPUs to accelerate your AI, ML & HPC workloads
An NVIDIA preferred partner for compute
We're proud to be an NVIDIA preferred partner for compute, offering the latest GPUs and high-performance computing solutions.
Also trusted by our other key partners:
Frequently asked questions
Pricing & reservation enquiry
Enquire about access today to test the H200 GPU Cloud, or reserve H200 capacity on CUDO Compute for as long as you need it, with contracts tailored to suit your requirements.
Get started today or speak with an expert...
Available Mon-Fri 9am-5pm UK time