How much does it cost to rent an NVIDIA L40S?

The price for running an NVIDIA L40S on CUDO Compute starts at $0.70/hr with our commitment pricing. See our pricing table to see our full range of available prices or learn more about how commitment pricing works in our documentation.

Do you offer any discounts on NVIDIA L40Ss for committed use?

We can offer up to a 50% discount with our commitment pricing. Our commitment pricing plans for GPUs can include 1 month, 3 month, 6 month, 12 month and 36 month options. You can see our full range of commitment prices on our GPU pricing page

NVIDIAL40S

The NVIDIA L40S is a cloud-based GPU that delivers breakthrough acceleration to perform a wide range of high-performance computing workloads. Powered by the Ada Lovelace architecture and cutting-edge features, the L40S brings next-level performance and exceptional processing power to handle intensive tasks, such as AI inference and training, rendering, 3D graphics and virtual workstations. The NVIDIA L40S is available for reservation on the CUDO Compute marketplace.

Committed

from $0.70/hr

On-demand

from $0.87/hr

Launch an instance Contact sales

The NVIDIA L40S is perfect for a wide range of workloads

Deploying AI based workloads on CUDO Compute is easy and cost-effective. Follow our AI related tutorials.

Tutorial

Deploying LLM’s like Google Gemma

In this tutorial we will run Google Gemma with Ollama so that you can send queries via a Rest API

Read tutorial

Tutorial

Deploy PyTorch

With CUDO Compute you can deploy PyTorch docker containers to the latest NVIDIA Ampere Architecture GPUs.

Read tutorial

Available at the most cost-effective pricing

Launch your AI products faster with on-demand GPUs and a global network of data center partners

Virtual machines

The ideal deployment strategy for AI workloads with a L40S.

Up to 8 GPUs / virtual machine
Flexible
Network attached storage
Private networks
Security groups
Images

Committed

from $0.70/hr

Save $4,471.68

On-demand

from $0.87/hr

Launch an instance Learn more

Bare metal

Complete control over a physical machine for more control.

Powered by renewable energy
Up to 8 GPUs / host
No noisy neighbors
SpectrumX local networking
300Gbps external connectivity
NVMe SSD storage

from $8.80/hr

Save up to $0.40/hr

Enquire now Learn more

Enterprise

We offer a range of solutions for enterprise customers.

Powerful GPU clusters
Scalable data center colocation
Large quantities of GPUs and hardware
Optimize to your requirements
Expert installation
Scale as your demand grows

Contact sales Learn more

Specifications

Browse specifications for the NVIDIA L40S GPU

Starting from	$0.70/hr
Architecture	NVIDIA Ada Lovelace
GPU Memory	48GB GDDR6 with ECC
Memory Bandwidth	864GB/s
Interconnect Interface	PCIe Gen4 x16: 64GB/s bidirectional
NVIDIA Ada Lovelace Architecture-Based CUDA® Cores	18,176
NVIDIA Third-Generation RT Cores	142
NVIDIA Fourth-Generation Tensor Cores	568
RT Core Performance TFLOPS	212
FP32 TFLOPS	91.6
TF32 Tensor Core TFLOPS	183 I 366
BFLOAT16 Tensor Core TFLOPS	362.05 I 733
FP16 Tensor Core	362.05 I 733
FP8 Tensor Core	733 I 1,466
Peak INT8 Tensor TOPS	733 I 1,466
Peak INT4 Tensor TOPS	733 I 1,466
Form Factor	4.4" (H) x 10.5" (L), dual slot
Display Ports	4x DisplayPort 1.4a
Max Power Consumption	350W
Power Connector	16-pin
Thermal	Passive
Virtual GPU (vGPU) Software Support	Yes
NVENC I NVDEC	3x l 3x (includes AV1 encode and decode)
Secure Boot With Root of Trust	Yes
NEBS Ready	Level 3
Multi-Instance GPU (MIG) Support	No
NVIDIA® NVLink® Support	No

Ideal uses cases for the NVIDIA L40S GPU

Explore uses cases for the NVIDIA L40S including Rendering and 3D graphics, High-performance virtual workstations, Data science and AI.

Rendering and 3D graphics

Powered by the latest fourth-generation Tensor Core and featuring enhanced AI capabilities, the L40S is the top choice for artists and content creators to handle complex rendering and graphics tasks.

High-performance virtual workstations

When combined with NVIDIA RTX Virtual Workstation (vWS) software, the L40S allows professionals to access the most demanding applications from anywhere with awe-inspiring performance that rivals physical workstations.

Data science and AI

The L40S GPU offers powerful training and inference performance, allowing professionals to reduce the time to completion for model training and development as well as data preparation workflows.

Browse alternative GPU solutions for your workloads

Access a wide range of performant NVIDIA and AMD GPUs to accelerate your AI, ML & HPC workloads

NVIDIA H100 SXM

from $2.45 /hr

Deploy performant H100s on-demand with CUDO Compute.

Learn more Deploy

NVIDIA H100 PCIe

from $2.45 /hr

Deploy performant H100s on-demand with CUDO Compute.

Learn more Deploy

NVIDIA HGX B200

Pricing on request.

Scale with high performance HGX B200 GPUs on our reserved cloud.

Learn more Enquire

NVIDIA GB200 NVL72

Pricing on request.

Scale with high performance GB200 NVL72 GPUs on our reserved cloud.

Learn more Enquire

NVIDIA A800 PCIe

from $0.80 /hr

Deploy performant A800s on-demand with CUDO Compute.

Learn more Deploy

NVIDIA H200 SXM

Pricing on request.

Deploy performant H200s on-demand with CUDO Compute.

Learn more Deploy

NVIDIA B100

Pricing on request.

Scale with high performance B100 GPUs on our reserved cloud.

Learn more Enquire

NVIDIA A40

from $0.39 /hr

Deploy performant A40s on-demand with CUDO Compute.

Learn more Deploy

NVIDIA A100 PCIe

from $1.50 /hr

Deploy performant A100s on-demand with CUDO Compute.

Learn more Deploy

NVIDIA V100

from $0.39 /hr

Deploy performant V100s on-demand with CUDO Compute.

Learn more Deploy

NVIDIA RTX 4000 SFF Ada

Pricing on request.

Deploy performant RTX 4000 SFF Adas on-demand with CUDO Compute.

Learn more Deploy

NVIDIA RTX A4000

Pricing on request.

Scale with high performance RTX A4000 GPUs on our reserved cloud.

Learn more Enquire

NVIDIA RTX A5000

from $0.35 /hr

Deploy performant RTX A5000s on-demand with CUDO Compute.

Learn more Deploy

NVIDIA RTX A6000

from $0.45 /hr

Deploy performant RTX A6000s on-demand with CUDO Compute.

Learn more Deploy

AMD MI250/300

Pricing on request.

Scale with high performance MI250/300 GPUs on our reserved cloud.

Learn more Enquire

Trusted by NVIDIA. Built for you.

As a preferred partner, we offer NVIDIA’s most advanced GPUs with tested infrastructure, ready for AI, HPC and demanding workloads.

Also trusted by our other key partners:

Frequently asked questions

Are you looking for support with something more specific? Check out our knowledge base

Talk to sales

Reserve GPUs. Access a L40S GPU Cloud alongside other high performance models for as long as you need it.
Deployment & scaling. Seamless deployment alongside expert installation, ready to scale as your demands grow.

"CUDO Compute is a true pioneer in aggregating the world's cloud in a sustainable way, enabling service providers like us to integrate with ease"

VPS AI

"I’ve found CUDO’s service to be exceptional, with impressively fast responses"

MetaHub

"AI workloads using dstack and CUDO Compute as easily and reliably as with the big three cloud providers"

dstack

Loading GPU resource form...

Get started today or speak with an expert...

+44 20 8050 7646

Available Mon-Fri 9am-5pm UK time

Get started Speak with an expert