NVIDIA L40S
The NVIDIA L40S is a data center GPU that accelerates a wide range of high-performance computing workloads. Built on the Ada Lovelace architecture, the L40S handles intensive tasks such as AI inference and training, rendering, 3D graphics and virtual workstations. The NVIDIA L40S is available for reservation on the CUDO Compute marketplace.
from $0.75/hr
from $0.88/hr

The NVIDIA L40S is perfect for a wide range of workloads
Deploying AI-based workloads on CUDO Compute is easy and cost-effective. Follow our AI-related tutorials.
Available at the most cost-effective pricing
Launch your AI products faster with on-demand GPUs and a global network of data center partners
Virtual machines
The ideal deployment strategy for AI workloads with an L40S.
- Up to 8 GPUs / virtual machine
- Flexible
- Network attached storage
- Private networks
- Security groups
- Images
from $0.75/hr
from $0.88/hr
Bare metal
Complete control over a dedicated physical machine.
- Powered by renewable energy
- Up to 8 GPUs / host
- No noisy neighbours
- SpectrumX local networking
- 300Gbps external connectivity
- NVMe SSD storage
from $8.80/hr
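As a rough budgeting aid, the "from" prices listed above can be turned into a cost estimate. The sketch below is illustrative only: it assumes the $0.75/hr figure is billed per GPU and the $8.80/hr figure per bare-metal host, which this page does not state, and the `estimate_cost` helper is a hypothetical name. Actual prices may be higher than the "from" rates.

```python
# Minimal cost sketch using the "from" prices quoted on this page.
# Assumptions (not confirmed by the page): the VM rate is per GPU,
# the bare-metal rate is per host, and billing is linear in hours.

VM_FROM_RATE_USD = 0.75          # "from" price per hour (assumed per GPU)
BARE_METAL_FROM_RATE_USD = 8.80  # "from" price per hour (assumed per host)


def estimate_cost(hourly_rate_usd: float, hours: float, units: int = 1) -> float:
    """Return hourly_rate_usd * hours * units, rounded to cents."""
    if hourly_rate_usd < 0 or hours < 0 or units < 0:
        raise ValueError("rate, hours and units must be non-negative")
    return round(hourly_rate_usd * hours * units, 2)


# One day on a virtual machine with 8 GPUs (the listed maximum per VM).
vm_cost = estimate_cost(VM_FROM_RATE_USD, hours=24, units=8)

# One day on a single bare-metal host.
bare_metal_cost = estimate_cost(BARE_METAL_FROM_RATE_USD, hours=24)

print(f"8-GPU VM, 24h: at least ${vm_cost:.2f}")
print(f"Bare-metal host, 24h: at least ${bare_metal_cost:.2f}")
```

Because both inputs are lower-bound prices, the results are lower bounds too; check the marketplace for the rate that applies to a given configuration.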
Enterprise
We offer a range of solutions for enterprise customers.
- Powerful GPU clusters
- Scalable data center colocation
- Large quantities of GPUs and hardware
- Optimised to your requirements
- Expert installation
- Scale as your demand grows
Specifications
Browse specifications for the NVIDIA L40S GPU
Starting from | $0.75/hr |
Architecture | NVIDIA Ada Lovelace |
GPU Memory | 48GB GDDR6 with ECC |
Memory Bandwidth | 864GB/s |
Interconnect Interface | PCIe Gen4 x16: 64GB/s bidirectional |
NVIDIA Ada Lovelace Architecture-Based CUDA® Cores | 18,176 |
NVIDIA Third-Generation RT Cores | 142 |
NVIDIA Fourth-Generation Tensor Cores | 568 |
RT Core Performance TFLOPS | 212 |
FP32 TFLOPS | 91.6 |
TF32 Tensor Core TFLOPS | 183 / 366 |
BFLOAT16 Tensor Core TFLOPS | 362.05 / 733 |
FP16 Tensor Core TFLOPS | 362.05 / 733 |
FP8 Tensor Core TFLOPS | 733 / 1,466 |
Peak INT8 Tensor TOPS | 733 / 1,466 |
Peak INT4 Tensor TOPS | 733 / 1,466 |
Form Factor | 4.4" (H) x 10.5" (L), dual slot |
Display Ports | 4x DisplayPort 1.4a |
Max Power Consumption | 350W |
Power Connector | 16-pin |
Thermal | Passive |
Virtual GPU (vGPU) Software Support | Yes |
NVENC / NVDEC | 3x / 3x (includes AV1 encode and decode) |
Secure Boot With Root of Trust | Yes |
NEBS Ready | Level 3 |
Multi-Instance GPU (MIG) Support | No |
NVIDIA® NVLink® Support | No |
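To relate the 48GB memory figure to a workload, a common back-of-the-envelope check is whether a model's weights fit on a single card. The sketch below is a simplified estimate, not a sizing guarantee: it counts weights only (no activations, KV cache, optimizer state or framework overhead), treats 1GB as 10^9 bytes, and uses a hypothetical `headroom` fraction of 0.8 to leave room for everything it ignores. The model sizes are illustrative.

```python
# Weights-only memory estimate against the 48GB listed in the table above.
# Assumptions: 1 GB = 1e9 bytes; activations, KV cache and optimizer state
# are ignored; the 0.8 headroom fraction is an arbitrary illustrative value.

GPU_MEMORY_GB = 48  # from the specification table

BYTES_PER_PARAM = {"fp32": 4, "bf16": 2, "fp16": 2, "fp8": 1, "int8": 1}


def weights_gb(num_params: float, dtype: str) -> float:
    """Return the size of the model weights in GB for the given precision."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


def fits_on_one_gpu(num_params: float, dtype: str, headroom: float = 0.8) -> bool:
    """Return True if the weights fit within headroom * GPU_MEMORY_GB."""
    return weights_gb(num_params, dtype) <= GPU_MEMORY_GB * headroom


for params, dtype in [(7e9, "fp16"), (13e9, "fp32"), (13e9, "fp16"), (70e9, "fp8")]:
    size = weights_gb(params, dtype)
    verdict = "fits" if fits_on_one_gpu(params, dtype) else "does not fit"
    print(f"{params / 1e9:.0f}B params @ {dtype}: {size:.0f} GB of weights, {verdict}")
```

Since the table lists no NVLink or MIG support, models that do not fit on one card would need to be split across GPUs over PCIe or run at a lower precision.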
Ideal use cases for the NVIDIA L40S GPU
Explore use cases for the NVIDIA L40S, including rendering and 3D graphics, high-performance virtual workstations, and data science and AI.
Rendering and 3D graphics
Powered by third-generation RT Cores and fourth-generation Tensor Cores, the L40S is a strong choice for artists and content creators handling complex rendering and graphics tasks.
High-performance virtual workstations
When combined with NVIDIA RTX Virtual Workstation (vWS) software, the L40S allows professionals to access the most demanding applications from anywhere with awe-inspiring performance that rivals physical workstations.
Data science and AI
The L40S GPU offers powerful training and inference performance, allowing professionals to reduce the time to completion for model training and development as well as data preparation workflows.
Browse alternative GPU solutions for your workloads
Access a wide range of performant NVIDIA and AMD GPUs to accelerate your AI, ML & HPC workloads
An NVIDIA preferred partner for compute
We're proud to be an NVIDIA preferred partner for compute, offering the latest GPUs and high-performance computing solutions.
Talk to sales
Reserve GPUs. Access an L40S GPU cloud alongside other high-performance models for as long as you need it.
Deployment & scaling. Seamless deployment alongside expert installation, ready to scale as your demands grow.
"CUDO Compute is a true pioneer in aggregating the world's cloud in a sustainable way, enabling service providers like us to integrate with ease"
Get started today or speak with an expert...
Available Mon-Fri 9am-5pm UK time