L40S vs A16

Comparing the L40S and A16 for cloud rental? The L40S is available from 0 providers starting at , while the A16 starts at across 0 providers. Below is a live breakdown of pricing, VRAM, compute throughput, and where to rent each GPU.

L40S

VRAM
48 GB
TFLOPS
366
Cheapest
See all L40S offers

A16

VRAM
64 GB
TFLOPS
36
Cheapest
See all A16 offers

L40S vs A16 at a glance

Spec profile

L40SA16
L40S vs A16 spec comparison radar chartVRAMFP16BandwidthValueProviders

Each axis is scaled to the stronger GPU — the outer edge is better.

Value for money

Live pricing for both GPUs is needed to compute value-for-money metrics. See the provider tables below for current rates.

Specs are manufacturer peak FP16 Tensor figures; real-world throughput varies by workload, precision, and software stack.

L40S vs A16 specs & pricing

SpecificationL40SA16
Cheapest price
VRAM
48 GB
64 GB
FP16 TFLOPS
366
36
Memory bandwidth
864 GB/s
928 GB/s
Release year
2023
2021
Cloud providers

Which should you choose?

  • A16 has more VRAM (64 GB), so it fits larger models and bigger batch sizes without sharding.
  • L40S offers higher FP16 throughput, training and serving faster on compute-bound workloads.

Provider availability

Only L40S
None
Both GPUs
None
Only A16
None

Frequently asked questions

How much VRAM do the L40S and A16 have?

The L40S has 48 GB of VRAM and the A16 has 64 GB. More VRAM lets you serve larger models and longer context windows on a single GPU.

Which is faster, the L40S or A16?

On paper the L40S is faster, with 366 FP16 TFLOPS versus 36. Real-world speed also depends on memory bandwidth and software stack.

Live prices come from public provider catalogs and can change quickly. How we collect prices.

Related GPU comparisons