GB200 vs L40S

Comparing the GB200 and L40S for cloud rental? The GB200 is available from 0 providers starting at , while the L40S starts at across 0 providers. Below is a live breakdown of pricing, VRAM, compute throughput, and where to rent each GPU.

GB200

VRAM
192 GB
TFLOPS
1125
Cheapest
See all GB200 offers

L40S

VRAM
48 GB
TFLOPS
366
Cheapest
See all L40S offers

GB200 vs L40S at a glance

Spec profile

GB200L40S
GB200 vs L40S spec comparison radar chartVRAMFP16BandwidthValueProviders

Each axis is scaled to the stronger GPU — the outer edge is better.

Value for money

Live pricing for both GPUs is needed to compute value-for-money metrics. See the provider tables below for current rates.

Specs are manufacturer peak FP16 Tensor figures; real-world throughput varies by workload, precision, and software stack.

GB200 vs L40S specs & pricing

SpecificationGB200L40S
Cheapest price
VRAM
192 GB
48 GB
FP16 TFLOPS
1125
366
Memory bandwidth
8000 GB/s
864 GB/s
Release year
2024
2023
Cloud providers

Which should you choose?

  • GB200 has more VRAM (192 GB), so it fits larger models and bigger batch sizes without sharding.
  • GB200 offers higher FP16 throughput, training and serving faster on compute-bound workloads.

Provider availability

Only GB200
None
Both GPUs
None
Only L40S
None

Frequently asked questions

How much VRAM do the GB200 and L40S have?

The GB200 has 192 GB of VRAM and the L40S has 48 GB. More VRAM lets you serve larger models and longer context windows on a single GPU.

Which is faster, the GB200 or L40S?

On paper the GB200 is faster, with 1125 FP16 TFLOPS versus 366. Real-world speed also depends on memory bandwidth and software stack.

Live prices come from public provider catalogs and can change quickly. How we collect prices.

Related GPU comparisons