H200 NVL vs L40S
Comparing the H200 NVL and L40S for cloud rental? No provider listings or prices are currently available for either GPU. Below is a breakdown of pricing, VRAM, compute throughput, and where to rent each GPU.
H200 NVL vs L40S at a glance
Spec profile
Each axis is scaled to the stronger GPU, so the outer edge is better.
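The scaling described above can be sketched as follows. This is a minimal illustration, assuming each axis is simply divided by the larger of the two values; the page does not state its exact formula, and the names `SPECS` and `scale_to_stronger` are hypothetical. The spec values are copied from the table on this page.

```python
# Spec values copied from the comparison table on this page.
SPECS = {
    "H200 NVL": {"vram_gb": 141, "fp16_tflops": 418, "mem_bw_gb_s": 4800},
    "L40S": {"vram_gb": 48, "fp16_tflops": 366, "mem_bw_gb_s": 864},
}


def scale_to_stronger(specs):
    """Divide each spec by the best value on that axis, so the leader scores 1.0.

    Assumption: plain ratio scaling; the chart may use a different rule.
    """
    axes = list(next(iter(specs.values())).keys())
    best = {axis: max(gpu[axis] for gpu in specs.values()) for axis in axes}
    return {
        name: {axis: gpu[axis] / best[axis] for axis in axes}
        for name, gpu in specs.items()
    }


profile = scale_to_stronger(SPECS)
for name, axes in profile.items():
    print(name, {axis: round(value, 2) for axis, value in axes.items()})
```

Under this assumption the H200 NVL sits on the outer edge of every axis, and the L40S is closest to it on FP16 throughput and furthest on memory bandwidth.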
Value for money
Live pricing for both GPUs is needed to compute value-for-money metrics. See the provider tables below for current rates.
Specs are manufacturer peak FP16 Tensor figures; real-world throughput varies by workload, precision, and software stack.
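Once hourly prices are known, a value-for-money figure can be computed by hand. The sketch below assumes the metric is spec divided by hourly price, which may not match how this page computes it. The prices are hypothetical placeholders, not real quotes, and `value_per_dollar` is an illustrative name.

```python
def value_per_dollar(spec_value, price_per_hour):
    """Return spec units per dollar-hour, e.g. TFLOPS per $/h or GB per $/h."""
    if price_per_hour <= 0:
        raise ValueError("price_per_hour must be positive")
    return spec_value / price_per_hour


# HYPOTHETICAL prices for illustration only; substitute real provider rates.
example_prices = {"H200 NVL": 3.00, "L40S": 1.00}

# Spec values copied from the comparison table on this page.
specs = {
    "H200 NVL": {"fp16_tflops": 418, "vram_gb": 141},
    "L40S": {"fp16_tflops": 366, "vram_gb": 48},
}

value = {
    name: {
        "tflops_per_dollar": value_per_dollar(s["fp16_tflops"], example_prices[name]),
        "vram_gb_per_dollar": value_per_dollar(s["vram_gb"], example_prices[name]),
    }
    for name, s in specs.items()
}

for name, metrics in value.items():
    print(name, {k: round(v, 1) for k, v in metrics.items()})
```

Which GPU wins depends entirely on the prices plugged in, so the placeholder figures above say nothing about actual market rates.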
H200 NVL vs L40S specs & pricing
| Specification | H200 NVL | L40S |
|---|---|---|
| Cheapest price | — | — |
| VRAM | 141 GB | 48 GB |
| FP16 TFLOPS | 418 | 366 |
| Memory bandwidth | 4800 GB/s | 864 GB/s |
| Release year | 2024 | 2023 |
| Cloud providers | — | — |
Which should you choose?
- H200 NVL has more VRAM (141 GB), so it fits larger models and bigger batch sizes without sharding.
- H200 NVL offers higher FP16 throughput, so it trains and serves faster on compute-bound workloads.
Provider availability
Frequently asked questions
How much VRAM do the H200 NVL and L40S have?
The H200 NVL has 141 GB of VRAM and the L40S has 48 GB. More VRAM lets you serve larger models and longer context windows on a single GPU.
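A rough way to check whether a model fits on one GPU is to multiply the parameter count by the bytes per parameter and add headroom for the KV cache and activations. The sketch below is a back-of-the-envelope estimate only: the 20% overhead factor and the 34B example model are assumptions chosen for illustration, and real usage depends on context length, batch size, and the serving framework.

```python
def weights_gb(params_billion, bytes_per_param):
    """Memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return params_billion * bytes_per_param


def fits_on_gpu(vram_gb, params_billion, bytes_per_param, overhead_fraction=0.2):
    """Return True if weights plus an ASSUMED overhead fit in the given VRAM.

    overhead_fraction is a placeholder for KV cache, activations, and runtime
    buffers; 0.2 is an illustrative guess, not a measured value.
    """
    needed = weights_gb(params_billion, bytes_per_param) * (1 + overhead_fraction)
    return needed <= vram_gb


# Hypothetical 34B-parameter model stored in FP16 (2 bytes per parameter).
print(weights_gb(34, 2))           # 68 GB of weights
print(fits_on_gpu(141, 34, 2))     # H200 NVL, 141 GB
print(fits_on_gpu(48, 34, 2))      # L40S, 48 GB
```

Under these assumptions the example model fits on a single H200 NVL but would need quantization or sharding across multiple L40S cards.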
Which is faster, the H200 NVL or L40S?
On paper the H200 NVL is faster, with 418 FP16 TFLOPS versus 366 for the L40S. Real-world speed also depends on memory bandwidth (4800 GB/s versus 864 GB/s) and the software stack.
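To see why memory bandwidth matters, a simple roofline estimate caps attainable throughput at the smaller of peak compute and bandwidth multiplied by arithmetic intensity (FLOPs performed per byte moved). The sketch below uses the peak figures from the table on this page; the intensity of 10 FLOP/byte is an assumed example of a memory-bound workload, and the model ignores caches, kernel efficiency, and software overhead.

```python
def attainable_tflops(peak_tflops, mem_bw_gb_s, flops_per_byte):
    """Roofline bound: min(peak compute, bandwidth * arithmetic intensity)."""
    bandwidth_bound_tflops = mem_bw_gb_s * flops_per_byte / 1000  # GFLOPS -> TFLOPS
    return min(peak_tflops, bandwidth_bound_tflops)


def ridge_point(peak_tflops, mem_bw_gb_s):
    """Arithmetic intensity (FLOP/byte) above which the GPU becomes compute-bound."""
    return peak_tflops * 1000 / mem_bw_gb_s


# Peak figures copied from the comparison table on this page.
gpus = {"H200 NVL": (418, 4800), "L40S": (366, 864)}

for name, (peak, bw) in gpus.items():
    # ASSUMED intensity of 10 FLOP/byte, typical of a bandwidth-limited kernel.
    bound = attainable_tflops(peak, bw, flops_per_byte=10)
    print(f"{name}: ridge {ridge_point(peak, bw):.1f} FLOP/byte, "
          f"bound at 10 FLOP/byte = {bound:.2f} TFLOPS")
```

At that assumed intensity both GPUs are limited by bandwidth rather than compute, so the gap between them tracks the bandwidth ratio far more closely than the peak TFLOPS ratio.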
Live prices come from public provider catalogs and can change quickly.