A40 vs RTX 5090
Comparing the A40 and RTX 5090 for cloud rental? No provider listings or prices are currently available for either GPU, so this comparison covers VRAM, compute throughput, and memory bandwidth.
A40 vs RTX 5090 at a glance
Spec profile
Each axis is scaled to the stronger GPU; the outer edge is better.
Value for money
Live pricing for both GPUs is needed to compute value-for-money metrics. See the provider tables below for current rates.
Specs are manufacturer peak FP16 Tensor figures; real-world throughput varies by workload, precision, and software stack.
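As a rough illustration of a value-for-money metric, the sketch below divides peak FP16 TFLOPS by an hourly rental price. This is one plausible definition, not necessarily the one this page uses. The function name `tflops_per_dollar` and the prices in `example_prices` are hypothetical placeholders, not real quotes; only the TFLOPS figures come from the spec table.

```python
from typing import Optional

# Peak FP16 Tensor TFLOPS, taken from the spec table on this page.
FP16_TFLOPS = {"A40": 150.0, "RTX 5090": 419.0}


def tflops_per_dollar(gpu: str, hourly_price_usd: Optional[float]) -> Optional[float]:
    """Peak FP16 TFLOPS per USD per hour, or None when no usable price is listed."""
    if hourly_price_usd is None or hourly_price_usd <= 0:
        return None
    return FP16_TFLOPS[gpu] / hourly_price_usd


# Hypothetical prices for illustration only; replace with live provider rates.
example_prices = {"A40": 0.50, "RTX 5090": 1.00}

for gpu, price in example_prices.items():
    print(gpu, tflops_per_dollar(gpu, price))
```

Returning `None` when a price is missing mirrors the situation above, where the metric cannot be computed without live pricing.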
A40 vs RTX 5090 specs & pricing
| Specification | A40 | RTX 5090 |
|---|---|---|
| Cheapest price | — | — |
| VRAM | 48 GB | 32 GB |
| FP16 TFLOPS | 150 | 419 |
| Memory bandwidth | 696 GB/s | 1792 GB/s |
| Release year | 2021 | 2025 |
| Cloud providers | — | — |
Which should you choose?
- A40 has more VRAM (48 GB), so it fits larger models and bigger batch sizes without sharding.
- RTX 5090 offers higher FP16 throughput, training and serving faster on compute-bound workloads.
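To make the VRAM point concrete, here is a minimal sketch that estimates whether a model's weights fit on a single GPU. It assumes weights take `params × bytes_per_param` and adds a 20% allowance for KV cache, activations, and runtime buffers. The `overhead` factor and the helper name `fits_on_gpu` are assumptions for illustration; real memory use depends on batch size, context length, and framework.

```python
# VRAM capacities in GB, taken from the spec table on this page.
VRAM_GB = {"A40": 48.0, "RTX 5090": 32.0}


def fits_on_gpu(params_billion: float, bytes_per_param: float,
                vram_gb: float, overhead: float = 1.2) -> bool:
    """Rough single-GPU fit check.

    Weights need about params_billion * bytes_per_param GB (1e9 params times
    bytes per param, divided by 1e9 bytes per GB). The overhead multiplier is
    an assumed allowance, not a measured value.
    """
    required_gb = params_billion * bytes_per_param * overhead
    return required_gb <= vram_gb


# Example: a hypothetical 16B-parameter model stored in FP16 (2 bytes per parameter).
for gpu, vram in VRAM_GB.items():
    print(gpu, fits_on_gpu(16, 2, vram))
```

Under these assumptions a 16B FP16 model needs roughly 38 GB, which fits in the A40's 48 GB but not in the RTX 5090's 32 GB without quantization or sharding.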
Provider availability
Frequently asked questions
How much VRAM do the A40 and RTX 5090 have?
The A40 has 48 GB of VRAM and the RTX 5090 has 32 GB. More VRAM lets you serve larger models and longer context windows on a single GPU.
Which is faster, the A40 or RTX 5090?
On paper the RTX 5090 is faster, with 419 FP16 TFLOPS versus 150. Real-world speed also depends on memory bandwidth (1792 GB/s versus 696 GB/s, also in the RTX 5090's favor) and on the software stack.
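One way to see how compute and memory bandwidth interact is a simple roofline estimate. The sketch below uses only the peak figures from the spec table. The function names are illustrative, and the model ignores caches, kernel efficiency, and precision details, so treat the output as an upper bound rather than a prediction.

```python
# Peak figures from the spec table: (FP16 TFLOPS, memory bandwidth in GB/s).
SPECS = {"A40": (150.0, 696.0), "RTX 5090": (419.0, 1792.0)}


def ridge_point(gpu: str) -> float:
    """Arithmetic intensity (FLOP per byte) above which a kernel is compute-bound."""
    tflops, gb_per_s = SPECS[gpu]
    return (tflops * 1e12) / (gb_per_s * 1e9)


def attainable_tflops(gpu: str, flop_per_byte: float) -> float:
    """Roofline bound: the lower of peak compute and bandwidth times intensity."""
    tflops, gb_per_s = SPECS[gpu]
    bandwidth_bound = flop_per_byte * gb_per_s / 1000.0  # GFLOP/s -> TFLOP/s
    return min(tflops, bandwidth_bound)


for gpu in SPECS:
    print(gpu, round(ridge_point(gpu), 1), attainable_tflops(gpu, 10.0))
```

At low arithmetic intensity, as in small-batch inference, both GPUs are limited by bandwidth rather than by peak TFLOPS, so the gap between them tracks the bandwidth ratio more closely than the FP16 ratio.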
Live prices come from public provider catalogs and can change quickly.