GH200 vs A40

Comparing the GH200 and A40 for cloud rental? The GH200 is available from 0 providers starting at , while the A40 starts at across 0 providers. Below is a live breakdown of pricing, VRAM, compute throughput, and where to rent each GPU.

GH200

VRAM
96 GB
TFLOPS
495
Cheapest
See all GH200 offers

A40

VRAM
48 GB
TFLOPS
150
Cheapest
See all A40 offers

GH200 vs A40 at a glance

Spec profile

GH200A40
GH200 vs A40 spec comparison radar chartVRAMFP16BandwidthValueProviders

Each axis is scaled to the stronger GPU — the outer edge is better.

Value for money

Live pricing for both GPUs is needed to compute value-for-money metrics. See the provider tables below for current rates.

Specs are manufacturer peak FP16 Tensor figures; real-world throughput varies by workload, precision, and software stack.

GH200 vs A40 specs & pricing

SpecificationGH200A40
Cheapest price
VRAM
96 GB
48 GB
FP16 TFLOPS
495
150
Memory bandwidth
4000 GB/s
696 GB/s
Release year
2023
2021
Cloud providers

Which should you choose?

  • GH200 has more VRAM (96 GB), so it fits larger models and bigger batch sizes without sharding.
  • GH200 offers higher FP16 throughput, training and serving faster on compute-bound workloads.

Provider availability

Only GH200
None
Both GPUs
None
Only A40
None

Frequently asked questions

How much VRAM do the GH200 and A40 have?

The GH200 has 96 GB of VRAM and the A40 has 48 GB. More VRAM lets you serve larger models and longer context windows on a single GPU.

Which is faster, the GH200 or A40?

On paper the GH200 is faster, with 495 FP16 TFLOPS versus 150. Real-world speed also depends on memory bandwidth and software stack.

Live prices come from public provider catalogs and can change quickly. How we collect prices.

Related GPU comparisons