Price verified 2026-06-20. Live prices updated just now.
A16 GPU cloud prices
The NVIDIA A16 is a quad-GPU Ampere card with 64 GB of total memory (16 GB per GPU), designed for high-density VDI and light inference. It is commonly evaluated for LLM inference.
Cheapest: $0.059/hr (Vultr)
Median: $0.236/hr (52 offers)
Providers: 1 indexed provider
730-hour floor: $43 (before extras/taxes)
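The 730-hour floor appears to be the cheapest hourly rate multiplied by roughly one month of continuous use. A minimal sketch of that arithmetic follows, assuming 730 hours per month (8,760 hours per year divided by 12) and no storage, egress, or tax charges. The function name `monthly_floor` is illustrative and not part of any provider API.

```python
# Assumption: one "month" is 730 hours (8760 hours/year / 12).
HOURS_PER_MONTH = 730


def monthly_floor(hourly_usd: float, hours: int = HOURS_PER_MONTH) -> float:
    """Cost of running one instance continuously, before extras and taxes."""
    return hourly_usd * hours


# Hourly rates taken from the listing above.
cheapest = monthly_floor(0.059)
median = monthly_floor(0.236)

print(f"cheapest offer, 730 h: ${cheapest:.2f}")
print(f"median offer, 730 h:   ${median:.2f}")
```

Applying the same calculation to the median rate gives a rough sense of how far a typical offer sits above the floor.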
VRAM: 64 GB
FP16: 36 TFLOPS
Bandwidth: 928 GB/s
TDP: 250 W
Cheapest A16 providers
Provider floors from live indexed offers. Lower is better.
Common search intents
A16 price, A16 GPU rental, A16 cloud GPU