Back to search

Find GPU for Your Model

Running Kimi K2.6? DeepSeek V4 Pro? Mercury 2? Or a custom model? Pick the parameter count and precision, and we show every GPU in our catalog that has enough VRAM — ranked by FP16 compute performance.

We calculate VRAM requirements for FP16, INT8, and FP4/GGUF quantization across batch sizes. Each result shows the maximum batch size that GPU can handle for your config.

Choose model size

Set precision and batch size

VRAM needed: 16 GB

27 GPUs can run this config

GPUVRAMFP16 TFLOPSArchitectureMax BatchTags
AMD Instinct MI300X192 GB1307CDNA 312x
datacentertraininginference
NVIDIA B200192 GB1125Blackwell12x
datacentertraininginference
NVIDIA GB200192 GB1125Blackwell12x
datacentertraininginference
NVIDIA B300288 GB1100Blackwell Ultra18x
datacentertraininginference
NVIDIA H10080 GB495Hopper5x
datacentertraininginference
NVIDIA H200141 GB495Hopper8x
datacentertraininginference
NVIDIA GH20096 GB495Hopper6x
datacentertraininginference
NVIDIA RTX 509032 GB419Blackwell2x
consumerinferencetraining
NVIDIA H200 NVL141 GB418Hopper8x
datacentertraininginference
AMD Instinct MI250X128 GB383CDNA 28x
datacentertraininginference
NVIDIA L40S48 GB366Ada Lovelace3x
datacenterinferencegraphics
NVIDIA RTX 6000 Ada48 GB365Ada Lovelace3x
workstationinferencetraining
NVIDIA A10080 GB312Ampere5x
datacentertraininginference
NVIDIA RTX 508016 GB225Blackwell1x
consumerinferencetraining
NVIDIA RTX 409024 GB165Ada Lovelace1x
consumerinferencetraining
NVIDIA RTX A600048 GB155Ampere3x
workstationinferencetraining
NVIDIA A4048 GB150Ampere3x
datacenterinferencegraphics
NVIDIA A1024 GB125Ampere1x
datacenterinference
NVIDIA A10G24 GB125Ampere1x
datacenterinference
NVIDIA V10032 GB125Volta2x
datacentertraininginference