Find GPU for Your Model
Running Kimi K2.6? DeepSeek V4 Pro? Mercury 2? Or a custom model? Pick the parameter count and precision, and we show every GPU in our catalog that has enough VRAM — ranked by FP16 compute performance.
We calculate VRAM requirements for FP16, INT8, and FP4/GGUF quantization across batch sizes. Each result shows the maximum batch size that GPU can handle for your config.
Choose model size
Set precision and batch size
VRAM needed: 16 GB
27 GPUs can run this config
| GPU | VRAM | Max Batch |
|---|---|---|
| AMD Instinct MI300X | 192 GB | 12x |
| NVIDIA B200 | 192 GB | 12x |
| NVIDIA GB200 | 192 GB | 12x |
| NVIDIA B300 | 288 GB | 18x |
| NVIDIA H100 | 80 GB | 5x |
| NVIDIA H200 | 141 GB | 8x |
| NVIDIA GH200 | 96 GB | 6x |
| NVIDIA RTX 5090 | 32 GB | 2x |
| NVIDIA H200 NVL | 141 GB | 8x |
| AMD Instinct MI250X | 128 GB | 8x |
| NVIDIA L40S | 48 GB | 3x |
| NVIDIA RTX 6000 Ada | 48 GB | 3x |
| NVIDIA A100 | 80 GB | 5x |
| NVIDIA RTX 5080 | 16 GB | 1x |
| NVIDIA RTX 4090 | 24 GB | 1x |
| NVIDIA RTX A6000 | 48 GB | 3x |
| NVIDIA A40 | 48 GB | 3x |
| NVIDIA A10 | 24 GB | 1x |
| NVIDIA A10G | 24 GB | 1x |
| NVIDIA V100 | 32 GB | 2x |