AI Model
Dolphin 2.9.1 Yi 1.5 34B
34B parameters · text-generation
VRAM (FP16)
69 GB
VRAM (INT4)
17.5 GB
Family
dphn
Compatible GPUs
NVIDIA H100
Min GPUs: 1 · fp16
NVIDIA A100
Min GPUs: 1 · fp16
NVIDIA GH200
Min GPUs: 1 · fp16
AMD Instinct MI250X
Min GPUs: 1 · fp16
NVIDIA H200
Min GPUs: 1 · fp16
NVIDIA H200 NVL
Min GPUs: 1 · fp16
NVIDIA B200
Min GPUs: 1 · fp16
NVIDIA GB200
Min GPUs: 1 · fp16
AMD Instinct MI300X
Min GPUs: 1 · fp16
NVIDIA B300
Min GPUs: 1 · fp16
NVIDIA L40S
Min GPUs: 2 · fp16
NVIDIA A40
Min GPUs: 2 · fp16
NVIDIA RTX 6000 Ada
Min GPUs: 2 · fp16
NVIDIA RTX A6000
Min GPUs: 2 · fp16
NVIDIA A16
Min GPUs: 2 · fp16
NVIDIA L4
Min GPUs: 3 · fp16
NVIDIA A10
Min GPUs: 3 · fp16
NVIDIA A10G
Min GPUs: 3 · fp16
NVIDIA RTX 4090
Min GPUs: 3 · fp16
NVIDIA RTX A5000
Min GPUs: 3 · fp16
NVIDIA RTX 3090
Min GPUs: 3 · fp16
NVIDIA V100
Min GPUs: 3 · fp16
NVIDIA RTX 5090
Min GPUs: 3 · fp16
NVIDIA T4
Min GPUs: 5 · fp16
NVIDIA P100
Min GPUs: 5 · fp16
NVIDIA RTX 5080
Min GPUs: 5 · fp16
NVIDIA RTX A4000
Min GPUs: 5 · fp16
Supported Frameworks
vLLMPyTorch
Deploy Dolphin 2.9.1 Yi 1.5 34B
Get a full deployment stack recommendation — GPU, count, framework, quantization, and projected cost.
Start deploymentHugging Face
View the model card, tokenizer, and weights on the Hugging Face Hub.
Open on Hugging FaceVRAM Usage
FP16 serving needs about 69 GB before workload-specific headroom. INT4 quantization reduces the model weights to about 17.5 GB, which is the practical path for large models on smaller GPU clusters.
Related Dolphin 2.9.1 Yi 1.5 34B resources
Move from model requirements into compatible GPU prices, deployment, and the wider model catalog.
NVIDIA H100 prices for Dolphin 2.9.1 Yi 1.5 34BRecommended GPU path for this model at fp16 precision.NVIDIA A100 cloud pricesCompatible option for Dolphin 2.9.1 Yi 1.5 34B; minimum 1 GPU.NVIDIA GH200 cloud pricesCompatible option for Dolphin 2.9.1 Yi 1.5 34B; minimum 1 GPU.AMD Instinct MI250X cloud pricesCompatible option for Dolphin 2.9.1 Yi 1.5 34B; minimum 1 GPU.Deploy Dolphin 2.9.1 Yi 1.5 34BGenerate a deployment recommendation with GPU count, framework, and estimated cost.Model VRAM leaderboardCompare FP16 and INT4 memory requirements across other deployable models.