NVIDIA Google TPU
TPU v5 Lite Pod (8 cores)
Eight-core v5 Lite pods provide substantial throughput for inference-heavy production deployments.
- Launch year
- 2024
- Memory
- 256 GB HBM
- Memory bandwidth
- >900 GB/s
- Peak FP16 / FP32
- 480 TFLOPS · 0 TFLOPS
Market snapshot
$4.08 /hr
Range $4.08 – $12.48
Catalog coverage
32 live offerings
Across 1 providers · 18 regions
- Scales via TPU pod fabric
- Low-latency inference
- Cloud-native management
Last refreshed Oct 20, 2025, 2:01 AM
Performance snapshot
Normalized versus NVIDIA A100 (=1.0). Values use public reference benchmarks for training and inference workloads.
- TPU Throughput×0.85
- Energy Efficiency×0.85
- Latency×0.60
Provider availability
Price bands per provider pulled from the live catalog.
Google Cloud32 offers
$4.08 – $12.48 /hr
Popular regions
- us-west4-b2 offers
- us-west4-a2 offers
- us-east5-a2 offers
- us-east5-c2 offers
- us-east5-b2 offers
- us-west1-c2 offers
- us-central1-a2 offers
- europe-west1-c2 offers
Weighted average price $8.30 /hr · median $9.60
Google Cloud
$4.08/hrGoogle Cloud
$4.08/hrGoogle Cloud
$4.80/hrGoogle Cloud
$4.80/hrGoogle Cloud
$4.80/hrGoogle Cloud
$5.04/hrGoogle Cloud
$5.04/hrGoogle Cloud
$5.29/hrGoogle Cloud
$5.29/hrGoogle Cloud
$5.30/hrGoogle Cloud
$5.30/hrGoogle Cloud
$5.66/hrGoogle Cloud
$5.66/hrGoogle Cloud
$6.24/hrGoogle Cloud
$9.60/hrGoogle Cloud
$9.60/hrGoogle Cloud
$9.60/hrGoogle Cloud
$9.60/hrGoogle Cloud
$9.60/hrGoogle Cloud
$9.60/hrGoogle Cloud
$9.60/hrGoogle Cloud
$10.57/hrGoogle Cloud
$10.57/hrGoogle Cloud
$11.12/hrGoogle Cloud
$11.12/hrGoogle Cloud
$11.12/hrGoogle Cloud
$11.33/hrGoogle Cloud
$11.33/hrGoogle Cloud
$12.33/hrGoogle Cloud
$12.48/hrGoogle Cloud
$12.48/hrGoogle Cloud
$12.48/hr