NVIDIA Google TPU
TPU v6e (4 cores)
Four-core v6e pods scale efficiency-focused TPU cores for production inference tiers.
- Launch year
- 2025
- Memory
- 256 GB HBM3
- Memory bandwidth
- >1.0 TB/s
- Peak FP16 / FP32
- 480 TFLOPS · 0 TFLOPS
Market snapshot
$3.28 /hr
Range $3.28 – $12.96
Catalog coverage
14 live offerings
Across 1 providers · 8 regions
- Auto-scaling TPU pods
- Energy-aware scheduling
- Reduced latency
Last refreshed Oct 20, 2025, 2:01 AM
Performance snapshot
Normalized versus NVIDIA A100 (=1.0). Values use public reference benchmarks for training and inference workloads.
- Projected TPU Throughput×0.85
- Projected Efficiency×0.92
- Latency×0.55
Provider availability
Price bands per provider pulled from the live catalog.
Google Cloud14 offers
$3.28 – $12.96 /hr
Popular regions
- europe-west4-a2 offers
- us-east5-a2 offers
- us-east5-c2 offers
- us-east5-b2 offers
- us-east1-d2 offers
- asia-northeast1-b2 offers
- us-south1-c1 offers
- us-south1-a1 offers
Weighted average price $9.45 /hr · median $10.80
Google Cloud
$3.28/hrGoogle Cloud
$7.56/hrGoogle Cloud
$7.56/hrGoogle Cloud
$7.56/hrGoogle Cloud
$7.56/hrGoogle Cloud
$9.08/hrGoogle Cloud
$10.80/hrGoogle Cloud
$10.80/hrGoogle Cloud
$10.80/hrGoogle Cloud
$10.80/hrGoogle Cloud
$10.80/hrGoogle Cloud
$10.80/hrGoogle Cloud
$11.88/hrGoogle Cloud
$12.96/hr
v6e-4 | europe-west4-a | $3.28/hr | |
v6e-4 | us-east5-a | $7.56/hr | |
v6e-4 | us-east5-c | $7.56/hr | |
v6e-4 | us-east5-b | $7.56/hr | |
v6e-4 | us-east1-d | $7.56/hr | |
v6e-4 | asia-northeast1-b | $9.08/hr | |
v6e-4 | us-south1-c | $10.80/hr | |
v6e-4 | us-east5-a | $10.80/hr | |
v6e-4 | us-east5-c | $10.80/hr | |
v6e-4 | us-east5-b | $10.80/hr | |
v6e-4 | us-south1-a | $10.80/hr | |
v6e-4 | us-east1-d | $10.80/hr | |
v6e-4 | europe-west4-a | $11.88/hr | |
v6e-4 | asia-northeast1-b | $12.96/hr |