Lambda Labs is purpose-built for ML teams - simple, transparent per-hour rates on H100, H200, B200, and GB200 instances with zero hidden fees. Known for responsive support and direct hardware access, it is a top choice for training runs that need predictable pricing without cloud-platform complexity.
NVIDIA GB200 cloud pricing
- Vendor
- NVIDIA
- VRAM
- 384 GB
- Architecture
- Blackwell
- FP16
- 5000 TFLOPS
- Launched
- 2025
6 results
| Provider | Plan | Price | Regions | Visit | ||||
|---|---|---|---|---|---|---|---|---|
|
|
GB200 NVL72 (per GPU) On-demand | 1 | 192 GB | 28 | 512 GB |
$5.50
/GPU-hr
Verified
|
1 country | Visit → |
|
|
GB200 NVL72 (per GPU) Reserved | 1 | 192 GB | 28 | 512 GB |
$3.58
/GPU-hr
Verified
|
1 country | Visit → |
|
|
GB200 1x On-demand | 1 | 192 GB | 30 | 480 GB |
$5.99
/GPU-hr
Verified
|
1 country | Visit → |
|
|
GB200 NVL72 cluster (per GPU) On-demand | 1 | 192 GB | 30 | 480 GB |
$5.99
/GPU-hr
Verified
|
1 country | Visit → |
|
|
GB200 1x Reserved | 1 | 192 GB | 30 | 480 GB |
$3.89
/GPU-hr
Verified
|
1 country | Visit → |
|
|
GB200 NVL72 cluster (per GPU) Reserved | 1 | 192 GB | 30 | 480 GB |
$3.89
/GPU-hr
Verified
|
1 country | Visit → |
Providers offering this GPU
CoreWeave is a specialized GPU cloud operator with massive fleets of HGX H100, H200, GB200 NVL72, and B200 systems interconnected with high-speed InfiniBand networking. Purpose-built for large-scale AI training and inference at enterprise-grade reliability, it has become a preferred alternative to hyperscalers for GPU-intensive workloads.
Together AI provides blazing-fast hosted inference for open-weight models including Llama 3.1 (8B through 405B), DeepSeek V3, Qwen 2.5, and Mistral - all at prices far below closed-model APIs. Its optimized serving infrastructure and free tier for experimentation make it the go-to platform for teams that prefer open models without self-hosting overhead.