Lambda Labs is purpose-built for ML teams, offering simple, transparent per-hour rates on H100, H200, B200, and GB200 instances with no hidden fees. Known for responsive support and direct hardware access, it suits training runs that need predictable pricing without cloud-platform complexity.
NVIDIA B200 cloud pricing
- Vendor: NVIDIA
- VRAM: 192 GB
- Architecture: Blackwell
- FP16: 2250 TFLOPS
- Launched: 2025
8 results
| Provider | Plan | | | | | Price | Regions |
|---|---|---|---|---|---|---|---|
| | B200 (per GPU) On-demand | 1 | 180 GB | 28 | 283 GB | $3.50/GPU-hr (Verified) | 1 country |
| | B200 (per GPU) Reserved | 1 | 180 GB | 28 | 283 GB | $2.27/GPU-hr (Verified) | 1 country |
| | B200 cluster (per GPU) On-demand | 1 | 180 GB | 28 | 283 GB | $3.99/GPU-hr (Verified) | 1 country |
| | B200 cluster (per GPU) Reserved | 1 | 180 GB | 28 | 283 GB | $2.59/GPU-hr (Verified) | 1 country |
| | HGX B200 (per GPU) On-demand | 1 | 180 GB | 28 | 384 GB | $4.25/GPU-hr (Verified) | 1 country |
| | HGX B200 (per GPU) Reserved | 1 | 180 GB | 28 | 384 GB | $2.76/GPU-hr (Verified) | 1 country |
| | B200 1x On-demand | 1 | 180 GB | 28 | 283 GB | $4.99/GPU-hr (Verified) | 1 country |
| | B200 1x Reserved | 1 | 180 GB | 28 | 283 GB | $3.24/GPU-hr (Verified) | 1 country |
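
To compare the on-demand and reserved rates in the table above, the following is a minimal Python sketch. The per-GPU-hour prices are copied from the listing; the function names (`reserved_discount`, `run_cost`) and the example workload of 8 GPUs for 72 hours are illustrative assumptions, not part of any provider's API or a recommended configuration.

```python
# USD per GPU-hour, copied from the pricing table above.
PRICES = {
    "B200 (per GPU)": {"on_demand": 3.50, "reserved": 2.27},
    "B200 cluster (per GPU)": {"on_demand": 3.99, "reserved": 2.59},
    "HGX B200 (per GPU)": {"on_demand": 4.25, "reserved": 2.76},
    "B200 1x": {"on_demand": 4.99, "reserved": 3.24},
}


def reserved_discount(on_demand: float, reserved: float) -> float:
    """Return the fraction saved by the reserved rate relative to on-demand."""
    return 1.0 - reserved / on_demand


def run_cost(rate_per_gpu_hr: float, gpus: int, hours: float) -> float:
    """Return the total cost in USD for `gpus` GPUs running for `hours` hours."""
    return rate_per_gpu_hr * gpus * hours


if __name__ == "__main__":
    # Hypothetical workload, chosen only for illustration.
    gpus, hours = 8, 72
    for plan, rates in PRICES.items():
        saving = reserved_discount(rates["on_demand"], rates["reserved"])
        od_cost = run_cost(rates["on_demand"], gpus, hours)
        rs_cost = run_cost(rates["reserved"], gpus, hours)
        print(f"{plan}: reserved saves {saving:.1%} "
              f"(${od_cost:,.2f} vs ${rs_cost:,.2f} for {gpus} GPUs x {hours} h)")
```

The sketch ignores any minimum commitment a reserved plan may require, since the listing does not state one.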
Providers offering this GPU
CoreWeave is a specialized GPU cloud operator with massive fleets of HGX H100, H200, GB200 NVL72, and B200 systems interconnected with high-speed InfiniBand networking. Purpose-built for large-scale AI training and inference at enterprise-grade reliability, it has become a preferred alternative to hyperscalers for GPU-intensive workloads.
Together AI provides fast hosted inference for open-weight models including Llama 3.1 (8B through 405B), DeepSeek V3, Qwen 2.5, and Mistral, all priced well below closed-model APIs. Its optimized serving infrastructure and free tier for experimentation make it a common choice for teams that prefer open models without self-hosting overhead.
Hyperstack is a next-generation GPU cloud platform offering H100, A100, B200, L40S, and RTX-class accelerators at aggressive on-demand and reserved rates. With data centers in London and Oslo, Terraform support, and fast API-driven provisioning, it targets teams that want hyperscaler-grade GPU availability without the lock-in.