NVIDIA GB200 cloud price comparison | DeployCue

NVIDIA GB200 cloud pricing

Vendor: NVIDIA
VRAM: 384 GB
Architecture: Blackwell
FP16: 5000 TFLOPS
Launched: 2025
Lowest price: $3.58 /GPU-hr
Median price: $4.70 /GPU-hr
Highest price: $5.99 /GPU-hr

6 results

| Provider | Plan | Pricing | GPUs | VRAM | vCPUs | RAM | Price | Status | Regions |
|---|---|---|---|---|---|---|---|---|---|
| CoreWeave | GB200 NVL72 (per GPU) | On-demand | 1 | 192 GB | 28 | 512 GB | $5.50 /GPU-hr | Verified | 1 country |
| CoreWeave | GB200 NVL72 (per GPU) | Reserved | 1 | 192 GB | 28 | 512 GB | $3.58 /GPU-hr | Verified | 1 country |
| Lambda | GB200 1x | On-demand | 1 | 192 GB | 30 | 480 GB | $5.99 /GPU-hr | Verified | 1 country |
| Together AI | GB200 NVL72 cluster (per GPU) | On-demand | 1 | 192 GB | 30 | 480 GB | $5.99 /GPU-hr | Verified | 1 country |
| Lambda | GB200 1x | Reserved | 1 | 192 GB | 30 | 480 GB | $3.89 /GPU-hr | Verified | 1 country |
| Together AI | GB200 NVL72 cluster (per GPU) | Reserved | 1 | 192 GB | 30 | 480 GB | $3.89 /GPU-hr | Verified | 1 country |
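The Lowest, Median, and Highest figures near the top of the page can be reproduced from the six listed prices. The sketch below is one way to do that; the `PRICES` mapping and the `summarize` helper are illustrative names, not part of DeployCue, and half-up rounding to the cent is an assumption that happens to match the published median.

```python
from decimal import Decimal, ROUND_HALF_UP
from statistics import median

# Per-GPU hourly prices (USD) copied from the six rows listed above.
PRICES = {
    ("CoreWeave", "On-demand"): Decimal("5.50"),
    ("CoreWeave", "Reserved"): Decimal("3.58"),
    ("Lambda", "On-demand"): Decimal("5.99"),
    ("Together AI", "On-demand"): Decimal("5.99"),
    ("Lambda", "Reserved"): Decimal("3.89"),
    ("Together AI", "Reserved"): Decimal("3.89"),
}


def to_cents(value):
    """Round a Decimal to two places, half up (assumed rounding rule)."""
    return value.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


def summarize(prices):
    """Return the lowest, median, and highest price in the mapping."""
    values = list(prices.values())
    return {
        "lowest": min(values),
        "median": to_cents(median(values)),
        "highest": max(values),
    }


summary = summarize(PRICES)

# Restrict to on-demand rows to find the cheapest rate without a commitment.
on_demand = {key: price for key, price in PRICES.items() if key[1] == "On-demand"}
lowest_on_demand = min(on_demand.values())

print(summary, lowest_on_demand)
```

With an even number of rows the median is the mean of the two middle prices ($3.89 and $5.50), which is why it does not match any single row.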

Providers offering this GPU

Lambda

Lambda Labs is purpose-built for ML teams, with simple, transparent per-hour rates on H100, H200, B200, and GB200 instances and no hidden fees. Known for responsive support and direct hardware access, it suits training runs that need predictable pricing without cloud-platform complexity.

CoreWeave

CoreWeave is a specialized GPU cloud operator with massive fleets of HGX H100, H200, GB200 NVL72, and B200 systems interconnected with high-speed InfiniBand networking. Purpose-built for large-scale AI training and inference at enterprise-grade reliability, it has become a preferred alternative to hyperscalers for GPU-intensive workloads.

Together AI

Together AI provides fast hosted inference for open-weight models including Llama 3.1 (8B through 405B), DeepSeek V3, Qwen 2.5, and Mistral, all at prices far below closed-model APIs. Its optimized serving infrastructure and free tier for experimentation make it a common choice for teams that prefer open models without self-hosting overhead.

Frequently asked questions

How much does an NVIDIA GB200 cost per hour in the cloud?
The lowest NVIDIA GB200 price we track is $3.58 per GPU-hour (CoreWeave, reserved). The lowest on-demand price is $5.50 per GPU-hour (CoreWeave). Spot and reserved rates are usually lower than on-demand; sort the table above by price to see the current rate from every provider.
What is the cheapest NVIDIA GB200 cloud provider?
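Sort the table by price (low to high) to see the cheapest NVIDIA GB200 provider right now. Among the rows listed above, CoreWeave has both the cheapest reserved rate ($3.58 per GPU-hour) and the cheapest on-demand rate ($5.50 per GPU-hour). Marketplace and spot providers often undercut hyperscalers by a wide margin for the same GPU.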
Which cloud providers offer NVIDIA GB200 GPUs?
Every provider with published NVIDIA GB200 availability is listed above: currently CoreWeave, Lambda, and Together AI. Each row shows per-hour pricing, the number of GPUs per instance, region coverage, and the pricing mode (on-demand, spot, or reserved, where the provider offers it).
Is spot NVIDIA GB200 cheaper than on-demand?
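Usually, yes. Spot (preemptible) capacity is typically 40-70% cheaper than on-demand but can be reclaimed at short notice. None of the six rows currently listed is a spot offer, so only on-demand and reserved rates can be compared today. Use the pricing-mode filter to compare on-demand, spot, and reserved rows side by side.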
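To put those percentages next to the listed prices, the sketch below computes the reserved discount for two providers from the table and applies the 40-70% range to CoreWeave's $5.50 on-demand rate. The resulting spot figures are purely hypothetical, since no spot price is listed; `discount_pct` and `spot_range` are illustrative helper names.

```python
from decimal import Decimal, ROUND_HALF_UP


def discount_pct(on_demand, other):
    """Percentage saved relative to the on-demand rate, rounded to 0.1."""
    pct = (on_demand - other) / on_demand * 100
    return pct.quantize(Decimal("0.1"), rounding=ROUND_HALF_UP)


def spot_range(on_demand, low_pct=Decimal("40"), high_pct=Decimal("70")):
    """Return (cheapest, priciest) hypothetical spot rates for a discount range."""
    cent = Decimal("0.01")
    cheapest = (on_demand * (100 - high_pct) / 100).quantize(cent, rounding=ROUND_HALF_UP)
    priciest = (on_demand * (100 - low_pct) / 100).quantize(cent, rounding=ROUND_HALF_UP)
    return cheapest, priciest


# Reserved vs. on-demand, using prices from the table above.
coreweave_reserved_saving = discount_pct(Decimal("5.50"), Decimal("3.58"))
lambda_reserved_saving = discount_pct(Decimal("5.99"), Decimal("3.89"))

# Hypothetical: what a 40-70% spot discount would mean at $5.50 on-demand.
hypothetical_spot = spot_range(Decimal("5.50"))

print(coreweave_reserved_saving, lambda_reserved_saving, hypothetical_spot)
```

On the listed prices, reserving saves roughly 35% with either provider, which sits just below the low end of the typical spot range quoted above.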
How much VRAM does the NVIDIA GB200 have?
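The NVIDIA GB200 ships with 384 GB of VRAM per superchip, which pairs two Blackwell GPUs with one Grace CPU. Plans priced per GPU in the table above therefore list 192 GB each. Larger VRAM lets you fit bigger models and batch sizes without sharding.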
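As a rough sizing aid, the sketch below estimates how many devices are needed just to hold a model's weights. It assumes 2 bytes per parameter (FP16/BF16) and decimal gigabytes, and it ignores KV cache, activations, and optimizer state, so real requirements are higher. The helper names are illustrative.

```python
import math


def weight_footprint_gb(params_billions, bytes_per_param=2):
    """Weights-only memory in GB: 1 billion params at 1 byte each is about 1 GB."""
    return params_billions * bytes_per_param


def units_needed(params_billions, vram_gb, bytes_per_param=2):
    """Minimum number of devices whose combined VRAM holds the weights."""
    footprint = weight_footprint_gb(params_billions, bytes_per_param)
    return math.ceil(footprint / vram_gb)


PER_GPU_GB = 192        # per-GPU figure from the table above
PER_SUPERCHIP_GB = 384  # headline VRAM figure from the spec block

# Llama 3.1 70B and 405B in FP16 (weights only).
gpus_for_70b = units_needed(70, PER_GPU_GB)
gpus_for_405b = units_needed(405, PER_GPU_GB)
superchips_for_405b = units_needed(405, PER_SUPERCHIP_GB)

print(gpus_for_70b, gpus_for_405b, superchips_for_405b)
```

Under these assumptions a 70B model fits on a single 192 GB GPU, while a 405B model (about 810 GB of weights) has to be sharded across several.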
Is the NVIDIA GB200 good for AI training and inference?
The NVIDIA GB200 is used for both LLM training and inference. Match its VRAM and throughput (shown above) to your model size, and use spot capacity for fault-tolerant training to cut costs.