NVIDIA GB200 cloud pricing

Vendor: NVIDIA
VRAM: 384 GB
Architecture: Blackwell
FP16: 5000 TFLOPS
Launched: 2025

Lowest

$3.58

Median

$4.70

Highest

$5.99

6 results

Provider	Plan	GPUs	VRAM	vCPUs	RAM	Price	Regions	Visit
CoreWeave	GB200 NVL72 (per GPU) On-demand	1	192 GB	28	512 GB	$5.50 /GPU-hr Verified Jun 20, 2026	1 country	Visit →
CoreWeave	GB200 NVL72 (per GPU) Reserved	1	192 GB	28	512 GB	$3.58 /GPU-hr Verified Jun 20, 2026	1 country	Visit →
Lambda	GB200 1x On-demand	1	192 GB	30	480 GB	$5.99 /GPU-hr Verified Jun 20, 2026	1 country	Visit →
Together AI	GB200 NVL72 cluster (per GPU) On-demand	1	192 GB	30	480 GB	$5.99 /GPU-hr Verified Jun 20, 2026	1 country	Visit →
Lambda	GB200 1x Reserved	1	192 GB	30	480 GB	$3.89 /GPU-hr Verified Jun 20, 2026	1 country	Visit →
Together AI	GB200 NVL72 cluster (per GPU) Reserved	1	192 GB	30	480 GB	$3.89 /GPU-hr Verified Jun 20, 2026	1 country	Visit →

Providers offering this GPU

Lambda

Lambda Labs is purpose-built for ML teams - simple, transparent per-hour rates on H100, H200, B200, and GB200 instances with zero hidden fees. Known for responsive support and direct hardware access, it is a top choice for training runs that need predictable pricing without cloud-platform complexity.

CoreWeave

CoreWeave is a specialized GPU cloud operator with massive fleets of HGX H100, H200, GB200 NVL72, and B200 systems interconnected with high-speed InfiniBand networking. Purpose-built for large-scale AI training and inference at enterprise-grade reliability, it has become a preferred alternative to hyperscalers for GPU-intensive workloads.

Together AI

Together AI provides blazing-fast hosted inference for open-weight models including Llama 3.1 (8B through 405B), DeepSeek V3, Qwen 2.5, and Mistral - all at prices far below closed-model APIs. Its optimized serving infrastructure and free tier for experimentation make it the go-to platform for teams that prefer open models without self-hosting overhead.

Frequently asked questions

How much does an NVIDIA GB200 cost per hour in the cloud?

The lowest on-demand NVIDIA GB200 price we track is $3.58 per GPU-hour. Spot and reserved rates are usually lower; sort the table above by price to see the current rate from every provider.

What is the cheapest NVIDIA GB200 cloud provider?

Sort the table by price (low to high) to see the cheapest NVIDIA GB200 provider right now. Marketplace and spot providers often undercut hyperscalers by a wide margin for the same NVIDIA GB200.

Which cloud providers offer NVIDIA GB200 GPUs?

Every provider with published NVIDIA GB200 availability is listed above, with per-hour pricing, the number of GPUs per instance, region coverage, and on-demand, spot, and reserved rates.

Is spot NVIDIA GB200 cheaper than on-demand?

Yes. Spot (preemptible) capacity is typically 40-70% cheaper than on-demand but can be reclaimed at short notice. Use the pricing-mode filter to compare on-demand, spot, and reserved rows side by side.

How much VRAM does the NVIDIA GB200 have?

The NVIDIA GB200 ships with 384 GB of VRAM. Larger VRAM lets you fit bigger models and batch sizes without sharding.

Is the NVIDIA GB200 good for AI training and inference?

The NVIDIA GB200 is used for both LLM training and inference. Match its VRAM and throughput (shown above) to your model size, and use spot capacity for fault-tolerant training to cut costs.