NVIDIA V100 cloud price comparison | DeployCue

NVIDIA V100 cloud pricing

Vendor: NVIDIA
VRAM: 32 GB (16 GB on some listings)
Architecture: Volta
FP16: 125 TFLOPS
Launched: 2017
Lowest: $0.132 /GPU-hr
Median: $0.255 /GPU-hr
Highest: $3.06 /GPU-hr

10 results

| Provider | Plan | Pricing mode | GPUs | VRAM | vCPUs | RAM | Price (/GPU-hr) | Status | Regions |
|---|---|---|---|---|---|---|---|---|---|
| Vast.ai | V100 (marketplace) | On-demand | 1 | 32 GB | 8 | 64 GB | $0.220 | Verified | 3 countries |
| Vast.ai | V100 (marketplace) | Spot | 1 | 32 GB | 8 | 64 GB | $0.132 | Verified | 3 countries |
| Vast.ai | V100 (marketplace) | Reserved | 1 | 32 GB | 8 | 64 GB | $0.143 | Verified | 3 countries |
| RunPod | V100 (Community) | On-demand | 1 | 32 GB | 8 | 62 GB | $0.290 | Verified | 3 countries |
| RunPod | V100 (Community) | Spot | 1 | 32 GB | 8 | 62 GB | $0.174 | Verified | 3 countries |
| RunPod | V100 (Community) | Reserved | 1 | 32 GB | 8 | 62 GB | $0.189 | Verified | 3 countries |
| Google Cloud | N1 + V100 | On-demand | 1 | 16 GB | 8 | 30 GB | $2.48 | Verified | 2 countries |
| Google Cloud | N1 + V100 | Reserved | 1 | 16 GB | 8 | 30 GB | $1.61 | Verified | 2 countries |
| Amazon Web Services | P3 (1x V100) | On-demand | 1 | 16 GB | 8 | 61 GB | $3.06 | Verified | 2 countries |
| Amazon Web Services | P3 (1x V100) | Reserved | 1 | 16 GB | 8 | 61 GB | $1.99 | Verified | 2 countries |
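
The lowest, median, and highest figures shown at the top of the page can be reproduced from the ten listed rows. The following is a minimal sketch, assuming the prices are copied by hand from the listing; the `rows` list and the `cheapest` helper are illustrative names, not part of any DeployCue API.

```python
from statistics import median

# (provider, pricing mode, USD per GPU-hour), copied by hand from the listing above
rows = [
    ("Vast.ai", "On-demand", 0.220),
    ("Vast.ai", "Spot", 0.132),
    ("Vast.ai", "Reserved", 0.143),
    ("RunPod", "On-demand", 0.290),
    ("RunPod", "Spot", 0.174),
    ("RunPod", "Reserved", 0.189),
    ("Google Cloud", "On-demand", 2.48),
    ("Google Cloud", "Reserved", 1.61),
    ("Amazon Web Services", "On-demand", 3.06),
    ("Amazon Web Services", "Reserved", 1.99),
]

prices = [price for _, _, price in rows]
lowest = min(prices)
highest = max(prices)
# With ten rows, the median is the mean of the 5th and 6th sorted prices.
mid = round(median(prices), 3)


def cheapest(mode):
    """Return the cheapest (provider, mode, price) row for one pricing mode."""
    return min((r for r in rows if r[1] == mode), key=lambda r: r[2])


print(lowest, mid, highest)
print(cheapest("On-demand"))
```

Filtering by pricing mode before taking the minimum matters here: the overall lowest price is a spot rate, so it is not the figure to quote when comparing on-demand plans.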

Providers offering this GPU

Amazon Web Services is the world's largest cloud provider, with 200+ services across compute, storage, databases, ML, and networking. It dominates the enterprise market with the broadest global region footprint and the deepest service catalog, but pricing complexity and egress fees add up at scale.

Google Cloud Platform combines data analytics, AI infrastructure (TPUs, Vertex AI), and the original managed Kubernetes service. Its global fiber backbone and Preemptible VMs offer competitive price-performance for data-heavy and containerized workloads.

RunPod operates a dual-tier marketplace: community GPUs at ultra-low spot prices and a SOC 2-compliant Secure Cloud for production inference. Per-second billing, instant provisioning, and a broad catalog spanning H100, A100, RTX, and even MI300X accelerators make it flexible for projects of any scale.

Vast.ai is a decentralized GPU marketplace where host operators list idle accelerators at market-driven prices, often the lowest spot rates anywhere. The trade-off is variable reliability, no compliance certifications, and community-grade support. It is best suited to cost-sensitive experimentation, fine-tuning, and batch inference where occasional interruption is acceptable.

Frequently asked questions

How much does an NVIDIA V100 cost per hour in the cloud?
The lowest NVIDIA V100 price we track is $0.132 per GPU-hour, which is a spot rate; the lowest on-demand rate is $0.220 per GPU-hour. Spot and reserved rates are usually lower than on-demand; sort the table above by price to see the current rate from every provider.
What is the cheapest NVIDIA V100 cloud provider?
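Sort the table by price (low to high) to see the cheapest NVIDIA V100 provider right now. Marketplace and spot providers often undercut hyperscalers by a wide margin: in the table above, marketplace rates run from $0.132 to $0.290 per GPU-hour, while hyperscaler rates run from $1.61 to $3.06. Check the VRAM column when comparing, since the listed cards differ in memory size.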
Which cloud providers offer NVIDIA V100 GPUs?
Every provider with published NVIDIA V100 availability is listed above, with per-hour pricing, the number of GPUs per instance, region coverage, and on-demand, spot, and reserved rates.
Is spot NVIDIA V100 cheaper than on-demand?
Yes. Spot (preemptible) capacity is typically 40-70% cheaper than on-demand but can be reclaimed at short notice. Use the pricing-mode filter to compare on-demand, spot, and reserved rows side by side.
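
To check the discount on a specific provider rather than rely on the typical range, the saving relative to on-demand can be computed directly. This is a rough sketch using the Vast.ai and RunPod rates from the table above (the only two providers listed with a spot row); `discount_pct` and the `rates` dictionary are hypothetical names chosen for illustration.

```python
def discount_pct(on_demand, other):
    """Percentage saved relative to the on-demand rate, rounded to 0.1."""
    return round((on_demand - other) / on_demand * 100, 1)


# USD per GPU-hour, copied by hand from the table above
rates = {
    "Vast.ai": {"on_demand": 0.220, "spot": 0.132, "reserved": 0.143},
    "RunPod": {"on_demand": 0.290, "spot": 0.174, "reserved": 0.189},
}

discounts = {
    provider: {
        "spot": discount_pct(r["on_demand"], r["spot"]),
        "reserved": discount_pct(r["on_demand"], r["reserved"]),
    }
    for provider, r in rates.items()
}

for provider, d in discounts.items():
    print(f"{provider}: spot {d['spot']}% off, reserved {d['reserved']}% off")
```

For both marketplaces the spot rows work out to about 40% below on-demand, the low end of the range quoted above, and the reserved rows to about 35% below.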
How much VRAM does the NVIDIA V100 have?
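The NVIDIA V100 ships in 16 GB and 32 GB variants. In the table above, Vast.ai and RunPod list 32 GB cards, while Google Cloud and Amazon Web Services list 16 GB cards. Larger VRAM lets you fit bigger models and batch sizes without sharding.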
Is the NVIDIA V100 good for AI training and inference?
The NVIDIA V100 is used for both training and inference, though its 16 GB or 32 GB of VRAM limits the size of model that fits on a single card. Match its VRAM and throughput (shown above) to your model size, and use spot capacity for fault-tolerant training to cut costs.