RunPod operates a dual-tier marketplace: community GPUs at ultra-low spot prices and a SOC 2-compliant Secure Cloud for production inference. Per-second billing, instant provisioning, and a broad catalog spanning H100, A100, RTX, and even MI300X accelerators make it flexible for projects of any scale.
NVIDIA RTX 4090 cloud pricing
- Vendor
- NVIDIA
- VRAM
- 24 GB
- Architecture
- Ada Lovelace
- FP16
- 330 TFLOPS
- Launched
- 2022
8 results
| Provider | Plan | Price | Regions | Visit | ||||
|---|---|---|---|---|---|---|---|---|
|
|
RTX 4090 (marketplace) On-demand | 1 | 24 GB | 8 | 48 GB |
$0.450
/GPU-hr
Verified
|
3 countries | Visit → |
|
|
RTX 4090 (marketplace) Spot | 1 | 24 GB | 8 | 48 GB |
$0.270
/GPU-hr
Verified
|
3 countries | Visit → |
|
|
RTX 4090 (marketplace) Reserved | 1 | 24 GB | 8 | 48 GB |
$0.292
/GPU-hr
Verified
|
3 countries | Visit → |
|
|
RTX 4090 (per GPU) On-demand | 1 | 24 GB | 8 | 60 GB |
$0.590
/GPU-hr
Verified
|
1 country | Visit → |
|
|
RTX 4090 (per GPU) Reserved | 1 | 24 GB | 8 | 60 GB |
$0.384
/GPU-hr
Verified
|
1 country | Visit → |
|
|
RTX 4090 (Community) On-demand | 1 | 24 GB | 8 | 62 GB |
$0.690
/GPU-hr
Verified
|
3 countries | Visit → |
|
|
RTX 4090 (Community) Spot | 1 | 24 GB | 8 | 62 GB |
$0.414
/GPU-hr
Verified
|
3 countries | Visit → |
|
|
RTX 4090 (Community) Reserved | 1 | 24 GB | 8 | 62 GB |
$0.449
/GPU-hr
Verified
|
3 countries | Visit → |
Providers offering this GPU
Hyperstack is a next-generation GPU cloud platform offering H100, A100, B200, L40S, and RTX-class accelerators at aggressive on-demand and reserved rates. With data centers in London and Oslo, Terraform support, and fast API-driven provisioning, it targets teams that want hyperscaler-grade GPU availability without the lock-in.
Vast.ai is a decentralized GPU marketplace where host operators list idle accelerators at market-driven prices - often the lowest spot rates anywhere. The trade-off is variable reliability, no compliance certs, and community-grade support; ideal for cost-sensitive experimentation, fine-tuning, and batch inference where occasional interruption is acceptable.