Google Cloud Platform combines world-class data analytics, AI infrastructure (TPUs, Vertex AI), and the original managed Kubernetes. Its global fiber backbone and Preemptible VMs offer compelling price-performance for data-heavy and containerized workloads.
Gemini 2.5 Pro inference pricing
- Developer
- Quality rank
- #4
- Elo
- 1390
- Context
- 2000K
- Weights
- Closed
- Lowest output
- $10.00
Lowest output
$10.00
Median
$10.00
Highest
$10.00
3 results
| Provider | Plan | Price | Regions | Visit | |||
|---|---|---|---|---|---|---|---|
|
|
Vertex Gemini 2.5 Pro | $10.00 | $1.25 | 2000K |
$10.00
/M tokens
Input $1.25/1M tokens
Blended $3.44
Verified
|
Global | Visit → |
|
|
Gemini 2.5 Pro | $10.00 | $1.25 | 2000K |
$10.00
/M tokens
Input $1.25/1M tokens
Blended $3.44
Verified
|
Global | Visit → |
|
|
Gemini 2.5 Pro (routed) | $10.00 | $1.25 | 2000K |
$10.00
/M tokens
Input $1.25/1M tokens
Blended $3.44
Verified
|
Global | Visit → |
Providers serving this model
OpenRouter acts as a unified gateway that routes API requests across dozens of inference providers - OpenAI, Anthropic, Google, Together, Groq, and more - through a single API key. It automatically selects the best available provider for each model, with transparent pricing and the ability to fallback if one endpoint goes down.
Frequently asked questions
How much does Gemini 2.5 Pro cost per million tokens?
The lowest input price we track for Gemini 2.5 Pro is $1.25 per million tokens. Output tokens cost more; the table shows input, output, and blended pricing for every inference provider.
What is the cheapest Gemini 2.5 Pro API provider?
Sort the table by output or blended price to find the cheapest Gemini 2.5 Pro endpoint. Prices for the same model vary widely between providers, so the cheapest provider can be several times less than the most expensive.
Which providers serve the Gemini 2.5 Pro API?
Every provider with a published Gemini 2.5 Pro endpoint appears above, with input and output token pricing, context window, and throughput.
What is Gemini 2.5 Pro's context window?
Gemini 2.5 Pro supports a 2000K context window. A larger context window lets you pass more tokens (documents, code, history) in a single request.
Is Gemini 2.5 Pro open weight or closed source?
Gemini 2.5 Pro is a closed (proprietary) model available only through hosted APIs. Compare provider pricing above to find the cheapest endpoint.
What is blended LLM cost?
Blended cost weights input and output token prices by a typical 3:1 ratio so you can rank providers by one number instead of comparing two prices separately.