Claude Haiku 4.5 API pricing comparison | DeployCue Skip to content
DeployCue

Claude Haiku 4.5 inference pricing

Developer
Anthropic
Quality rank
#22
Elo
1320
Context
200K
Weights
Closed
Lowest output
$5.00
Lowest output
$5.00
Median
$5.00
Highest
$5.00

4 results

Provider Plan Price Regions Visit
Amazon Web Services Bedrock Claude Haiku 4.5 $5.00 $1.00 200K $5.00 /M tokens
Input $1.00/1M tokens
Blended $2.00
Verified
Global Visit →
Anthropic Claude Haiku 4.5 $5.00 $1.00 200K $5.00 /M tokens
Input $1.00/1M tokens
Blended $2.00
Verified
Global Visit →
Google Cloud Vertex Claude Haiku 4.5 $5.00 $1.00 200K $5.00 /M tokens
Input $1.00/1M tokens
Blended $2.00
Verified
Global Visit →
OpenRouter Claude Haiku 4.5 (routed) $5.00 $1.00 200K $5.00 /M tokens
Input $1.00/1M tokens
Blended $2.00
Verified
Global Visit →

Providers serving this model

Amazon Web Services is the world's largest cloud provider with 200+ services across compute, storage, databases, ML, and networking. Dominates in enterprise with the broadest global region footprint and the deepest service catalog, but pricing complexity and egress fees add up at scale.

Anthropic logo 2

Anthropic's API delivers the Claude model family - Opus, Sonnet, and Haiku - known for thoughtful reasoning, strong coding ability, and industry-leading safety alignment. With a 200K context window and a reputation for nuanced, instruction-following output, Claude is the preferred model for complex analytical and creative tasks.

Google Cloud Platform combines world-class data analytics, AI infrastructure (TPUs, Vertex AI), and the original managed Kubernetes. Its global fiber backbone and Preemptible VMs offer compelling price-performance for data-heavy and containerized workloads.

OpenRouter acts as a unified gateway that routes API requests across dozens of inference providers - OpenAI, Anthropic, Google, Together, Groq, and more - through a single API key. It automatically selects the best available provider for each model, with transparent pricing and the ability to fallback if one endpoint goes down.

Frequently asked questions

How much does Claude Haiku 4.5 cost per million tokens?
The lowest input price we track for Claude Haiku 4.5 is $1.00 per million tokens. Output tokens cost more; the table shows input, output, and blended pricing for every inference provider.
What is the cheapest Claude Haiku 4.5 API provider?
Sort the table by output or blended price to find the cheapest Claude Haiku 4.5 endpoint. Prices for the same model vary widely between providers, so the cheapest provider can be several times less than the most expensive.
Which providers serve the Claude Haiku 4.5 API?
Every provider with a published Claude Haiku 4.5 endpoint appears above, with input and output token pricing, context window, and throughput.
What is Claude Haiku 4.5's context window?
Claude Haiku 4.5 supports a 200K context window. A larger context window lets you pass more tokens (documents, code, history) in a single request.
Is Claude Haiku 4.5 open weight or closed source?
Claude Haiku 4.5 is a closed (proprietary) model available only through hosted APIs. Compare provider pricing above to find the cheapest endpoint.
What is blended LLM cost?
Blended cost weights input and output token prices by a typical 3:1 ratio so you can rank providers by one number instead of comparing two prices separately.