Mistral Small 3 API pricing comparison | DeployCue

Mistral Small 3 inference pricing

Developer: Mistral AI
Quality rank: #48
Elo: 1240
Context: 128K
Weights: Open
Lowest output: $0.600
Median: $0.600
Highest: $0.600

3 results

Provider     | Plan                     | Output (per 1M tokens) | Input (per 1M tokens) | Blended | Context | Status   | Regions
Fireworks AI | Mistral Small 3          | $0.600                 | $0.200                | $0.300  | 128K    | Verified | Global
Mistral AI   | Mistral Small 3          | $0.600                 | $0.200                | $0.300  | 128K    | Verified | Global
OpenRouter   | Mistral Small 3 (routed) | $0.600                 | $0.200                | $0.300  | 128K    | Verified | Global

Providers serving this model

Fireworks AI specializes in high-throughput open-model inference powered by its custom FireAttention kernel, delivering token generation speeds that routinely beat other hosting platforms. With HIPAA compliance and a broad catalog spanning Llama, DeepSeek, Qwen, and Mistral models, it is built for latency-sensitive production applications at scale.

Mistral AI is a European frontier lab that provides first-party API access to its own Mistral Large 2 and Mistral Small 3 models, both available as open-weight releases. Known for strong multilingual performance, efficient architectures, and EU-based infrastructure with ISO 27001 compliance, it is a leading European alternative to US-based model providers.

OpenRouter acts as a unified gateway that routes API requests across dozens of inference providers (OpenAI, Anthropic, Google, Together, Groq, and more) through a single API key. It automatically selects the best available provider for each model, with transparent pricing and the ability to fall back to another endpoint if one goes down.

Frequently asked questions

How much does Mistral Small 3 cost per million tokens?
The lowest input price we track for Mistral Small 3 is $0.200 per million tokens. Output tokens cost more, with a lowest tracked price of $0.600 per million tokens; the table shows input, output, and blended pricing for every inference provider.
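As a rough illustration of how per-million pricing translates into the cost of a single request, the sketch below applies the input and output prices listed on this page. The function name `request_cost` and the token counts in the example are hypothetical; actual billing may differ by provider (for instance, in rounding or minimum charges).

```python
# Prices in USD per 1M tokens, taken from the table on this page.
INPUT_PRICE_PER_M = 0.200
OUTPUT_PRICE_PER_M = 0.600


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the approximate USD cost of one request at the listed prices."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical workload: 10,000 prompt tokens and 2,000 completion tokens.
cost = request_cost(10_000, 2_000)
print(f"${cost:.4f}")  # prints $0.0032
```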
What is the cheapest Mistral Small 3 API provider?
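Sort the table by output or blended price to find the cheapest Mistral Small 3 endpoint. Prices for the same model can vary widely between providers, so the cheapest provider can be several times cheaper than the most expensive. For Mistral Small 3, all three tracked providers currently list the same input, output, and blended prices.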
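The ranking described above can be sketched in a few lines. The `Offer` class and `cheapest` helper are illustrative names, not part of any provider's API; the prices are the ones listed in the table on this page. Because ties are possible, the helper returns every provider that shares the lowest price.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Offer:
    provider: str
    input_price: float    # USD per 1M input tokens
    output_price: float   # USD per 1M output tokens
    blended_price: float  # USD per 1M tokens, blended


# Values copied from the pricing table on this page.
OFFERS = [
    Offer("Fireworks AI", 0.200, 0.600, 0.300),
    Offer("Mistral AI", 0.200, 0.600, 0.300),
    Offer("OpenRouter", 0.200, 0.600, 0.300),
]


def cheapest(offers, key="blended_price"):
    """Return the names of all providers tied for the lowest price on `key`."""
    lowest = min(getattr(o, key) for o in offers)
    return [o.provider for o in offers if getattr(o, key) == lowest]


print(cheapest(OFFERS))                      # all three providers tie
print(cheapest(OFFERS, key="output_price"))  # same result for output price
```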
Which providers serve the Mistral Small 3 API?
Every provider with a published Mistral Small 3 endpoint appears above, with input and output token pricing, context window, and regions.
What is Mistral Small 3's context window?
Mistral Small 3 supports a 128K context window. A larger context window lets you pass more tokens (documents, code, history) in a single request.
Is Mistral Small 3 open weight or closed source?
Mistral Small 3 is an open-weight model, so you can self-host it on any GPU provider. At sufficiently high, steady volume, self-hosting can cost less than managed API pricing.
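To get a feel for when self-hosting might pay off, the sketch below estimates the monthly token volume at which a fixed GPU rental costs the same as API usage. The $2.00 hourly GPU rate is a purely hypothetical placeholder, not a quoted price, and the $0.300 figure is the blended price from this page. The calculation ignores engineering and operations costs and assumes the GPU can actually sustain the required throughput.

```python
HOURS_PER_MONTH = 730  # average hours in a month (8,760 / 12)


def break_even_tokens_per_month(
    gpu_hourly_usd: float,
    api_price_per_m_usd: float,
    hours: float = HOURS_PER_MONTH,
) -> float:
    """Monthly token volume at which a fixed GPU rental matches API spend."""
    monthly_gpu_cost = gpu_hourly_usd * hours
    return monthly_gpu_cost / api_price_per_m_usd * 1_000_000


# Hypothetical GPU rate of $2.00/hour vs. the $0.300 blended API price.
tokens = break_even_tokens_per_month(gpu_hourly_usd=2.00, api_price_per_m_usd=0.300)
print(f"{tokens / 1e9:.2f}B tokens/month")
```

Under these assumed inputs the break-even point is roughly 4.9 billion tokens per month; below that volume, the managed API would be the cheaper option in this simplified model.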
What is blended LLM cost?
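Blended cost is a weighted average of input and output token prices at a typical 3:1 input-to-output ratio, so you can rank providers by one number instead of comparing two prices separately. For Mistral Small 3, that is (3 × $0.200 + 1 × $0.600) / 4 = $0.300.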
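A minimal sketch of the blended calculation follows. It assumes the 3:1 ratio means three input tokens for every output token, which is consistent with the $0.300 blended figure shown for $0.200 input and $0.600 output pricing. The function name `blended_price` and its parameters are illustrative.

```python
def blended_price(
    input_price: float,
    output_price: float,
    input_weight: float = 3.0,
    output_weight: float = 1.0,
) -> float:
    """Weighted average of input and output prices (USD per 1M tokens)."""
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Prices from the table on this page: $0.200 input, $0.600 output.
blended = blended_price(0.200, 0.600)
print(f"${blended:.3f}")  # prints $0.300
```

Changing the weights lets you model a different workload, for example `blended_price(0.200, 0.600, input_weight=1.0, output_weight=1.0)` for an output-heavy use case with equal input and output volume.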