Mistral Large 2 inference pricing

Developer: Mistral AI
Quality rank: #28
Elo: 1295
Context: 128K
Weights: Open
Lowest output: $6.00

Lowest output

$6.00

Median

$6.00

Highest

$6.00

4 results

Provider	Plan	Output $/1M	Input $/1M	Context	Price	Regions	Visit
Amazon Web Services	Bedrock Mistral Large 2	$6.00	$2.00	128K	$6.00 /M tokens Input $2.00/1M tokens Blended $3.00 Verified Jun 20, 2026	Global	Visit →
Azure AI Foundry	Azure AI Mistral Large 2	$6.00	$2.00	128K	$6.00 /M tokens Input $2.00/1M tokens Blended $3.00 Verified Jun 20, 2026	Global	Visit →
Mistral AI	Mistral Large 2	$6.00	$2.00	128K	$6.00 /M tokens Input $2.00/1M tokens Blended $3.00 Verified Jun 20, 2026	Global	Visit →
OpenRouter	Mistral Large 2 (routed)	$6.00	$2.00	128K	$6.00 /M tokens Input $2.00/1M tokens Blended $3.00 Verified Jun 20, 2026	Global	Visit →

Providers serving this model

Amazon Web Services

Amazon Web Services is the world's largest cloud provider with 200+ services across compute, storage, databases, ML, and networking. Dominates in enterprise with the broadest global region footprint and the deepest service catalog, but pricing complexity and egress fees add up at scale.

Azure AI Foundry

Azure AI Foundry (formerly Azure AI Studio) is Microsoft's enterprise model catalog, hosting GPT-5, GPT-4.1, Mistral Large, and other partner models on Azure's managed infrastructure. With FedRAMP, HIPAA, and SOC 2 compliance, private networking, and seamless Azure RBAC integration, it is the natural choice for regulated enterprises already on Microsoft's cloud.

Mistral AI

Mistral AI is a European frontier-lab providing first-party API access to its own Mistral Large 2 and Mistral Small 3 models - both available as open-weight releases. Known for strong multilingual performance, efficient architectures, and EU-based infrastructure with ISO 27001 compliance, it is the leading European alternative to US-based model providers.

OpenRouter

OpenRouter acts as a unified gateway that routes API requests across dozens of inference providers - OpenAI, Anthropic, Google, Together, Groq, and more - through a single API key. It automatically selects the best available provider for each model, with transparent pricing and the ability to fallback if one endpoint goes down.

Frequently asked questions

How much does Mistral Large 2 cost per million tokens?

The lowest input price we track for Mistral Large 2 is $2.00 per million tokens. Output tokens cost more; the table shows input, output, and blended pricing for every inference provider.

What is the cheapest Mistral Large 2 API provider?

Sort the table by output or blended price to find the cheapest Mistral Large 2 endpoint. Prices for the same model vary widely between providers, so the cheapest provider can be several times less than the most expensive.

Which providers serve the Mistral Large 2 API?

Every provider with a published Mistral Large 2 endpoint appears above, with input and output token pricing, context window, and throughput.

What is Mistral Large 2's context window?

Mistral Large 2 supports a 128K context window. A larger context window lets you pass more tokens (documents, code, history) in a single request.

Is Mistral Large 2 open weight or closed source?

Mistral Large 2 is an open-weight model, so you can self-host it on any GPU provider, which usually beats managed API pricing at scale.

What is blended LLM cost?

Blended cost weights input and output token prices by a typical 3:1 ratio so you can rank providers by one number instead of comparing two prices separately.