GPT-4.1 nano inference pricing

Developer: OpenAI
Quality rank: #40
Elo: 1255
Context: 1000K
Weights: Closed
Lowest output: $0.400

Lowest output

$0.400

Median

$0.400

Highest

$0.400

3 results

Provider	Plan	Output $/1M	Input $/1M	Context	Price	Regions	Visit
Azure AI Foundry	Azure OpenAI GPT-4.1 nano	$0.400	$0.100	1000K	$0.400 /M tokens Input $0.100/1M tokens Blended $0.175 Verified Jun 20, 2026	Global	Visit →
OpenAI	GPT-4.1 nano	$0.400	$0.100	1000K	$0.400 /M tokens Input $0.100/1M tokens Blended $0.175 Verified Jun 20, 2026	Global	Visit →
OpenRouter	GPT-4.1 nano (routed)	$0.400	$0.100	1000K	$0.400 /M tokens Input $0.100/1M tokens Blended $0.175 Verified Jun 20, 2026	Global	Visit →

Providers serving this model

OpenAI

OpenAI's API is the gold standard for frontier LLM access, serving the flagship GPT-5 family alongside the cost-efficient GPT-4.1 and GPT-4.1 nano tiers. With 400K+ context windows, function calling, structured outputs, and a massive developer ecosystem, it powers the majority of production AI applications worldwide.

Azure AI Foundry

Azure AI Foundry (formerly Azure AI Studio) is Microsoft's enterprise model catalog, hosting GPT-5, GPT-4.1, Mistral Large, and other partner models on Azure's managed infrastructure. With FedRAMP, HIPAA, and SOC 2 compliance, private networking, and seamless Azure RBAC integration, it is the natural choice for regulated enterprises already on Microsoft's cloud.

OpenRouter

OpenRouter acts as a unified gateway that routes API requests across dozens of inference providers - OpenAI, Anthropic, Google, Together, Groq, and more - through a single API key. It automatically selects the best available provider for each model, with transparent pricing and the ability to fallback if one endpoint goes down.

Frequently asked questions

How much does GPT-4.1 nano cost per million tokens?

The lowest input price we track for GPT-4.1 nano is $0.100 per million tokens. Output tokens cost more; the table shows input, output, and blended pricing for every inference provider.

What is the cheapest GPT-4.1 nano API provider?

Sort the table by output or blended price to find the cheapest GPT-4.1 nano endpoint. Prices for the same model vary widely between providers, so the cheapest provider can be several times less than the most expensive.

Which providers serve the GPT-4.1 nano API?

Every provider with a published GPT-4.1 nano endpoint appears above, with input and output token pricing, context window, and throughput.

What is GPT-4.1 nano's context window?

GPT-4.1 nano supports a 1000K context window. A larger context window lets you pass more tokens (documents, code, history) in a single request.

Is GPT-4.1 nano open weight or closed source?

GPT-4.1 nano is a closed (proprietary) model available only through hosted APIs. Compare provider pricing above to find the cheapest endpoint.

What is blended LLM cost?

Blended cost weights input and output token prices by a typical 3:1 ratio so you can rank providers by one number instead of comparing two prices separately.