OpenAI's API is the gold standard for frontier LLM access, serving the flagship GPT-5 family alongside the cost-efficient GPT-4.1 and GPT-4.1 nano tiers. With 400K+ context windows, function calling, structured outputs, and a massive developer ecosystem, it powers the majority of production AI applications worldwide.
GPT-4.1 nano inference pricing
- Developer
- OpenAI
- Quality rank
- #40
- Elo
- 1255
- Context
- 1000K
- Weights
- Closed
- Lowest output
- $0.400
3 results
| Provider | Plan | Price | Regions | Visit | |||
|---|---|---|---|---|---|---|---|
|
|
Azure OpenAI GPT-4.1 nano | $0.400 | $0.100 | 1000K |
$0.400
/M tokens
Input $0.100/1M tokens
Blended $0.175
Verified
|
Global | Visit → |
|
|
GPT-4.1 nano | $0.400 | $0.100 | 1000K |
$0.400
/M tokens
Input $0.100/1M tokens
Blended $0.175
Verified
|
Global | Visit → |
|
|
GPT-4.1 nano (routed) | $0.400 | $0.100 | 1000K |
$0.400
/M tokens
Input $0.100/1M tokens
Blended $0.175
Verified
|
Global | Visit → |
Providers serving this model
Azure AI Foundry (formerly Azure AI Studio) is Microsoft's enterprise model catalog, hosting GPT-5, GPT-4.1, Mistral Large, and other partner models on Azure's managed infrastructure. With FedRAMP, HIPAA, and SOC 2 compliance, private networking, and seamless Azure RBAC integration, it is the natural choice for regulated enterprises already on Microsoft's cloud.
OpenRouter acts as a unified gateway that routes API requests across dozens of inference providers - OpenAI, Anthropic, Google, Together, Groq, and more - through a single API key. It automatically selects the best available provider for each model, with transparent pricing and the ability to fallback if one endpoint goes down.