Llama 4 Maverick inference pricing
- Developer
- Meta
- Quality rank
- #30
- Elo
- -
- Context
- 1048K
- Weights
- Open
- Lowest output
- -
0 results
No offerings match your filters.
Frequently asked questions
How much does Llama 4 Maverick cost per million tokens?
The lowest input price we track for Llama 4 Maverick is - per million tokens. Output tokens cost more; the table shows input, output, and blended pricing for every inference provider.
What is the cheapest Llama 4 Maverick API provider?
Sort the table by output or blended price to find the cheapest Llama 4 Maverick endpoint. Prices for the same model vary widely between providers, so the cheapest provider can be several times less than the most expensive.
Which providers serve the Llama 4 Maverick API?
Every provider with a published Llama 4 Maverick endpoint appears above, with input and output token pricing, context window, and throughput.
What is Llama 4 Maverick's context window?
Llama 4 Maverick supports a 1048K context window. A larger context window lets you pass more tokens (documents, code, history) in a single request.
Is Llama 4 Maverick open weight or closed source?
Llama 4 Maverick is an open-weight model, so you can self-host it on any GPU provider, which usually beats managed API pricing at scale.
What is blended LLM cost?
Blended cost weights input and output token prices by a typical 3:1 ratio so you can rank providers by one number instead of comparing two prices separately.