gpt-oss-20b inference pricing

Developer: OpenAI
Quality rank: #44
Elo: -
Context: 131K
Weights: Open
Lowest output: -

0 results

No offerings match your filters.

Frequently asked questions

How much does gpt-oss-20b cost per million tokens?

The lowest input price we track for gpt-oss-20b is - per million tokens. Output tokens cost more; the table shows input, output, and blended pricing for every inference provider.

What is the cheapest gpt-oss-20b API provider?

Sort the table by output or blended price to find the cheapest gpt-oss-20b endpoint. Prices for the same model vary widely between providers, so the cheapest provider can be several times less than the most expensive.

Which providers serve the gpt-oss-20b API?

Every provider with a published gpt-oss-20b endpoint appears above, with input and output token pricing, context window, and throughput.

What is gpt-oss-20b's context window?

gpt-oss-20b supports a 131K context window. A larger context window lets you pass more tokens (documents, code, history) in a single request.

Is gpt-oss-20b open weight or closed source?

gpt-oss-20b is an open-weight model, so you can self-host it on any GPU provider, which usually beats managed API pricing at scale.

What is blended LLM cost?

Blended cost weights input and output token prices by a typical 3:1 ratio so you can rank providers by one number instead of comparing two prices separately.