Qwen 2.5 7B inference pricing

Developer: Alibaba
Quality rank: #62
Elo: -
Context: 131K
Weights: Open
Lowest output: -

0 results

No offerings match your filters.

Frequently asked questions

How much does Qwen 2.5 7B cost per million tokens?

The lowest input price we track for Qwen 2.5 7B is - per million tokens. Output tokens cost more; the table shows input, output, and blended pricing for every inference provider.

What is the cheapest Qwen 2.5 7B API provider?

Sort the table by output or blended price to find the cheapest Qwen 2.5 7B endpoint. Prices for the same model vary widely between providers, so the cheapest provider can be several times less than the most expensive.

Which providers serve the Qwen 2.5 7B API?

Every provider with a published Qwen 2.5 7B endpoint appears above, with input and output token pricing, context window, and throughput.

What is Qwen 2.5 7B's context window?

Qwen 2.5 7B supports a 131K context window. A larger context window lets you pass more tokens (documents, code, history) in a single request.

Is Qwen 2.5 7B open weight or closed source?

Qwen 2.5 7B is an open-weight model, so you can self-host it on any GPU provider, which usually beats managed API pricing at scale.

What is blended LLM cost?

Blended cost weights input and output token prices by a typical 3:1 ratio so you can rank providers by one number instead of comparing two prices separately.