Qwen3 4B
flagshipcurrently routing · 4.2k rpm
32K tokens
Context
open
License
/ ABOUT
Qwen3-4B is a 4 billion parameter dense language model from the Qwen3 series, designed to support both general-purpose and reasoning-intensive tasks. It introduces a dual-mode architecture—thinking and non-thinking—allowing dynamic switching between high-precision logical reasoning and efficient dialogue generation. This makes it well-suited for multi-turn chat, instruction following, and complex agent workflows.
BENCHMARKS Artificial Analysis Index
Intelligence 16
Providers for Qwen3 4B
1 routes · sorted by uptimeClosedRouter routes requests to the providers best able to handle your prompt size and parameters, with automatic fallbacks to maximize uptime.
Provider
Context
Quant
Uptime · 30d