Skip to content
Qwen (Alibaba)

Qwen3.5 4B

flagship
Qwen (Alibaba) · released 2025-08-01 · text
currently routing · 4.2k rpm
32.8M tokens
Context
— / 1M
Input
— / 1M
Output
— t/s
Speed
open
License
/ ABOUT

Qwen3.5 4B is the compact variant of Alibaba's Qwen3.5 model family, designed for efficient deployment in resource-constrained environments. Despite its small 4-billion parameter size, it delivers capable performance on text generation, instruction following, and reasoning tasks.

The model is optimized for on-device inference, edge deployment, and high-throughput batch processing scenarios where larger models would be impractical. It supports multilingual text processing with good coverage of Chinese and English, along with a reasonable context window for its size class.

Qwen3.5 4B is ideal for mobile applications, embedded systems, and scenarios where a small, fast model is needed without sacrificing too much quality.

BENCHMARKS Artificial Analysis Index
Intelligence 39

Providers for Qwen3.5 4B

1 routes · sorted by uptime

ClosedRouter routes requests to the providers best able to handle your prompt size and parameters, with automatic fallbacks to maximize uptime.

Provider
Context
Quant
Uptime · 30d
bf16
0.00%