Qwen (Alibaba)

Qwen3.5 122B (a10b)

flagship
Qwen (Alibaba) · released 2025-08-01 · text
currently routing · 4.2k rpm
Context: 131.1M tokens
Input: — / 1M
Output: — / 1M
Speed: — t/s
License: open
/ ABOUT

Qwen3.5 122B A10B Instruct is a Mixture-of-Experts (MoE) model in Alibaba's Qwen3.5 family, with 122 billion total parameters, of which 10 billion are active per token. Because only about 8% of the parameters are used for any given token, per-token compute is much lower than for a dense model of the same total size.

The model is an efficiency-focused option in the Qwen lineup, offering competitive performance on reasoning, coding, and multilingual tasks. It is designed for high-throughput production deployments where cost per token matters.

Qwen3.5 122B A10B suits applications that need good quality at low inference cost, and offers one of the best quality-per-dollar ratios in the Qwen model family.

BENCHMARKS Artificial Analysis Index
Intelligence 42

Providers for Qwen3.5 122B (a10b)

1 route · sorted by uptime

ClosedRouter routes requests to the providers best able to handle your prompt size and parameters, with automatic fallbacks to maximize uptime.
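
The routing behavior described above can be illustrated with a minimal Python sketch. It is not ClosedRouter's actual implementation or API: the `Provider` class, its fields, and the `eligible` and `route` functions are all hypothetical names chosen for illustration. The sketch assumes routing means filtering providers by prompt size and supported parameters, ordering them by 30-day uptime, and falling back to the next provider when a call fails.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Provider:
    # All fields are hypothetical; they loosely mirror the provider table columns.
    name: str
    max_context: int                  # largest prompt (in tokens) accepted
    uptime_30d: float                 # fraction between 0.0 and 1.0
    supported_params: frozenset = frozenset()
    send: Optional[Callable[[str], str]] = None  # stand-in for a network call


def eligible(providers, prompt_tokens, params):
    """Keep providers that fit the prompt and support every requested parameter."""
    fits = [
        p for p in providers
        if p.max_context >= prompt_tokens and set(params) <= p.supported_params
    ]
    # Highest uptime first, matching the "sorted by uptime" ordering.
    return sorted(fits, key=lambda p: p.uptime_30d, reverse=True)


def route(providers, prompt, prompt_tokens, params=()):
    """Try each eligible provider in order, falling back to the next on failure."""
    errors = []
    for p in eligible(providers, prompt_tokens, params):
        try:
            return p.name, p.send(prompt)
        except Exception as exc:  # a real client would catch narrower errors
            errors.append((p.name, exc))
    raise RuntimeError(f"no provider could serve the request: {errors}")
```

In this sketch a request only fails outright when every eligible provider has been tried, which is the sense in which fallbacks improve effective uptime.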

Provider: —
Context: —
Quant: bf16
Uptime · 30d: 0.00%