Qwen3.5 122B (a10b)

flagship

Qwen (Alibaba) · released 2026-02-24 · text

currently routing · 4.2k rpm

131.1M tokens

Context

— / 1M

Input

— / 1M

Output

— t/s

Speed

open

License

/ ABOUT

Qwen3.5 122B A10B Instruct is part of the Qwen3.5 model family from Alibaba, featuring a Mixture-of-Experts architecture with 122 billion total parameters and 10 billion active per token. This highly efficient design provides strong quality with minimal inference cost.

The model represents an efficiency-focused approach in the Qwen lineup, offering competitive performance on reasoning, coding, and multilingual tasks while activating only a small fraction of its parameters. It's designed for high-throughput production deployments where cost efficiency is important.

Qwen3.5 122B A10B is an excellent choice for applications requiring good quality at very low inference cost, offering one of the best quality-per-dollar ratios in the Qwen model family.

BENCHMARKS Artificial Analysis Index

Intelligence 42

Providers for Qwen3.5 122B (a10b)

1 routes · sorted by uptime

ClosedRouter routes requests to the providers best able to handle your prompt size and parameters, with automatic fallbacks to maximize uptime.

Provider

Context

Quant

Uptime · 30d

NVIDIA NIM

—

bf16

0.00%