MODEL · ALIBABA · 35B TOTAL / 3B ACTIVE
Qwen 3.5 35B-A3B
The 24 GB-VRAM unlock — dense-27B quality at 3B-active speed, and the community workhorse for mixed coding / chat / docs where breadth matters. 256 experts with 8 routed + 1 shared per token.
License: Apache 2.0 · Context: 262K native, extendable to ~1M via YaRN · Released: February 16, 2026
The decision in five lines
- The call: Buy, for coding
- Best for: coding · chat · docs · agents
- Runs on: 16 hardware picks fit (cheapest: Minisforum UM890 Pro · $463)
- Watch out: AMD ROCm on older llama.cpp builds (MoE HIP kernels had stability issues through 2025; check release notes before pulling).
- Evidence: Measured

- Parameters: 35B total
- Type: MoE
- Context: 262K
- VRAM at Q4: ~17 GB
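The ~17 GB figure is roughly what weight-only arithmetic predicts. The sketch below is a minimal estimate, assuming about 4 bits per weight for Q4 and ignoring KV cache and runtime overhead. Real Q4 files often store somewhat more than 4 bits per weight, so treat the result as a floor. The function name is illustrative, not part of any tool.

```python
def weight_memory_gb(total_params: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in decimal gigabytes."""
    bytes_total = total_params * bits_per_weight / 8
    return bytes_total / 1e9

# 35B total parameters at an assumed 4 bits per weight.
q4_gb = weight_memory_gb(35e9, 4.0)
print(f"Q4 weights: ~{q4_gb:.1f} GB")
```

The footprint is set by the total parameter count (35B), not the active count (3B): every expert has to be resident even though only a few run per token.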
Where we recommend this
Every tier slot in the planner where this model is a top or alternate pick. Pulled live from planner.js — when the planner refreshes, this table stays current.
The call
The 24 GB-VRAM unlock — dense-27B quality at 3B-active speed, and the community workhorse for mixed coding / chat / docs where breadth matters. 256 experts with 8 routed + 1 shared per token.
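To make "8 routed + 1 shared per token" concrete, here is a toy routing sketch in plain Python. Only the expert counts come from this page. The gating rule (softmax over the selected logits) and the random router scores are assumptions for illustration, not the model's actual implementation.

```python
import math
import random

NUM_EXPERTS = 256  # routed expert pool, per the spec above
TOP_K = 8          # routed experts chosen per token

def route_token(router_logits: list[float], top_k: int = TOP_K) -> dict[int, float]:
    """Pick the top_k experts by logit and softmax-normalize their weights."""
    order = sorted(range(len(router_logits)),
                   key=lambda i: router_logits[i], reverse=True)
    top = order[:top_k]
    peak = max(router_logits[i] for i in top)  # subtract max for numerical stability
    exps = {i: math.exp(router_logits[i] - peak) for i in top}
    total = sum(exps.values())
    return {i: v / total for i, v in exps.items()}

random.seed(0)
logits = [random.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]  # stand-in router scores
gates = route_token(logits)

# The shared expert always runs, so 8 routed + 1 shared experts touch this token.
experts_run = len(gates) + 1
print(f"{experts_run} of {NUM_EXPERTS + 1} experts run for this token")
```

Because only a small fraction of the expert weights participate in each token, per-token compute tracks the active parameter count rather than the total.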
When not to use: AMD ROCm on older llama.cpp builds (MoE HIP kernels had stability issues through 2025; check release notes before pulling). Also skip it for pure coding, where Qwen3-Coder-30B-A3B is sharper.
Runner notes
Ollama tag `qwen3.5:35b` (the bare `35b` tag is the A3B MoE; `:35b-a3b` does not resolve). Q4 fits a single 24 GB card with context headroom. MoE shines on vLLM/SGLang; Ollama works but is slower on MoE routing than the dedicated engines.
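A minimal sketch of calling the model through Ollama's local HTTP API, using only the Python standard library. It assumes a default Ollama install listening on port 11434 with the `qwen3.5:35b` tag already pulled. The helper names `build_request` and `generate` are illustrative.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # default local endpoint (assumed)

def build_request(prompt: str, model: str = "qwen3.5:35b") -> urllib.request.Request:
    """Build a non-streaming generate request for a local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

def generate(prompt: str) -> str:
    """Send the prompt and return the generated text; needs a running server."""
    with urllib.request.urlopen(build_request(prompt)) as resp:
        return json.loads(resp.read())["response"]

# Example, with the server running and the model pulled:
#   print(generate("Explain mixture-of-experts routing in one sentence."))
```

Building the request separately from sending it keeps the payload easy to inspect before any network call is made.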
Hardware that fits
Every hardware pick whose memory fits this model at the quant we recommend. Sorted cheapest-first, so the top row is your best-value fit.
- Minisforum UM890 Pro · Good · 1.3× · 32 GB DDR5 (shared) · $463–$580 all-in
- AMD Radeon RX 7900 XTX · Good · 1.3× · 24 GB · $760 used / ~$1,500 new
- NVIDIA RTX 3090 (used, single) · Good · 1.3× · 24 GB · $950–$1,200
- MacBook Air M5 24 GB · Requires tweak · 1.2× · 24 GB unified · $1,299–$1,699
- Mac Mini M4 Pro 24 GB · Requires tweak · 1.2× · 24 GB unified · $1,399
- Dual RTX 3090 (used) · Perfect · 2.6× · 48 GB · $1,800–$2,500 all-in
- Framework Desktop (Ryzen AI Max+ 395) · Perfect · 4.6× · 128 GB unified · $1,999–$2,851
- NVIDIA RTX 4090 · Good · 1.3× · 24 GB · $2,200–$2,800
- M5 Pro MacBook Pro 48 GB · Perfect · 1.7× · 48 GB unified · $2,599–$3,099
- NVIDIA RTX 5090 · Perfect · 1.7× · 32 GB · $2,910–$4,300
- Mac Studio M4 Max 64 GB · Perfect · 2.3× · 64 GB unified · $3,199
- NVIDIA RTX A6000 (48 GB, used) · Perfect · 2.6× · 48 GB ECC · $3,500–$4,500
- Mac Studio M3 Ultra 96 GB · Perfect · 3.5× · 96 GB unified · $3,999
- M5 Max MacBook Pro 64 GB · Perfect · 2.3× · 64 GB unified · $4,499
- NVIDIA DGX Spark · Perfect · 4.6× · 128 GB unified · $4,699
- Dual RTX 5090 · Perfect · 3.5× · 64 GB (2×32) · $8,500–$10,500
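The fit-then-sort logic behind this list can be sketched in a few lines. This is an illustration only: it uses the ~17 GB Q4 figure from this page as the threshold and a handful of rows copied from the list. The real planner may apply further rules (for example, how much unified memory is actually usable) that are not shown here.

```python
from dataclasses import dataclass

MODEL_Q4_GB = 17  # approximate Q4 footprint quoted on this page

@dataclass
class Pick:
    name: str
    memory_gb: int
    low_price_usd: int

# A few rows copied from the list above, deliberately out of order.
picks = [
    Pick("NVIDIA RTX 5090", 32, 2910),
    Pick("Minisforum UM890 Pro", 32, 463),
    Pick("Dual RTX 3090 (used)", 48, 1800),
    Pick("AMD Radeon RX 7900 XTX", 24, 760),
]

def fitting_picks(candidates, model_gb=MODEL_Q4_GB):
    """Keep picks with enough memory for the model, cheapest first."""
    fits = [p for p in candidates if p.memory_gb >= model_gb]
    return sorted(fits, key=lambda p: p.low_price_usd)

for p in fitting_picks(picks):
    print(f"{p.name}: {p.memory_gb} GB, from ${p.low_price_usd:,}")
```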