Qwen3-14B

Last-generation Qwen3 14B dense with thinking mode enabled by default and strong tool-calling. Still a solid 16 GB-VRAM pick when you want dense behaviour over MoE. Qwen 3.5 skipped the 14B slot.

License: Apache 2.0 · Context: 40K native, 131K via YaRN · Released: May 2025

The decision in five lines

The call: Consider — for coding
Best for: coding
Runs on: 23 hardware picks fit (cheapest: Intel Arc B580 12 GB · $249)
Watch out: Long-context work — its 40K native ceiling (131K with YaRN) is much smaller than Qwen 3.5 siblings (262K).
Evidence: Measured · last verified July 2026

14B dense: PARAMETERS
DENSE: TYPE
40K: CONTEXT
~8 GB: VRAM AT Q4

Where we recommend this

Every tier slot in the planner where this model is a top or alternate pick. Pulled live from planner.js — when the planner refreshes, this table stays current.

CODING · MID

Qwen3-14BSticky 14B workhorse; 128K context; Apache 2.0; broad runner support.

The call

Last-generation Qwen3 14B dense with thinking mode enabled by default and strong tool-calling. Still a solid 16 GB-VRAM pick when you want dense behaviour over MoE. Qwen 3.5 skipped the 14B slot.
When not to use: Long-context work — its 40K native ceiling (131K with YaRN) is much smaller than Qwen 3.5 siblings (262K). Also no multimodal.

Runner notes

Ollama tag `qwen3:14b`. `enable_thinking=False` available if you want faster non-reasoning responses. No platform quirks.

License: Apache 2.0
Released: May 2025
Maker: Alibaba
Model card: huggingface.co/Qwen/Qwen3-14B →

Hardware that fits

Every hardware pick whose memory fits this model at the quant we recommend. Sorted cheapest-first — the top row is your best-value fit. Click through for the full buyer’s guide.

Next step

Find-by-model — see what hardware runs this→