MODEL · GOOGLE · 31B DENSE / 26B TOTAL + 3.8B ACTIVE (MOE)
Gemma 4 (31B dense + 26B A4B MoE)
Google's April 2026 refresh — Arena top 5 in its first week, 256K context native, vision + audio multimodal. Big news: Gemma 4 moved to Apache 2.0 from the custom Gemma Terms. The current Apache-2.0 "best dense under 70B" pick.
License: Apache 2.0 (moved off Gemma Terms) · Context: 256K · Released: April 2, 2026
The decision in five lines
- The call
- Buy — for chat
- Best for
- chat · docs
- Runs on
- 16 hardware picks fit (cheapest: Minisforum UM890 Pro · $463)
- Watch out
- Tight VRAM budgets under 16 GB — even Gemma 4 26B MoE wants 15 GB at Q4.
- Evidence
- Estimated
- 31B dense
- PARAMETERS
- DENSE + MOE
- TYPE
- 256K
- CONTEXT
- ~18 GB (31B dense) / ~15 GB (26B MoE)
- VRAM AT Q4
Where we recommend this
Every tier slot in the planner where this model is a top or alternate pick. Pulled live from planner.js — when the planner refreshes, this table stays current.
The call
Google's April 2026 refresh — Arena top 5 in its first week, 256K context native, vision + audio multimodal. Big news: Gemma 4 moved to Apache 2.0 from the custom Gemma Terms. The current Apache-2.0 "best dense under 70B" pick.
When not to use: Tight VRAM budgets under 16 GB — even Gemma 4 26B MoE wants 15 GB at Q4. For those budgets, Qwen 3.5 9B fits better.
Runner notes
Ollama tags `gemma4:31b` and `gemma4:26b`. Ollama may lag on the audio modality path — use llama.cpp head for full multimodal. MoE routing overhead can hurt vLLM concurrency vs dense equivalents under heavy batching.
Hardware that fits
Every hardware pick whose memory fits this model at the quant we recommend. Sorted cheapest-first — the top row is your best-value fit. Click through for the full buyer’s guide.
- Minisforum UM890 ProPerfect · 1.4× 32 GB DDR5 (shared) · $463–$580 all-in
- AMD Radeon RX 7900 XTXPerfect · 1.4× 24 GB · $760 used / ~$1,500 new
- NVIDIA RTX 3090 (used, single)Perfect · 1.4× 24 GB · $950–$1,200
- MacBook Air M5 24 GBRequires tweak · 1.3× 24 GB unified · $1,299–$1,699
- Mac Mini M4 Pro 24 GBRequires tweak · 1.3× 24 GB unified · $1,399
- Dual RTX 3090 (used)Perfect · 2.9× 48 GB · $1,800–$2,500 all-in
- Framework Desktop (Ryzen AI Max+ 395)Perfect · 5.1× 128 GB unified · $1,999–$2,851
- NVIDIA RTX 4090Perfect · 1.4× 24 GB · $2,200–$2,800
- M5 Pro MacBook Pro 48 GBPerfect · 1.9× 48 GB unified · $2,599–$3,099
- NVIDIA RTX 5090Perfect · 1.9× 32 GB · $2,910–$4,300
- Mac Studio M4 Max 64 GBPerfect · 2.6× 64 GB unified · $3,199
- NVIDIA RTX A6000 (48 GB, used)Perfect · 2.9× 48 GB ECC · $3,500–$4,500
- Mac Studio M3 Ultra 96 GBPerfect · 3.8× 96 GB unified · $3,999
- M5 Max MacBook Pro 64 GBPerfect · 2.6× 64 GB unified · $4,499
- NVIDIA DGX SparkPerfect · 5.1× 128 GB unified · $4,699
- Dual RTX 5090Perfect · 3.8× 64 GB (2×32) · $8,500–$10,500
Next step
Find-by-model — see what hardware runs this→