the AI bench
VERIFIED JULY 2026
All models

MODEL · DEEPREINFORCE AI · 9B DENSE / 31B DENSE / 35B-A3B MOE / 397B MOE — POST-TRAINED ON GEMMA 4 + QWEN 3.5 BASES

Ornith-1.0 (9B / 31B / 35B-MoE / 397B-MoE)

A self-improving open-source family of agentic coding models, post-trained on Gemma 4 and Qwen 3.5 bases. DeepReinforce reports SOTA among comparable open models on Terminal-Bench 2.1, SWE-Bench (Verified/Pro/Multilingual), NL2Repo, and OpenClaw. The training idea: RL that jointly optimizes both the solution rollout and the scaffold that drives it. MIT, no regional limits. The 9B / 31B / 35B are single-GPU-deployable; the 397B MoE is hosted / big-iron.

License: MIT · Context: 128K–400K depending on variant · Released: June 21, 2026

The decision in five lines

The call
Consider — runnable locally, family reference
Best for
Local evaluation and family reference
Runs on
23 hardware picks fit (cheapest: Intel Arc B580 12 GB · $249)
Watch out
General chat or non-coding work — Ornith is tuned specifically for agentic coding loops.
Evidence
Estimated · last verified July 2026

9B dense
PARAMETERS
AGENTIC CODING MODELS
TYPE
128K–400K
CONTEXT
~5–6 GB (9B) / ~18 GB (31B / 35B-A3B) at Q4 — 397B is big-iron
VRAM AT Q4

Where we recommend this

This model isn’t currently in an active planner slot. See the runner notes below if you’re running it anyway.

The call

A self-improving open-source family of agentic coding models, post-trained on Gemma 4 and Qwen 3.5 bases. DeepReinforce reports SOTA among comparable open models on Terminal-Bench 2.1, SWE-Bench (Verified/Pro/Multilingual), NL2Repo, and OpenClaw. The training idea: RL that jointly optimizes both the solution rollout and the scaffold that drives it. MIT, no regional limits. The 9B / 31B / 35B are single-GPU-deployable; the 397B MoE is hosted / big-iron.

When not to use: General chat or non-coding work — Ornith is tuned specifically for agentic coding loops. Benchmarks are vendor-reported; verify on your own repo before switching from Qwen3-Coder-30B-A3B or North Mini Code.

Runner notes

vLLM with a modified Qwen chat template (see the model card). 35B-A3B fits a single 24 GB rig at Q4; the 9B runs on ~8 GB. Community GGUFs emerging. Pair with a real agent harness (OpenHands, Claude Code, mini-SWE-agent) — it targets those loops.

License
MIT
Released
June 21, 2026
Maker
DeepReinforce AI

Hardware that fits

Every hardware pick whose memory fits this model at the quant we recommend. Sorted cheapest-first — the top row is your best-value fit. Click through for the full buyer’s guide.

Next step

Find-by-model — see what hardware runs this