the AI bench
VERIFIED JUNE 2026
All models

MODEL · OPENBMB · 8B (SIGLIP-400M + WHISPER-MEDIUM + CHATTTS-200M + QWEN2.5-7B)

MiniCPM-o 2.6

GPT-4o-class omnimodal 8B — vision, speech input, speech output, voice cloning in one model. End-to-end with full-duplex live streaming.

License: Apache 2.0 (commercial use requires registration questionnaire) · Context: 32K · Released: January 2025

The decision in five lines

The call
Skip for local — for voice
Best for
voice
Runs on
23 hardware picks fit (cheapest: Intel Arc B580 12 GB · $249)
Watch out
Pure TTS or pure STT — the full multimodal stack is overkill.
Evidence
Estimated · last verified June 2026

8B (SigLip-400M + Whisper-medium + ChatTTS-200M + Qwen2.5-7B)
PARAMETERS
MULTIMODAL
TYPE
32K
CONTEXT
~7 GB (int4) / ~16 GB (FP16)
VRAM AT Q4

Where we recommend this

Every tier slot in the planner where this model is a top or alternate pick. Pulled live from planner.js — when the planner refreshes, this table stays current.

VOICE · LOW
MiniCPM-o 2.6 (int4)Apache 2.0 (commercial use needs registration questionnaire); 8B multimodal at ~7GB VRAM; full-duplex voice + vision on laptop GPUs.

The call

GPT-4o-class omnimodal 8B — vision, speech input, speech output, voice cloning in one model. End-to-end with full-duplex live streaming.

When not to use: Pure TTS or pure STT — the full multimodal stack is overkill. Use Kokoro or faster-whisper for single-purpose pipelines.

Runner notes

llama.cpp for CPU, vLLM for throughput, or Ollama (`openbmb/minicpm-o2.6:8b`). int4 variant fits ~7 GB VRAM. Commercial use requires filling OpenBMB's registration form — not fully frictionless. Sibling `MiniCPM-V-4.6` (May 2026, vision-only) is the lighter and newer alternative when you don't need voice I/O.

License
Apache 2.0 (commercial use requires registration questionnaire)
Released
January 2025
Maker
OpenBMB

Hardware that fits

Every hardware pick whose memory fits this model at the quant we recommend. Sorted cheapest-first — the top row is your best-value fit. Click through for the full buyer’s guide.

Next step

Find-by-model — see what hardware runs this