Model Routing
Configure primary, secondary, and tertiary LLM providers per agent
Anthropic via OpenRouter
16
Google via OpenRouter
2
OpenAI direct
1
Groq direct
1
Claude Max (local)
1
Total agents
21
Agent Model Assignments
Click a model slot to change it. Changes require confirmation and emit an audit event.
| # | Agent | Risk | Primary | Secondary | Tertiary |
|---|---|---|---|---|---|
| 01 | Chief of Staff | HIGH | |||
| 02 | Product Engineer | MEDIUM-HIGH | |||
| 03 | Product Manager | MEDIUM | |||
| 04 | Business Dev | MEDIUM | |||
| 05 | Ops Manager | LOW-MEDIUM | |||
| 06 | Code Reviewer | CRITICAL | |||
| 07 | Verifier | LOW | |||
| 08 | Sales Researcher | LOW | |||
| 09 | Inbox Manager | LOW-MEDIUM | |||
| 10 | DevOps Engineer | CRITICAL | |||
| 11 | Backend/ML Engineer | MEDIUM-HIGH | |||
| 12 | Best Practices Bot | MEDIUM | |||
| 13 | QA Tester | LOW | |||
| 14 | Innovator | MEDIUM | |||
| 15 | Synthesizer | LOW | |||
| 16 | Architect | MEDIUM-HIGH | |||
| 17 | CFO | CRITICAL | |||
| 20 | Director of Engineering | HIGH | |||
| 21 | Director of Operations | HIGH | |||
| 22 | Director of Reliability | HIGH | |||
| 23 | Inspector General | MEDIUM |
Available Models
Direct API (Billed)
Claude Opus 4.7
anthropicclaude-opus-4-7
$15/$75/1M
Anthropic API direct billing. Latest Opus — best for complex reasoning, agents.
Claude Opus 4.6
anthropicclaude-opus-4-6
$15/$75/1M
Anthropic API direct billing. Previous Opus generation.
Claude Sonnet 4.6
anthropicclaude-sonnet-4-6
$3/$15/1M
Anthropic API direct billing. Best quality/cost for most tasks.
Claude Sonnet 4.5
anthropicclaude-sonnet-4-5
$3/$15/1M
Anthropic API direct billing. Previous Sonnet generation.
Claude Haiku 4.5
anthropicclaude-haiku-4-5
$0.8/$4/1M
Anthropic API direct billing. Fastest Claude model.
Claude Haiku 4.5 (Oct)
anthropicclaude-haiku-4-5-20251001
$0.8/$4/1M
Full model ID form of claude-haiku-4-5 (2025-10-01 release).
GPT-4o
openaigpt-4o
$5/$15/1M
OpenAI API direct billing. Strong general-purpose model.
GPT-4o Mini
openaigpt-4o-mini
$0.15/$0.6/1M
OpenAI API direct billing. Budget option for simple tasks.
Gemini 2.5 Pro
googlegemini-2.5-pro
$1.25/$5/1M
Google AI API direct billing. Strong at long context and reasoning.
Gemini 2.5 Flash Preview
googlegemini-2.5-flash-preview-04-17
$0.1/$0.4/1M
Google AI API direct billing. Fast and cost-effective.
Grok 3
xaigrok-3
$3/$15/1M
xAI API. Director of Contrarian Review. Strong at adversarial analysis.
Grok 3 Mini
xaigrok-3-mini
$0.3/$0.5/1M
xAI API. Budget Grok variant for fast tasks.
Local Bridges ($0)
Claude Max (local bridge)
localclaude-max-bridge
Free
Routes via `claude --print` using Ethan's Claude Max subscription. $0 per call, but uses personal usage quota. Max concurrency=3, p95 latency ~3-5 min under load.
Gemini CLI (local bridge)
localgemini-bridge
Free
Routes via `gemini -p` using Google subscription. $0 per call, tight daily quota (~50 calls). Fallback only.
Local Ollama ($0)
Phi-4 14B (local)
ollamaphi4:14b
Free
⚠ Tool calls broken (registry change 2026-04-19). Use llama3.1:8b for tool-capable tasks. Runs on Mac Studio GPU, $0.
Gemma 3 27B (local)
ollamagemma3:27b
Free
⚠ No tool call support. Text generation only. Runs on Mac Studio GPU, $0.
Llama 3.1 8B (Ollama)
ollamallama3.1:8b
Free
Runs on Mac Studio via Ollama. $0, no quota, tool-call capable. Best local fallback for agents.
Free API Tier
Llama 3.3 70B
groqllama-3.3-70b-versatile
Free
Groq free tier. Rate-limited (~30 req/min). No per-token cost but has daily caps.
Llama 3.1 8B Instant
groqllama-3.1-8b-instant
Free
Groq free tier. Very fast inference, rate-limited. Best for simple/fast tasks.
Routing Change History
Last 20 model routing changes
No routing changes recorded yet.