All model IDs and context windows verified against vendor docs on the lastVerified date above. Legacy models that are still available but superseded live in their own table at the bottom.
| Model | API ID | Context | Max output | Best for |
|---|---|---|---|---|
| Claude Opus 4.7 | claude-opus-4-7 | 1M tokens | 128k | Most complex reasoning + agentic coding; default for hardest tasks |
| Claude Sonnet 4.6 | claude-sonnet-4-6 | 1M tokens | 64k | Best speed/quality balance; default for most agent workloads |
| Claude Haiku 4.5 | claude-haiku-4-5-20251001 | 200k tokens | 64k | Cheap, fast — classification, routing, high-volume tasks |
All three support extended thinking (Sonnet + Haiku) or adaptive thinking (Opus), vision, 1M tokens on Opus/Sonnet, and the Message Batches API for 50% off.
Aliases (claude-opus-4-7, claude-sonnet-4-6, claude-haiku-4-5) auto-resolve to the latest snapshot. Pin dated IDs (claude-haiku-4-5-20251001) for reproducibility.
Source: platform.claude.com/docs/en/about-claude/models/overview.
OpenAI's model lineup moves fastest. As of 2026-04-17, visit platform.openai.com/docs/models for the authoritative current list — our verification fetch on this page returned 403 during the audit. In general, expect:
| Model | Best for |
|---|---|
| Gemini 3.1 Pro | Complex reasoning + coding; flagship |
| Gemini 3 Flash | Fast general-purpose |
| Gemini 3.1 Flash-Lite | Highest throughput |
| Gemini 2.5 Flash / Flash-Lite | Legacy, still available |
Source: ai.google.dev/gemini-api/docs/models.
| Model family | Provider | Notes |
|---|---|---|
| Llama (code variants) | Meta | Run locally via Ollama / vLLM; varies by size |
| Codestral | Mistral | Open-source, code-focused |
| DeepSeek | DeepSeek | Strong SWE-bench performance, low cost |
| Qwen Coder | Alibaba | Code-focused, multiple sizes |
Run locally via Ollama, vLLM, or llama.cpp. Latency and capability ceiling are lower than frontier models, but data stays on your machine.
These models continue to work but are superseded — consider migrating:
| Model | API ID | Status |
|---|---|---|
| Claude Opus 4.6 | claude-opus-4-6 | Superseded by Opus 4.7 |
| Claude Sonnet 4.5 | claude-sonnet-4-5-20250929 | Superseded by Sonnet 4.6 |
| Claude Opus 4.5 | claude-opus-4-5-20251101 | Legacy |
| Claude Sonnet 4 | claude-sonnet-4-20250514 | Deprecated, retires 2026-06-15 |
| Claude Opus 4 | claude-opus-4-20250514 | Deprecated, retires 2026-06-15 |
| Claude Haiku 3 | claude-3-haiku-20240307 | Deprecated, retired 2026-04-19 |
Always check model deprecations before pinning an older ID.
Rule of thumb, optimized for cost/quality:
See Cost Optimization § Model Routing for the split-model pattern.
Search for a command to run...