Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
41 changes: 21 additions & 20 deletions docs/cli/user-guides/choosing-your-model.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -28,26 +28,27 @@ This guide was last updated on Wednesday, June 3rd 2026.

## 1 · Current stack rank (March 2026)

| Rank | Model | Why we reach for it |
| ---- | ----------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------ |
| 1 | **Claude Opus 4.7** | Newest Anthropic flagship with **Max** reasoning; top pick for the hardest work. Promotional 1× multiplier through April 30 (2× afterward). |
| 2 | **Claude Opus 4.6** | Previous Anthropic flagship with **Max** reasoning; still excellent depth and safety for complex work. |
| 3 | **Claude Opus 4.6 Fast** | Opus 4.6 tuned for faster response times; 12× multiplier. |
| 4 | **Claude Opus 4.5** | Proven quality-and-safety balance; strong default for TUI and exec. |
| 5 | **Claude Sonnet 4.6** | **Max** reasoning at the Sonnet price point (1.2×); strong daily driver for planning and implementation. |
| 6 | **GPT-5.4** | Latest OpenAI model with 922K context, 128K output, verbosity support, and **Extra High** reasoning; excellent for large-context tasks. |
| 7 | **Claude Sonnet 4.5** | Strong daily driver with balanced cost/quality; great general-purpose choice when you don't need Opus-level depth. |
| 8 | **GPT-5.3-Codex** | Newest OpenAI coding model with **Extra High** reasoning and verbosity support; strong for implementation-heavy tasks. |
| 9 | **GPT-5.2-Codex** | Proven OpenAI coding model with **Extra High** reasoning; solid for implementation-heavy tasks. |
| 10 | **GPT-5.2** | OpenAI model with verbosity support and reasoning up to **Extra High**. |
| 11 | **Claude Haiku 4.5** | Fast, cost-efficient for routine tasks and high-volume automation. |
| 12 | **Gemini 3.1 Pro** | Newer Gemini Pro generation with strong structured outputs and mixed reasoning controls for research-heavy tasks. |
| 13 | **Gemini 3 Flash** | Fast, cheap (0.2× multiplier) with full reasoning support; great for high-volume tasks where speed matters. |
| 14 | **Droid Core (MiniMax M2.7)** | Open-source, 0.12× multiplier with reasoning support (Low/Medium/High) and image support; cheapest model available. |
| 15 | **Droid Core (GLM-5.1)** | Open-source, 0.55× multiplier, newer GLM option for bulk automation and air-gapped environments; no image support. |
| 16 | **Droid Core (GLM-5)** | Open-source, 0.4× multiplier, stable choice for bulk automation and air-gapped environments; no image support. |
| 17 | **Droid Core (Kimi K2.6)** | Open-source, 0.4× multiplier with image support and optional High reasoning; good for cost-sensitive work when you still want a thinking toggle. |
| 18 | **Droid Core (Kimi K2.5)** | Open-source, 0.25× multiplier with image support; older Kimi option for cost-sensitive work. |
| Rank | Model | Why we reach for it |
| ---- | ---------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------ |
| 1 | **Claude Opus 4.7** | Newest Anthropic flagship with **Max** reasoning; top pick for the hardest work. Promotional 1× multiplier through April 30 (2× afterward). |
| 2 | **Claude Opus 4.6** | Previous Anthropic flagship with **Max** reasoning; still excellent depth and safety for complex work. |
| 3 | **Claude Opus 4.6 Fast** | Opus 4.6 tuned for faster response times; 12× multiplier. |
| 4 | **Claude Opus 4.5** | Proven quality-and-safety balance; strong default for TUI and exec. |
| 5 | **Claude Sonnet 4.6** | **Max** reasoning at the Sonnet price point (1.2×); strong daily driver for planning and implementation. |
| 6 | **GPT-5.4** | Latest OpenAI model with 922K context, 128K output, verbosity support, and **Extra High** reasoning; excellent for large-context tasks. |
| 7 | **Claude Sonnet 4.5** | Strong daily driver with balanced cost/quality; great general-purpose choice when you don't need Opus-level depth. |
| 8 | **GPT-5.3-Codex** | Newest OpenAI coding model with **Extra High** reasoning and verbosity support; strong for implementation-heavy tasks. |
| 9 | **GPT-5.2-Codex** | Proven OpenAI coding model with **Extra High** reasoning; solid for implementation-heavy tasks. |
| 10 | **GPT-5.2** | OpenAI model with verbosity support and reasoning up to **Extra High**. |
| 11 | **Claude Haiku 4.5** | Fast, cost-efficient for routine tasks and high-volume automation. |
| 12 | **Gemini 3.1 Pro** | Newer Gemini Pro generation with strong structured outputs and mixed reasoning controls for research-heavy tasks. |
| 13 | **Gemini 3 Flash** | Fast, cheap (0.2× multiplier) with full reasoning support; great for high-volume tasks where speed matters. |
| 14 | **Droid Core (NVIDIA Nemotron 3 Ultra)** | Open-source, 0.4× multiplier, best NVIDIA model for long running agents. |
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[P1] Keep Nemotron reflected in config sections

This table adds Droid Core (NVIDIA Nemotron 3 Ultra) and describes it as open-source, but the later Reasoning effort settings list and Open-source models section don’t mention it; that makes the guide internally inconsistent and leaves readers without configuration guidance for the newly-recommended model (also consider bumping the “last updated” date since the page content changed).

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[P2] Hyphenate “long-running” in the new row

In the new Nemotron entry, “long running” is used as a compound adjective; hyphenating it reads more naturally and avoids the common ambiguity in English.

Suggested change
| 14 | **Droid Core (NVIDIA Nemotron 3 Ultra)** | Open-source, 0.4× multiplier, best NVIDIA model for long running agents. |
| 14 | **Droid Core (NVIDIA Nemotron 3 Ultra)** | Open-source, 0.4× multiplier, best NVIDIA model for long-running agents. |

| 15 | **Droid Core (MiniMax M2.7)** | Open-source, 0.12× multiplier with reasoning support (Low/Medium/High) and image support; cheapest model available. |
| 16 | **Droid Core (GLM-5.1)** | Open-source, 0.55× multiplier, newer GLM option for bulk automation and air-gapped environments; no image support. |
| 17 | **Droid Core (GLM-5)** | Open-source, 0.4× multiplier, stable choice for bulk automation and air-gapped environments; no image support. |
| 18 | **Droid Core (Kimi K2.6)** | Open-source, 0.4× multiplier with image support and optional High reasoning; good for cost-sensitive work when you still want a thinking toggle. |
| 19 | **Droid Core (Kimi K2.5)** | Open-source, 0.25× multiplier with image support; older Kimi option for cost-sensitive work. |

<Note>
We ship model updates regularly. When a new release overtakes the list above, we update this page and the
Expand Down
Loading