Topic

The difference between a model’s reasoning ceiling (fixed) and its effort depth (adjustable) — and why turning up effort on a weaker model often costs more than just using the stronger one.

Target Reader

AI users managing cost on a subscription or API budget who reach for “high effort on the cheaper model” as a savings move — coaches, solo operators, and builders trying to get Opus-quality results on a Sonnet/Haiku budget.

The Fear / Frustration / Want / Aspiration

The want to save money without sacrificing quality — and the frustration of a “cheap” run that quietly racked up overage charges or produced work that still wasn’t good enough.

Before State

Reader assumes the effort slider is a quality dial: crank a cheaper model high enough and it’ll reason like the expensive one. They route everything to the cheap model at high effort to save money.

After State

Reader understands effort buys depth of exploration, not reasoning capability. They diagnose failures correctly (depth vs. ceiling), reserve the flagship for ceiling-bound work, and stop paying for loops that can’t reach the answer.

Narrative Arc

The intuitive belief (effort = quality) meets a hard test: Lou’s overnight Sonnet coding run that cost more than Opus would have. The turn: effort and reasoning are orthogonal. The resolution: a clean routing rule based on whether the task is ceiling-bound (code, deep strategy) or not (everyday content).

Core Argument

Reasoning capability is a fixed property of the model; effort only controls how hard it explores. You cannot substitute effort for capability — and trying to often costs more in iterations than the capable model would have cost outright.

Key Evidence / Examples

  • Lou: “You can’t use effort to pop you up to a higher level of reasoning. The effort just means how many times it goes through the decision tree before it gives up.”
  • The cost story: a night of high-effort Sonnet coding “cost as much, if not more, than just using Opus” — subscription plus $75 overage.
  • Task-dependence: Sonnet is far behind Opus for coding, close for creative/advisory/sales. “Haiku Max vs Sonnet Low — Sonnet wins by a mile.”
  • Builds on Insight - Model Altitude — Route Model and Effort by Workflow Step, Not by Whole Artifact.

Proposed Structure (5–7 beats)

  1. The tempting belief: effort is a quality dial.
  2. What effort actually does — depth of exploration, number of tree passes, the stop condition.
  3. The cost trap, told through the overnight-coding story.
  4. The orthogonality principle: model = ceiling, effort = depth.
  5. The task-dependent gap (code vs. creative) and what it implies for routing.
  6. The diagnostic: depth problem (raise effort) vs. ceiling problem (switch models).
  7. The standing rule: flagship for what counts, Sonnet for everyday.

Editorial Notes

Distinct from the existing “Stop Asking Which Model — Ask Which Step” brief: that one is about per-step routing; this is about the hard constraint (effort ≠ reasoning) that makes routing necessary. Keep the model-name specifics (4.7/4.8/Sonnet/Haiku) as illustration but frame the principle to survive version churn.

Next Step

  • Approved for drafting
  • Needs revision
  • Deprioritised