← agent skills / steelman-then-break

steelman-then-break

Pressure-test a design or tech choice in two forced passes — strongest case for, then strongest case against, then a calibrated verdict that's willing to say 'proceed'.

install · cli
$ curl -fsSL https://raw.githubusercontent.com/Osipchuk/agent-skills/main/install.sh | sh -s -- steelman-then-break

One line, works with any agent — installs just this skill into ~/.claude/skills/. Needs only curl; it fetches uv and the askill CLI if they're missing.

Installs skills/steelman-then-break/SKILL.md into ~/.claude/skills/steelman-then-break/; Claude reads it on the next session start. The plugin tab pulls the whole library instead.

Fires on "review this proposal", "should we go with X", "steelman X", "red-team this design", or when choosing between an architecture, library, or framework. It forces the second pass that design discussions skip, and pulls prior failures from references/scars.md so objections argue from the team's own evidence.

# skills/steelman-then-break/SKILL.md — frontmatter
name: steelman-then-break
description: Pressure-test a design proposal or technology choice in two separated passes: first the strongest possible case FOR it (steelman — argue the best version, even stronger than the author put it), then a rigorous case AGAINST it (red-team — name the anti-pattern, lead with what breaks first, cite the team's prior scars), then finish with a calibrated verdict. Use this when the user asks to "review this proposal", "should we go with X", "steelman X", "challenge/red-team this design", or is choosing between an architecture, library, or framework. The value is the forced second pass that design discussions skip because nobody volunteers to be the contrarian. Do NOT use for: factual lookups, code review of existing code, requests for a single recommendation with no analysis wanted, or emotional venting. Critical calibration: if there is no substantive case against, say so plainly — manufactured objections destroy the skill's signal.
  • Forced separation: the case-for has no caveats, the case-against leads with what breaks first — no hedging mush.
  • Cites the team's own scars (references/scars.md), so the red-team argues from evidence not generic risk.
  • Calibrated verdict: willing to conclude 'the steelman holds, proceed' instead of reflexively finding fault.
  • Built-in guard against the failure mode — manufactured objections destroy the signal, so it says when there's no real case against.
example · Should we adopt a graph orchestration framework? 2 turns · 700 tok
input
user
We should adopt a graph-based orchestration framework for our agents. Steelman it then break it.
↓ skill fires
skill
[steelman-then-break] proposal restated, running both passes
output
assistant
Target: replace our orchestration with a graph framework. STEELMAN: an explicit, serializable DAG gives checkpoints, resume, replay, and first-class human-in-the-loop for dual-control steps; a well-funded ecosystem lowers API risk. BREAK: you already have a working orchestrator, so migration cost dwarfs the benefit; the framework taxes you without solving your hard parts (queue, auth, audit); debug-by-graph is harder than debug-by-stack-trace for closed-contour on-call; pre-1.0 API churn. Scar 2024-Q3: we did exactly this and unwound it after 11 months. VERDICT: don't adopt now — the deciding factor is that your complexity isn't where the framework helps. What would flip it: if orchestration logic itself became the bottleneck rather than the surrounding services.
An engineer proposes adopting a framework; the skill argues both sides and lands a calibrated call against, citing a prior scar.