steelman-then-break

Pressure-test a design or tech choice in two forced passes — strongest case for, then strongest case against, then a calibrated verdict that's willing to say 'proceed'.

github.com/Osipchuk/agent-skills/skills/steelman-then-break ↗ SKILL.md

Install

install · cli

$ curl -fsSL https://raw.githubusercontent.com/Osipchuk/agent-skills/main/install.sh | sh -s -- steelman-then-break

One line, works with any agent — installs just this skill into ~/.claude/skills/. Needs only curl; it fetches uv and the askill CLI if they're missing.

install · claude code plugin

$ /plugin marketplace add Osipchuk/agent-skills
$ /plugin install skills@askill

Two slash commands inside Claude Code — installs the whole library as one plugin and lets Claude pick the right skill by context. No permission prompts, no remote-code-execution dance.

Installs skills/steelman-then-break/SKILL.md into ~/.claude/skills/steelman-then-break/; Claude reads it on the next session start. The plugin tab pulls the whole library instead.

When it fires

Fires on "review this proposal", "should we go with X", "steelman X", "red-team this design", or when choosing between an architecture, library, or framework. It forces the second pass that design discussions skip, and pulls prior failures from references/scars.md so objections argue from the team's own evidence.

# skills/steelman-then-break/SKILL.md — frontmatter
name: steelman-then-break
description: Pressure-test a design proposal or technology choice in two separated passes: first the strongest possible case FOR it (steelman — argue the best version, even stronger than the author put it), then a rigorous case AGAINST it (red-team — name the anti-pattern, lead with what breaks first, cite the team's prior scars), then finish with a calibrated verdict. Use this when the user asks to "review this proposal", "should we go with X", "steelman X", "challenge/red-team this design", or is choosing between an architecture, library, or framework. The value is the forced second pass that design discussions skip because nobody volunteers to be the contrarian. Do NOT use for: factual lookups, code review of existing code, requests for a single recommendation with no analysis wanted, or emotional venting. Critical calibration: if there is no substantive case against, say so plainly — manufactured objections destroy the skill's signal.

What it does

— Forced separation: the case-for has no caveats, the case-against leads with what breaks first — no hedging mush.
— Cites the team's own scars (references/scars.md), so the red-team argues from evidence not generic risk.
— Calibrated verdict: willing to conclude 'the steelman holds, proceed' instead of reflexively finding fault.
— Built-in guard against the failure mode — manufactured objections destroy the signal, so it says when there's no real case against.

Example

example · Should we adopt a graph orchestration framework? 2 turns · 700 tok

input

user

We should adopt a graph-based orchestration framework for our agents. Steelman it then break it.

↓ skill fires

skill

[steelman-then-break] proposal restated, running both passes

output

assistant

Target: replace our orchestration with a graph framework. STEELMAN: an explicit, serializable DAG gives checkpoints, resume, replay, and first-class human-in-the-loop for dual-control steps; a well-funded ecosystem lowers API risk. BREAK: you already have a working orchestrator, so migration cost dwarfs the benefit; the framework taxes you without solving your hard parts (queue, auth, audit); debug-by-graph is harder than debug-by-stack-trace for closed-contour on-call; pre-1.0 API churn. Scar 2024-Q3: we did exactly this and unwound it after 11 months. VERDICT: don't adopt now — the deciding factor is that your complexity isn't where the framework helps. What would flip it: if orchestration logic itself became the bottleneck rather than the surrounding services.

An engineer proposes adopting a framework; the skill argues both sides and lands a calibrated call against, citing a prior scar.