Iron Laws and the four canonical discipline-prose patterns
This page covers the patterns superpowers added on top of the agent-skills spec for discipline-shaped skills. Layer 2 conventions, all four. None of them are required by the spec; all four are observably effective for skills that need to reliably steer agent behavior. The patterns:- Iron Law (block-quoted code-fenced single sentence).
- Forbidden-rationalization table (
| Excuse | Reality |). - Red Flag list (“STOP and Start Over”).
- Spirit-vs-letter clause.
Iron Law
A non-negotiable rule. Single sentence, all-caps (or close to it), no exceptions, block-quoted with code-fence styling so it visually stands out from prose. The shape:Karpathy-wiki Iron Laws
Fromkarpathy-wiki/skills/karpathy-wiki/SKILL.md lines 48-60:
.manifest.json origin to "file", "mixed", the evidence type, the empty string, or a relative path.
Forbidden-rationalization table
A two-column markdown table. Left column: verbatim rationalizations the agent might produce under pressure. Right column: the counter to each. The table is built from observation, not invention. You write down the actual rationalizations you saw in pressure tests or in production failures, then write the counter. Inventing rationalizations does not work; the agent will produce different ones than you guessed.Karpathy-wiki rationalization table
Fromkarpathy-wiki/skills/karpathy-wiki/SKILL.md lines 435-458 (excerpt; the full table has 16 rows):
needs_more_detail rejection clauses.
Red Flag list
A bulleted list of thought patterns the agent should recognize as signaling a violation in progress. Each red flag closes with a single action: STOP, expand, capture, etc. The list is written from the agent’s first-person perspective (paradoxically; this is the one place first-person is allowed because the agent is recognizing its own thought).Karpathy-wiki red flag example
Fromkarpathy-wiki/skills/karpathy-wiki/SKILL.md lines 175-181:
Spirit-vs-letter clause
A single sentence that closes the loophole “but I followed the letter of the rule.” The clause makes the spirit of the rule normative; it forbids rationalization-by-technicality. The shape:Violating the letter of the rules is violating the spirit of the rules.Or, in the karpathy-wiki shape:
“I have a good reason to skip this capture” (any rationalization not literally in this table). Cite the exact SKILL.md line that justifies skipping. If you cannot quote it, you are fabricating.The cure for the technicality loophole is to require explicit citation. If the agent must quote the line that authorizes the deviation, the agent cannot rationalize without leaving evidence.
When to use which pattern
- Iron Law. For absolute invariants. Use sparingly; three Iron Laws are powerful, twenty are noise. Karpathy-wiki has three plus one strengthened rule.
- Rationalization table. When you have observed real rationalizations in pressure testing or production. Do not invent rows; observe them.
- Red Flag list. For thought patterns the agent should recognize as warning signs. Use when the violation has a recognizable upstream signature.
- Spirit-vs-letter clause. Use once per skill, near the rationalization table. It is the meta-rule that protects all the others.
docs/07-mechanism-vs-decoration.md for the audit method.
Sources
LANDSCAPE1.2 (superpowers patterns).LANDSCAPE3.4 (the five canonical patterns: this doc covers four; the fifth, the “Don’t X. Don’t Y. Don’t Z.” enumeration, is folded into the rationalization table).KP-SKILLlines 48-60 (Iron Laws block).KP-SKILLlines 435-458 (rationalization table).KP-SKILLlines 175-181 (red flag list).
docs/07-mechanism-vs-decoration.md (every Iron Law begs a mechanism question).