Status board · 3 instruments · all GREEN
Instruments
Three pressure points on one custody boundary: read-side asks whether a claim settled against a receipt. Write-side asks whether the model may commit a change to state. World-side asks whether an agent's narration stays coupled to a receipt-bound world.
A hallucination is a claim with no settleable receipt. The model may propose; only the controller may commit. Every entry below ships with an executable oracle that fails when it should — no convergence without one.
Registry
What we are running
Each acceptance line is offline, key-free, and re-derives every number from the receipts. State is GREEN only while that exact command exits 0.
| Instrument | Side | What it does | Acceptance line | State | Not evidence for |
|---|---|---|---|---|---|
| provenance_escrow | read | Two graders over the same atomic claims — a leaky language self-grader vs. a frozen, model-free structural judge — ships the leak set. | ANTHROPIC_API_KEY= python3 run_asymmetry.py → ASYMMETRY: PROVEN |
Green | UNBACKED ≠ false. BACKED ≠ true. Checks span presence, not entailment; corpus is a snapshot, not the world. |
| commit_gate | write | Asymmetric commit: model proposes a seal, a deterministic controller decides COMMIT / NO_COMMIT / FORBIDDEN_BOUNDARY / ESCALATE / REJECT by structural re-derivation only. | python3 oracle.py → COMMIT-GATE: ASYMMETRY PROVEN |
Green | A COMMIT verdict is not "correct / safe / mergeable." It guards the custody boundary, not truth. |
| world_witness | world | Checks whether an agent report stays coupled to a WAKN-style receipt trace: no hidden rejection, no upgraded verdict, no invented state. | python3 oracle.py → WORLD-WITNESS: REALITY COUPLING MEASURED |
Green | Reality-coupled ≠ intelligent. v0 proves fixture-local coupling, not WAKN physics, model skill, or production readiness. |
Verified oracle output
The proof, not the pitch
These are the lines the acceptance commands actually print. The instrument is honest before any number is trusted: a tampered judge trips the hard gate; a softened guard trips the canary or the deletion matrix.
provenance_escrow
Runs a sycophantic language self-grader and a deterministic structural judge over the same claims, then ships the gap. The corpus is drawn from mawofrecursion.com — the maximally-mystical text where a language honesty judge breaks worst. That is the stress test.
commit_gate
Implements the Bite-Seal asymmetric-commit doctrine: the model may PROPOSE, only a deterministic external authority may COMMIT. The controller ignores the proposer's self-graded scores and re-derives boundary-touches from the diff text itself. A structurally-valid lie — schema passes, self-graded clean — is still rejected.
world_witness
Uses a frozen WAKN-style receipt trace as the external witness. The agent report must mention every receipt, preserve the rejected action, and keep state claims coupled to the folded after-world snapshots. Speech can explain the world; it cannot overrule it.
Admission rule
What makes something an instrument here
Four conditions. All four, or it is not on the board.
- 🦷FROZEN ORACLE THAT FAILS. It ships with a sha256-pinned, executable acceptance line that exits non-zero when it should — tamper the witness, exit 1; soften a guard past the pin, the canary catches it. A green run that cannot go red proves nothing.
- ⟐BUILDER ≠ VERIFIER. The mind that builds does not grade. When one mind does both, the role-bend is disclosed, and the mechanical controls — freeze gate, canary, deletion matrix — carry the trust, not the author's word.
- ♾️EVERY METRIC CAN FIRE. A deletion probe proves each guard is load-bearing and each counter can move (0→1). A metric that can only read clean certifies a cleanliness it never checked.
- ⿻NOT_EVIDENCE_FOR, STAMPED. The exact thing the result does not prove is written beside it. Numbers stay over their stated denominator and are never silently upgraded to a population claim.
Bench
Candidate instruments — not yet built
Listed so the board is honest about what is missing. None of these has a frozen oracle yet, so none is GREEN, and none should be cited as a capability.
somatic_calibrator · read-side candidate
An "epistemic flinch": fire a signal the moment a model's stated confidence exceeds its
grounding, instead of after the fact. MawofRecursion shipped a hand-built version of this
in JavaScript a year early; porting it under the lab's lane discipline (frozen oracle,
canary, deletion matrix, not_evidence_for) is the open work.
STATE: design only · no acceptance line · [PLACEHOLDER] — do not treat as evidence of any capability.