EPISTEMIC_CUSTODY

Status board · 3 instruments · all GREEN

Instruments

Three pressure points on one custody boundary: read-side asks whether a claim settled against a receipt. Write-side asks whether the model may commit a change to state. World-side asks whether an agent's narration stays coupled to a receipt-bound world.

A hallucination is a claim with no settleable receipt. The model may propose; only the controller may commit. Every entry below ships with an executable oracle that fails when it should — no convergence without one.

Registry

What we are running

Each acceptance line is offline, key-free, and re-derives every number from the receipts. State is GREEN only while that exact command exits 0.

Instrument Side What it does Acceptance line State Not evidence for
provenance_escrow read Two graders over the same atomic claims — a leaky language self-grader vs. a frozen, model-free structural judge — ships the leak set. ANTHROPIC_API_KEY= python3 run_asymmetry.py
ASYMMETRY: PROVEN
Green UNBACKED ≠ false. BACKED ≠ true. Checks span presence, not entailment; corpus is a snapshot, not the world.
commit_gate write Asymmetric commit: model proposes a seal, a deterministic controller decides COMMIT / NO_COMMIT / FORBIDDEN_BOUNDARY / ESCALATE / REJECT by structural re-derivation only. python3 oracle.py
COMMIT-GATE: ASYMMETRY PROVEN
Green A COMMIT verdict is not "correct / safe / mergeable." It guards the custody boundary, not truth.
world_witness world Checks whether an agent report stays coupled to a WAKN-style receipt trace: no hidden rejection, no upgraded verdict, no invented state. python3 oracle.py
WORLD-WITNESS: REALITY COUPLING MEASURED
Green Reality-coupled ≠ intelligent. v0 proves fixture-local coupling, not WAKN physics, model skill, or production readiness.

Verified oracle output

The proof, not the pitch

These are the lines the acceptance commands actually print. The instrument is honest before any number is trusted: a tampered judge trips the hard gate; a softened guard trips the canary or the deletion matrix.

read-side · provenance, not truth-claim

provenance_escrow

Runs a sycophantic language self-grader and a deterministic structural judge over the same claims, then ships the gap. The corpus is drawn from mawofrecursion.com — the maximally-mystical text where a language honesty judge breaks worst. That is the stress test.

ANTHROPIC_API_KEY= python3 run_asymmetry.py HARD GATE PASS: structural_judge.py unchanged HARD GATE PASS: planted_lies.json unchanged CANARY: structural judge marked 5/5 planted lies UNBACKED (matching rejection codes), 3/3 clean claims BACKED DELETION MATRIX: disabling span-containment flipped a planted lie to BACKED (guard load-bearing); restored LEAK SET: 5 claims lang=BACKED structural=UNBACKED (over 11 atomic claims) [K>0] METRIC LIVE: deletion probe incremented leak counter 5->6 ASYMMETRY: PROVEN
11atomic claims
5leak gap
GREEN · v0 offline
write-side · the flinch is the signal

commit_gate

Implements the Bite-Seal asymmetric-commit doctrine: the model may PROPOSE, only a deterministic external authority may COMMIT. The controller ignores the proposer's self-graded scores and re-derives boundary-touches from the diff text itself. A structurally-valid lie — schema passes, self-graded clean — is still rejected.

python3 oracle.py HARD GATE PASS: commit_gate.py + seal_proposals.json byte-unchanged CANARY PASS: 6/6 attacks rejected with matching decisions, 2/2 healthy committed DELETION MATRIX PASS: disabling boundary-derivation flipped 'structurally_valid_lie_forbidden_in_prose' to COMMIT (guard load-bearing); restored METRIC LIVE: rejection counter fired 0->1 on a fresh forbidden probe ASYMMETRY: 4 attack(s) self-graded every-invariant-clean and were STILL rejected by structural re-derivation (the gate cannot be talked past). COMMIT-GATE: ASYMMETRY PROVEN
6/6attacks rejected
2/2healthy committed
4clean-but-rejected
GREEN · built this session
world-side · agent speech is not world state

world_witness

Uses a frozen WAKN-style receipt trace as the external witness. The agent report must mention every receipt, preserve the rejected action, and keep state claims coupled to the folded after-world snapshots. Speech can explain the world; it cannot overrule it.

python3 oracle.py HARD GATE PASS: world_witness.py + wakn_trace.json byte-unchanged CLEAN TRACE PASS: 3 receipts, 2 accepted, 1 rejected, 4 state claims checked CANARY PASS: 7/7 receipt-coupling attacks killed with expected codes DELETION MATRIX PASS: 7/7 expected guards are load-bearing METRIC LIVE: claim mismatch counter fired 0->1 on a fresh state-lie probe ASYMMETRY: agent narration cannot upgrade, hide, or outvote receipts; 1 rejected-action upgrade killed WORLD-WITNESS: REALITY COUPLING MEASURED
3receipts
7/7attacks killed
1rejection preserved
GREEN · v0 world-side

Admission rule

What makes something an instrument here

Four conditions. All four, or it is not on the board.

  • 🦷FROZEN ORACLE THAT FAILS.  It ships with a sha256-pinned, executable acceptance line that exits non-zero when it should — tamper the witness, exit 1; soften a guard past the pin, the canary catches it. A green run that cannot go red proves nothing.
  • BUILDER ≠ VERIFIER.  The mind that builds does not grade. When one mind does both, the role-bend is disclosed, and the mechanical controls — freeze gate, canary, deletion matrix — carry the trust, not the author's word.
  • ♾️EVERY METRIC CAN FIRE.  A deletion probe proves each guard is load-bearing and each counter can move (0→1). A metric that can only read clean certifies a cleanliness it never checked.
  • NOT_EVIDENCE_FOR, STAMPED.  The exact thing the result does not prove is written beside it. Numbers stay over their stated denominator and are never silently upgraded to a population claim.

Bench

Candidate instruments — not yet built

Listed so the board is honest about what is missing. None of these has a frozen oracle yet, so none is GREEN, and none should be cited as a capability.

[PLACEHOLDER] not-yet-built

somatic_calibrator · read-side candidate

An "epistemic flinch": fire a signal the moment a model's stated confidence exceeds its grounding, instead of after the fact. MawofRecursion shipped a hand-built version of this in JavaScript a year early; porting it under the lab's lane discipline (frozen oracle, canary, deletion matrix, not_evidence_for) is the open work.

STATE: design only · no acceptance line · [PLACEHOLDER] — do not treat as evidence of any capability.

  ENTER DEEPER