The Seam
The boundary between deterministic structure and probabilistic content is drawn explicitly everywhere it appears; each side of the seam is scoped to exactly what it can guarantee.
Twelve Parts, four volumes, one claim: accountable AI in regulated domains is achieved by composing the model with a deterministic substrate on both sides of inference, not by improving the model. This is the architecture, the proof, and what it requires of every party who touches it.
This page is an architecture and assurance argument. It is not a proof of legal compliance, a certification of any specific deployment, or a claim that any model's output is true. Where a claim depends on counsel validation, carrier confirmation, or customer-specific fact clearance, the full paper marks it explicitly.
State the bridging-object thesis — the Regulatory Interpretation (legal register) and the Governance Specification (technical register) are the same artifact; the Custodial Architecture is the accountable human process that authors, owns, and certifies it. Constrained decoding compiles a grammar into a runtime token mask; constrained encoding renders a Governance Specification into a governed prompt. Same pattern, both sides of inference.
The boundary between deterministic structure and probabilistic content is drawn explicitly everywhere it appears; each side of the seam is scoped to exactly what it can guarantee.
9606 owns the encoding surface and platform; the custodial function owns certification and custody; the customer owns the Regulatory Interpretation and audit record. Stated to the byte, not left to drift.
Every claim in the underlying paper is tagged as mathematical, empirical, engineering, legal, or regulatory-posture. A claim that doesn't announce its category is the paper's own failure, not the reader's to guess.
A transformer is a parameterized, continuous, almost-everywhere-differentiable function from token sequences to a probability distribution — extraordinarily capable, and structurally incapable of self-certification. Four independent results establish this; none depends on the others, and none is solved by scale.
Pure self-attention drives token representations toward a rank-one matrix at a doubly-exponential rate in depth; reliability is a contingent internal equilibrium the residual stream and FFN hold off, not a guaranteed property.
Capability is not reliability. Reasoning is brittle under semantics-preserving reformulation in ways the model does not signal; this boundary does not move under scaling the way the capability boundary does.
Training data sits at Pearl's Rung 1 (association). A model trained on association cannot, by that training alone, certify Rung-2/3 (intervention/counterfactual) claims — exactly the claims a deterministic governance layer needs to make.
Autoregressive serving introduces batch-dependent, floating-point-order non-determinism a bare model does not control for and an audit cannot replay without pinning it explicitly.
“The model will sometimes be wrong, and has no internal signal for when. The architecture's claim has never depended on that being false.”
Execution authority lives outside the model, in a formally verifiable Control Graph built on finite state machines and Petri nets. The model is bracketed: a deterministic encode stage assembles the governed prompt before inference; a deterministic decode stage verifies and constrains the output after it. C = M_post ∘ L_θ ∘ A_pre — and the composability theorem proves the bracketed system inherits the substrate's safety and authorization guarantees for every model placed inside it.
All permitted actions, sequencing, and compliance rules encoded as topology (FSM/Petri net), not as prompts. A policy encoded as a transition cannot be violated by the model; it can only fail to be reached.
Generative models are interchangeable workers behind a stable control surface. Switching models is a configuration change, not a rewrite — the control graph's guarantees are proven independent of which L_θ sits inside the bracket.
A versioned, five-component Governance Specification (Domain Definition, Structural Schema, Relationship Map, Output Grammar, Guardrail Set) is rendered deterministically into the governed prompt before the model ever runs.
The Output Grammar declared at encode time is enforced as a hard token mask where the endpoint permits, a checked guarantee where it does not, then layered content verification runs on the committed output.
You cannot prevent a language model from being wrong. The architecture does not claim to. What it claims, and proves, is that every way a governed output can be wrong falls into exactly one of six layers, and every layer is handled by one of three treatments — ordered by strength, applied at the strongest tier available to that layer.
| Tier | Treatment | What it means | Example layers |
|---|---|---|---|
| 1 | Eliminated by construction | Occurrence probability zero, not low | Safety, authorization, reproducibility |
| 2 | Bounded and detected | Cannot be eliminated; probability is bounded and occurrence is detectable | Model-content residual (the one layer the model itself owns) |
| 3 | Attributed | Cannot be fully eliminated or detected, but maps to a named, accountable party with an evidence trail | Legal-adequacy, certification, use-scope errors |
The lattice is revisable. An incident-review mechanism reclassifies failure modes the current enumeration hasn't yet anticipated — totality is a claim about the architecture's six-layer pipeline structure, not an assertion that every failure mode is already catalogued.
Accountability requires more than an architecture — it requires an institution. The Custodial Architecture is the federated structure that authors, owns, and certifies the Governance Specification, allocating responsibility to a named party at every layer before anything goes wrong, not after.
Owns the encoding surface, the encoding methodology, and the platform. Guarantees faithful execution: that a correct specification is rendered correctly.
Owns legal-adequacy: that the Governance Specification correctly states what governing law or policy requires. A professional judgment, not a technical one.
Owns the certification framework and the custody record — the versioned, dated proof that a given specification was reviewed and by whom.
Owns the Regulatory Interpretation and the audit record for their own deployment; bears use-scope responsibility for how the certified architecture is actually deployed.
“A wrong governed prompt is always attributable to exactly one side of a drawn line: a correct specification rendered unfaithfully, or a specification that was itself wrong. The seam doesn't eliminate error — it makes every error locatable.”
The cost advantage and the governance are the same architectural decision, viewed twice. A system that assembles a bounded governed prompt before inference spends fewer tokens than a system asking the model to manage itself — and the saving is structural, not promotional.
Self-managing agent stacks bill every planning, routing, and reflection step as consumed tokens, on infrastructure whose providers face a real, if non-absolute, tension between consumption revenue and consumption-reducing tooling.
A governed prompt gives the model exactly one bounded task with no self-management overhead. The tokens spent are the tokens the task requires, not the tokens the model's own self-management would add.
Each task routes to the least expensive model that satisfies the governed requirement; a stable governance prefix makes long governed prompts cache-efficient at scale. Both savings are the same ordering decision, viewed from two angles.
The architecture is not advanced as a compliance conclusion — that determination belongs to counsel and the regulator against specific facts. What it supplies, by construction, is the set of properties regulated environments are increasingly built to demand.
| Regulatory requirement | What the architecture supplies |
|---|---|
| Reproducibility | The same trigger and configuration version yield the same governed prompt, every time |
| Auditability | Every inference event is bound to a versioned governance state and a provenance-tagged context package |
| Pre-execution control | Compliance is enforced before the model acts, not checked after |
| Data residency | The governed prompt is assembled under a customer-selected data-boundary mode (Strong / Moderate / Weak), enforced as a constraint on assembly, not a policy |
Regulatory frameworks are stated at a level of generality that does not depend on specific provisions; verify current authority with qualified counsel before any compliance representation.
An architecture is validated by contact with a real regulated domain, not by internal coherence. Two reference engagements supply that contact, presented at the conservative register their actual stage warrants.
Applies the architecture to verifying water deliveries under Utah water-rights law, a domain that is simultaneously legal, physical, and auditable, with no human-in-the-loop absorbing model uncertainty. Tests source-corpus legal adequacy and the legal-to-deterministic conversion under public-interest stakes.
Encodes governing compliance obligations (Investment Advisers Act, SEC marketing-rule requirements, firm policy) for a regulated-communications workflow, producing a customer-specific audit package reviewable by counsel, a board, or an examiner.
Deployment status and customer relationships are stated at the stage represented in current internal materials; specific facts are described at a level of generality that does not depend on publication clearance.
The AI infrastructure market has three layers — model providers, application platforms, and the largely unoccupied middle: reasoning and orchestration infrastructure between the two. 9606 occupies that layer, defined by pre-inference specification, a deterministic execution boundary, constrained output, verification, audit, and accountable ownership.
A revenue model anchored to the control layer is positively correlated with model-market competition; the enterprise with the control layer captures falling model prices through configuration, not migration.
Years of formal-methods and production-systems work, not prompt engineering added to a model API.
Model providers, agent frameworks, guardrail libraries, compliance-workflow vendors, legal AI vendors, and services firms are each a genuinely different position — most are complementary, not competitive, once read at the layer they actually occupy.
Open Teams Incubator Fund IV ($225M); portfolio includes NumPy, SciPy, PyTorch, Anaconda. Travis Oliphant (creator of NumPy, founder of Anaconda) is a founding partner.
Chief Executive Officer.
Chief Technology Officer.
Chief Operating Officer.