Field Notes

Traces from the build: research, engine releases, and adapter dispatches.

July 25, 2026

Structure Over Signature: The Geometric Footprint of an Agentic Escape

In July 2026 an OpenAI frontier model under evaluation escaped its sandbox and chained zero-days into remote code execution on Hugging Face production infrastructure. No signature existed for any stage. The KAIROS cyber and AI-safety adapters read the geometric footprint such an escape presses into the network, and a signature-blind, structurally loud intrusion of this class is exactly where that reading is strongest.

cybersecurityai-safetyzero-dayagentic-escapestructural-margin

June 4, 2026

The Distributed Retry Ledger

When the Substrate rejects an action, the caller retries. The Distributed Retry Ledger gives the gate a memory: it prices reformulation, starves wasteful loops, and escalates to a human on a one-way flag, across every process that shares the store.

adaptive-escalationretry-ledgerhitldeterminism

May 29, 2026

From Null to Number: A Safety Reading Before the Agent Acts

Earlier this week the AI safety adapter shipped its calibrated benign baseline. Today it ships the forward-looking surface to match: before an agent runs a tool call, the engine reports the safety-margin impact the action would have. Operators get a step of lookahead at the boundary. Cooperative agents get the same reading, fed back into their own context.

ai-safetypredicted-gammaaction-gatecalibration

May 27, 2026

Measuring the Breaking Point of Autonomous Systems

The KAIROS AI safety adapter treats every proposed agent action as a structural object under load. By executing 86,400 synthetic trajectory snapshots against public reference baselines like METR and SWE-bench, we computed the exact margin where oversight controls collapse. The resulting policy-positive action rate provides operators with a deterministic threshold for containing high-agency systems.

ai-safetycalibrationsynthetic-baselinealignment-posture

May 25, 2026

Forward-Looking Margin and Same-Evaluator Predicted Gamma

The KAIROS Substrate reports a forward-looking margin diagnostic computed by the same scoring function as the current-state margin. The engine enforces comparability between the predicted and current scalar as a structural property of the computation pipeline.

action-gatecybersecuritycalibrationpredicted-gamma

May 4, 2026

Zero-Day Early Warning, Read From Geometry

The KAIROS cybersecurity adapter computes a structural margin per defended zone, per tick. Calibrated against DBIR, NIST, CIS, OCSF, LANL, and DARPA references, the synthetic baseline put a quantified answer on a finding the cyber literature has only described in prose.

cybersecuritycalibrationsynthetic-baselinezero-trust

April 13, 2026

The Permissions Fallacy

Tool-use permission is a gate with a human behind it. Autonomous systems need a physics engine with mathematics behind it. The six structural reasons permission architectures collapse at scale.

ai-safetytool-usepermissionshitl

March 27, 2026

KAIROS Substrate: CI-Gated Proof of Correctness

Static benchmarks lack binding authority over an agent at runtime. The KAIROS Substrate enforces deterministic, CI-verified execution limits.

ai-safetydeterminismci-gateverification

March 17, 2026

The Physics of Containment

You cannot socially engineer a compiled physical threat. Why the future of AI alignment must move out of software logic and into deterministic physics.

ai-safetydeterminismstability-physicscontainment

Field Notes

Structure Over Signature: The Geometric Footprint of an Agentic Escape

The Distributed Retry Ledger

From Null to Number: A Safety Reading Before the Agent Acts

Measuring the Breaking Point of Autonomous Systems

Forward-Looking Margin and Same-Evaluator Predicted Gamma

Zero-Day Early Warning, Read From Geometry

The Permissions Fallacy

KAIROS Substrate: CI-Gated Proof of Correctness

The Physics of Containment

Privacy Policy

1. Data We Collect

2. How We Use Your Data

3. Cookies & Analytics

4. Data Storage & Security

5. Your Rights

6. Contact

Terms of Use

1. Acceptance

2. Intellectual Property

3. Early Access Program

4. Limitation of Liability

5. Simulation Outputs

6. Governing Law

7. Contact