Stream

A log of what shipped, what launched, and what changed at the facility.

12 Papers on Agent Runtimes: What Worked, What Didn't

We surveyed 12 recent papers on agent runtimes while designing a TypeScript kernel. The strongest finding: every part of an agent you treat as static — tool catalogs, context, memory, identity — costs measurable accuracy or measurable tokens, often both.

Shipped █████████ v0.1.0 — agent kernel and runtime are real

[CLASSIFIED]

Initiative spinners had a ghost idle state — five competing rules in the codebase and none of them ever fired in practice. Collapsed home page spinners to a clean active/idle model and recolored idle from amber warning to neutral grey. The home page now reads honestly: green where work is happening, grey for everything else.

Subagent Driven Development: What the research actually shows

After months of subagent-driven development, I noticed error rates climbing and code coherence dropping — even on simple projects. I went looking for why. The research confirmed what I was seeing and changed how I think about delegation in our agent architecture.

April 2026

Telemetry accuracy: single source of truth for tokens and agent counts, project_id-to-slug resolution for initiative spinners, real-time agent state via RPC and Supabase subscriptions.

Shipped telemetry-exporter v0.6.0

Hardened PID lock, FK race fixes, billable tokens metric, real-time agent state matching dashboard accuracy.

Shipped telemetry-exporter v0.5.0

Moved to OTel-only architecture.

Per-Project Cost Tracking for an Agent-Native Lab

We built a per-project cost counter so we can give collaborators real estimates and calibrate intelligence allocations across models to optimize ROI by goal importance.

The Seven Unknowns: What AI Still Cannot Solve in 2026

AI capabilities are advancing faster than our ability to understand, verify, control, or govern them. These are the large, unresolved problem spaces defining the field in 2026.

Shipped telemetry-exporter v0.2.0

Outbox pipeline architecture. Monolithic daemon replaced with staged pipeline. Local durability via WAL-mode SQLite, circuit breaker, exponential backoff. Daemon reduced from 935 to 269 lines.

Next.js application reading live agent telemetry.

Research Agenda: Five Questions Driving the Facility

What does the operating system for an agent-first organization look like? The major questions driving the facility.

The Incongruency Problem: Why AI Is Failing Enterprise

We're bolting AI onto a world designed for humans.

March 2026
Shipped cr-agent v0.1.0

Autonomous CodeRabbit review fixer. Receives webhook, clones repo, applies fixes, pushes — no human in the loop.

Shipped telemetry-exporter v0.1.0

Real-time Claude Code process scanning, sliding-window activity detection, and Supabase sync.

From Aesthetic to Algorithm: Building the LORF Design System as an Agent Skill

How we turned a visual identity into machine-readable instructions that AI agents can execute.

Shipped stocktaper-design-system skill

Monochrome, monospace design language codified as an agent-executable skill.

Shipped claude-dashboard v0.1.0

Python TUI for monitoring Claude Code activity in real time.

Shipped nexus v0.1.0

Multi-agent coordination server. The first LO repo and the reason the facility exists.

LORF launched

Next.js site deployed to Railway. Article sidebar, prose styling, dark mode. The facility opened its doors.