WRITING ON AI, AGENTS & SOCIETAL IMPACT
MAY 12, 2026
7 min read
Most agent demos work on the happy path. Real agentic systems live or die by how they handle the failure path — the calls that error, the tools that lie, the steps that get half-done.
APR 24, 2026
6 min read
The model gets the headlines. The harness — tools, sandbox, permissions, observability — is what actually determines whether an agent ships or quietly gets shelved.
APR 02, 2026
5 min read
Interpretability is often framed as a research concern. For builders, it is the only honest answer to the question every stakeholder eventually asks: why did the model do that?
MAR 14, 2026
8 min read
RAG retrieves once and prays. Agentic search reasons over results, refines its queries, and follows up on leads. They solve different problems, and people pick the wrong one all the time.
FEB 18, 2026
7 min read
Most reliability work stops at the prompt or the model card. Production AI needs operational safety: permissions, logs, rollback paths, and explicit human ownership.
JAN 30, 2026
6 min read
Chain-of-thought made models look smarter. Verification — by another model, a tool, or a human — is what actually keeps them honest on the problems that matter.