🔮 The only AI curve that matters

There is one AI metric that we keep a really close eye on: How many actions can a system take at 99% reliability before a human must intervene? We call this the 99% step-length: the number of sequential actions an AI can execute with at least 99% reliability without...

Claude Code sub-agents

Claude Code includes the ability to run sub-agents, where a separate agent loop with a fresh token context is dispatched to achieve a goal and report back when it’s done. I wrote a bit about how these work in June when I traced Claude Code’s activity by...