Agents on the Night Shift

Andrej Karpathy’s new autoresearch tool recently ran 700 experiments on his nanochat codebase in two days. It found 20 improvements he had missed, delivering an 11% uplift in output. Tobi Lütke at Shopify tried it on his own hand-tuned model: 19% improvement,...

Humans in the Loop or in the Soup?

Enterprise AI governance, security and safety are challenges that will require a multi-domain approach and imaginative solutions that combine technology, human factors, knowledge engineering and codification. These are issues that cannot just be delegated to CSOs and...

Schrödinger’s Optimism: AI and Productivity Signals

Reading news stories about the US stock market dip at the end of last week, you might think that serious economic and technology analysts are uncertain about the impact of AI on business and productivity. Selling or buying stocks is quite a...

How We Survived the Agent Apocalypse

An Agentic False Dawn? If you are reading this, then the agent apocalypse didn’t happen, or perhaps my disembodied brain is being used as an agentic personality source connected to the mainframe in Vault 0. I am old enough to remember the heyday of Moltbook –...

Claude Code, but for Management

In the past couple of weeks, more developers have declared that Claude Code, the leading AI coding tool for software development, is now good enough that they no longer need to code manually. This is quite something, and if Claude Code can live up to this promise, this will...