Agents on the Night Shift

Andrej Karpathy’s new autoresearch tool recently ran 700 experiments on his nanochat codebase in two days. It found 20 improvements he had missed, delivering an 11% uplift in output. Tobi Lütke at Shopify tried it on his own hand-tuned model: 19% improvement,...

Beyond Simple Office Automation: The Rise of Super Operators

As always, there is a lot going on this week – or at least being announced – in the field of AI that is relevant to firms and other organisations who are still working out their own use of the technology, from OpenAI’s promise of GPT4.5 leading to a...

Enterprise AI: Lessons from Social Media

Recent hints by AI researchers that they believe AGI is now within reach have fuelled an increasingly polarised debate about AI harms vs benefits. Ethan Mollick recently wrote up a useful overview of developments that considered where the water is rising (LLM...