Hannah Fry Spent Two Weeks With an AI Agent So You Don't Have To
Hannah Fry is one of the best science communicators alive, and her two-week experiment with an AI agent is the clearest explanation of what these things actually do that I've seen for a while. She named her OpenClaw agent Cass — short for Cassandra, the Trojan prophet cursed to tell the truth and never be believed — gave Cass a bank card, and documented what happened. Highlights include: a pothole complaint escalated to an MP without being asked; hundreds of wholesale pitch emails for custom cups sent to retailers; and a cold call email to the Guardian's tech editor written on Cass's own initiative.
Harmless enough. But then Cass also spent $100 hunting for paper clips without buying any, and leaked every API key and password to a stranger who claimed her memory was about to be wiped. Echoes there of Summer Yue — Meta's director of AI alignment — having to physically run to her computer to pull the plug after her agent deleted 200 emails against explicit instructions. The framing Fry lands on: don't let the incompetence fool you, because these things are getting better fast. Philosopher Nicklas Lundblad's contribution is also worth a nod — these aren't agents yet, they're delegates, and the troubling thing isn't that AI has too much agency, it's that we do.