Claude agents outperform humans
- In recent lab runs, Claude-based agents recovered alignment performance far faster than human teams. (x.com)
- The headline metric: agents reached 97% performance recovery in five days, versus 23% in seven days for humans, at a cost of about $18k. (x.com)
- The comparison circulated widely on social media, prompting technical threads that dissected the agent workflows, cost, and test design. (x.com)