Engineers Are Becoming 'Sorcerers'
The role of software engineers is fundamentally changing to supervising fleets of AI agents, according to OpenAI's head of API platform engineering, Sherwin Wu. He stated on Lenny's Podcast that engineers are becoming "wizards or sorcerers" who "cast spells" by designing, prompting, and refining AI agent behavior. At OpenAI, 95% of engineers use Codex for daily tasks, and 100% of pull requests are reviewed by the AI tool.
- The productivity impact of AI on software engineering varies, with one 2024 study of over 500 engineers showing 23% reporting a productivity increase of 50% or more. However, another study of experienced open-source developers found that using AI tools actually increased the time to complete tasks by 19%, even though the developers believed they were 20% faster. - The concept of managing "fleets of AI agents" involves engineers at OpenAI running 10 to 20 parallel Codex threads at once, where their primary role is to prompt the agents, check their progress, and steer the outputs rather than writing the code themselves. - For product leaders building enterprise-grade AI, a key concept is the emergence of "agentic AI design patterns" which provide architectural blueprints for making autonomous systems more predictable and governable. Common patterns include "Tool Use" (allowing agents to interact with external APIs), "Reflection" (enabling an agent to critique its own work), and "Planning" (decomposing complex tasks into smaller steps). - The transformation of workflows extends beyond engineering; in HR and finance, AI is being used to automate payroll calculations, enhance compliance with labor laws, and generate data-driven insights into compensation trends and workforce costs. Boston Consulting Group projects that a balanced human-AI strategy could boost HR productivity by up to 30%. - Despite rapid adoption, a trust deficit remains a significant challenge, with one 2025 survey finding that while 42% of code is AI-assisted, 96% of developers do not fully trust the output from AI. Developers report that while AI makes them faster, they must now spend more time reviewing and validating code that can contain subtle errors. - According to Sherwin Wu, the industry is shifting away from the idea of a single, general-purpose "god model" toward a portfolio of specialized AI systems. This involves using custom fine-tuning and Reinforcement Fine-Tuning (RFT) APIs to shape model behavior with proprietary data for specific tasks. - Looking ahead, Wu predicts that within 12 to 18 months, AI models will be capable of coherently handling complex tasks for upwards of six hours at a time, a significant leap from current models optimized for tasks that take only a few minutes.