Devs Merge 18k Lines of 'Senior-Engineer Quality' AI Code
A small team managed to build a full product in under a month by merging 18,000 lines of Claude-generated code. Engineer Josh Albrecht detailed the workflow, which used 3-6 AI agents plus human reviewers, CI enforcement, and style guides to achieve what he described as senior-engineer quality output.
The project's use of 3-6 AI agents reflects an emerging industry pattern called multi-agent workflows. This approach moves beyond single-purpose tools like Copilot, assigning specialized AI agents to different software development tasks such as planning, coding, testing, and documentation, mirroring the structure of a human engineering team. The developer's role evolves into that of an orchestrator, guiding the team of agents. Josh Albrecht, the CTO of AI research lab Imbue, advocates for a structured methodology to manage AI code generation. His workflow emphasizes forcing the AI to create a detailed plan before writing any code and using strict style guides and documentation as first-class parts of the process to mitigate common problems with AI output. The choice of Anthropic's Claude was significant, as its "Claude Code" variant is known for successfully handling large, complex codebases where other AI agents often fail. This capability is reinforced by Anthropic's own internal practices, where the company now uses Claude to write the majority of its own code, with human engineers primarily reviewing and verifying large, AI-generated pull requests. While the output was described as "senior-engineer quality," developers find that AI-generated code can introduce significant bloat. One engineer reported reverting 43 AI-generated commits, removing 14,000 lines of overly-engineered code—a 34% reduction—with no loss in functionality. The models, trained on vast public datasets, often produce unnecessarily complex solutions based on common patterns. The financial aspect of using AI agents at scale goes beyond simple subscription fees. The true cost for professional, heavy use is estimated at $100-$200 per developer per month, not the low-tier advertised rates. Hidden costs