Shopify's AI-first playbook

- Bessemer published a detailed April 2 interview with Shopify engineering chief Farhan Thawar, laying out how Shopify runs AI-first development across engineering and beyond. - The sharpest detail is the plumbing: one LLM proxy under many tools — Cursor, Copilot, Claude Code and others — with spend alerts and usage tracking. - What matters is the posture shift: AI is becoming operating infrastructure, not a side tool, and review capacity may become the real bottleneck.

Software teams have spent two years arguing about which AI coding tool wins. Shopify’s answer is basically: that’s the wrong layer to standardize. The interesting part of its AI-first playbook is not Cursor versus Copilot. It’s the internal plumbing underneath them — plus the cultural rules around when humans still have to step in. That matters because Shopify is one of the clearest examples yet of a large engineering org treating AI like core infrastructure instead of a perk. (bvp.com) ### What actually changed? The new piece is a Bessemer interview and companion PDF published on April 2, 2026, built around comments from Farhan Thawar, Shopify’s VP and head of engineering. Thawar says his team is roughly 20% more productive, but he frames that as a conservative estimate and not a story about cranking out more lines of code. The gains show up in faster prototyping, trying more approaches, and getting better first drafts from both engineers and non-engineers. (bvp.com) ### Why isn’t this just “everyone uses one AI tool”? Because Shopify standardized the gateway, not the app. The company built an LLM proxy that routes requests from multiple tools through one control layer, which lets teams use Cursor, Claude Code, GitHub Copilot, OpenAI Codex, Gemini Jules, and internal tools without losing visibility into cost or usage. That is a much more durable bet in a market where the best model and the best interface keep changing. (bvp.com) ### What does the proxy buy them? Three things. Cost control, analytics, and flexibility. The proxy lets Shopify buy tokens in bulk, track usage by team or project, and trigger spend alerts when usage spikes. The PDF says the company also caps iteration depth on agentic loops after learning that a single person can run up five-figure weekly token bills. So the guardrails are real, but they sit behind the scenes rather than blocking experimentation up front. (bvp.com) ### Why does that matter more than the tools? Because tool churn is brutal right now. If you hardwire a company to one coding assistant, you are really betting that one vendor will keep winning on model quality, UX, and pricing. Shopify’s setup treats the assistants like interchangeable front ends. Think of it like standardizing the electrical system in a building instead of standardi(bvp.com) Shopify’s architecture and Bessemer’s framing, but it fits the design pretty cleanly. (bvp.com) ### Is this only for engineers? No — and that is one of the more revealing parts. Shopify says designers, PMs, sales, finance, and HR are also building higher-fidelity outputs with AI. The company even learned that Cursor was the wrong fit for many non-engineers because it was expensive and awkward for non-coders, so it shifted those teams toward workflow tools like Gumloop and Shopify’s LibreChat. That (bvp.com)verage. (bvp.com) ### What are the guardrails? Human review still matters. Shopify says production code still requires human pull-request review, reversion rates have not worsened, and AI is also used for security analysis and fuzzing rather than just generation. There is also a cultural rule here: engineers are still expected to understand the systems below their work. Thawar’s warning is that AI should remove toil, not remove thinking. (bvp.com) ### So what’s the catch? The bottleneck may move. If AI makes code generation much faster, then code review and system comprehension become the scarce resources. Shopify basically says that out loud. That is why the interesting story here is not headcount replacement. It is that engineering management starts to look more like traffic control — managing review bandwidth, tool routing, and quality gates across a much faster stream of output. (bvp.com) ### Bottom line? Shopify’s playbook is a pretty clean signal for where bigger software orgs are heading. AI is turning into shared infrastructure with budgets, observability, and policy — not just a shiny assistant in someone’s editor. If that model spreads, the winners may be the companies that build the best internal control plane, not the ones that pick the “best” chatbot this quarter. (bvp.com)

Shopify's AI-first playbook

Get your own daily briefing