DeepMind: AlphaEvolve now scaling
- Google DeepMind said on May 7 AlphaEvolve has moved past lab demos and is now being used across Google infrastructure, science work, and Google Cloud. - The clearest proof is operational: one AlphaEvolve scheduling heuristic recovers 0.7% of Google’s global compute continuously, while BASF says planning sped up 80%. - This matters because AI coding agents are shifting from copilots to optimization systems that can change real production stacks, not just write snippets.
AlphaEvolve is not just another coding assistant. It is Google DeepMind’s attempt to build a system that searches for better algorithms — then actually ships them into working infrastructure. The news this week is that DeepMind and Google are no longer talking about it as a promising research toy. They are talking about it as something already running inside Google’s data centers, hardware stack, scientific workflows, and now customer work on Google Cloud. (deepmind.google) ### What is AlphaEvolve, exactly? Basically, it is a Gemini-powered coding agent wrapped in an evolutionary loop. The model proposes code changes, tests them against a measurable target, keeps the winners, and uses those as seeds for the next round. That matters because a lot of hard optimization work is not “write a function” work — it is “find a slightly better algorithm in a giant search space” work. DeepMind’s origin(deepmind.google)c discovery, not ordinary autocomplete. (deepmind.google) ### Why is this different from a normal AI coder? A normal coding model helps a human write software. AlphaEvolve is aimed at problems where success can be scored automatically — latency, throughput, error rates, hardware efficiency, scheduling quality, circuit simplification. That scoring loop is the trick. If the system can test thousands of variants and keep only the ones that improve the metric, it can grind toward solutions humans might not think to try. (deepmind.google) ### What changed this week? The update is really a scale claim. DeepMind says AlphaEvolve has spent the past year moving from research results into deployed use across Google’s computing ecosystem, and Google Cloud is now positioning it as something customers can use on optimization-heavy problems. The company also used this week’s post to stack up concrete examples across data centers, chip design, quantum circuits, genomics, and supply-chain planning. (deepmind.google) ### What is the strongest real-world proof? The load-bearing example is Google’s cluster scheduling. DeepMind says one AlphaEvolve-discovered heuristic has been running in Borg — Google’s internal cluster manager — and continuously recovers an average 0.7% of Google’s worldwide compute resources. At Google scale, that is a huge number, because even tiny efficiency gains compound across massive fleets of machines. (deepm([deepmind.google)-coding-agent-for-designing-advanced-algorithms/)) ### Is it only helping Google? No — and that is part of why this update matters. Google Cloud put out a companion post saying BASF used AlphaEvolve in supply-chain planning with Google Cloud and prognostica, cutting decision-making time by 80% across a complex production network. That does not mean the system is a general business oracle. But it does show the product is escaping the lab and leaving Google’s own walls. (blog.google) ### What kinds of problems fit this best? Problems with a scoreboard. That includes neural-network training kernels, data-center scheduling, circuit layouts, logistics, and some scientific search tasks. DeepMind also says AlphaEvolve helped design quantum circuits for Willow experiments with 10x lower error than earlier conventionally optimized baselines. The catch is that messy human tasks without clear evaluation metrics are much harder for this approach. (deepmind.google) ### Why mention safety in the same week? Because Google DeepMind also signed a new agreement with CAISI, the U.S. government’s Center for AI Standards and Innovation, for pre-deployment frontier-model testing. That is a separate story, but the timing is not random — if labs are pushing AI systems deeper into real infrastructure, pressure grows for more formal evaluation before release. (nist.gov)ational-security-testing)) ### Bottom line? The important shift is not that AlphaEvolve can write code. Lots of systems can do that now. The shift is that DeepMind says it can find better algorithms, prove they work against hard metrics, and deploy them into production systems that matter. If that holds up, coding agents stop being assistants and start becoming infrastructure optimizers. (deepmind.google)