Anthropic pushes custom chips plan

- Anthropic is in early talks to buy inference chips from London startup Fractile, adding a possible fourth silicon supplier beside Nvidia, Google, and Amazon. - The pitch is speed and cost: Fractile says its SRAM-based design could run frontier-model inference up to 25x faster and 10x cheaper. - The backdrop is brutal compute demand — Anthropic just locked in multi-gigawatt Google TPU and AWS capacity, but still wants more leverage.

Anthropic is no longer treating chips like a boring supplier decision. It is treating them like product strategy. That is the real news here. The company is in early talks with Fractile, a London startup building inference chips, while also exploring whether it should design more of its own silicon stack over time. That matters because Claude’s bottleneck is no longer just model quality — it is the cost and speed of serving those models at huge scale. (theinformation.com) ### What happened today? The immediate story is a reported set of talks between Anthropic and Fractile. Fractile is building chips aimed at inference — the part where a trained model actually answers your prompt — and Anthropic is looking at them as another supply option alongside Nvidia GPUs, Google TPUs, and Amazon’s Trainium line. The talks appear early, and Fractile’s hardware is not expected to be commercially ready until around 2027. (theinformation.com) ### Why inference chips, not just more GPUs? Because the problem has shifted. Training giant models is still expensive, but serving them to millions of users all day can become the bigger recurring bill. Inference is full of memory traffic, latency penalties, and wasted power. Fractile’s whole bet is that if memory and compute sit together on the sa(theinformation.com)ms big gains in speed and cost. (fractile.ai) ### What is Fractile actually selling? Basically, a different architecture. Standard AI systems lean heavily on external DRAM, which is fast but still forces constant movement between memory and the processor. Fractile says its “memory-compute fusion” design keeps more of that work on-chip in SRAM. The sales pitch is simple — less movement, lower latency, better economics for running large models. Whether those(fractile.ai)orkloads is the open question, but the target is clear: frontier-model inference, not generic compute. (fractile.ai) ### Why is Anthropic suddenly so aggressive about supply? Because its scale changed fast. In early April, Anthropic said its run-rate revenue had passed $30 billion, up from about $9 billion at the end of 2025. In the same stretch, it signed for multiple gigawatts of next-generation TPU capacity with Google and Broadcom starting in 2027. Then it expanded its AWS relationship too, committing more than $100 billi(fractile.ai) and access to Amazon’s custom silicon roadmap. A company does not make those deals unless compute has become existential. (anthropic.com) ### So is Anthropic building its own chips too? Maybe — but that part is still exploratory. Reporting in April said Anthropic was weighing custom chip design in response to persistent AI chip shortages and rising demand. The important distinction is that this is not yet a launched chip program with a public roadmap. It is more like Anthropic deciding that “buy whatever is available” is no longer a serious long-term plan. (cnbc.com) ### Why does this matter beyond Anthropic? Because the AI race is moving up the stack. First the fight was over models. Then it was over cloud capacity. Now it is over who controls the economics of inference. If Anthropic can mix suppliers — and maybe eventually shape its own silicon — it gets more than cheaper compute. It gets bargaining power, more c(cnbc.com)’s roadmap. (cnbc.com) ### What is the catch? Time. Fractile’s chips are not ready now, and custom silicon is brutally hard. Advanced chip programs can cost hundreds of millions of dollars before they prove anything. So the near-term reality is still cloud deals, TPU allocations, and Trainium capacity. The Fractile talks matter less as immediate deployment news than as a signal that Anthropic wants optionality everywhere it can get it. (tech.yahoo.com) ### Bottom line Anthropic is trying to stop compute from becoming someone else’s tax on Claude. Fractile may or may not become a real supplier, and an in-house chip may or may not happen. But the direction is unmistakable — leading AI labs now think silicon choices can shape product speed, margins, and independence just as much as model breakthroughs can. (theinformation.com)

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.