Google AI Head Warns of Memory 'Choke Point'

Google's Demis Hassabis has identified memory as the new supply chain bottleneck for AI development. As model sizes and context windows increase, the scarcity of high-bandwidth memory (HBM) is beginning to dictate the price and availability of leading-edge compute.

- The High-Bandwidth Memory (HBM) market is an oligopoly controlled by SK Hynix, Samsung, and Micron, with SK Hynix holding over 50% of the market share in the second quarter of 2025. The supply of HBM is reportedly sold out through 2026, creating a critical chokepoint for all AI hardware producers. - Major cloud providers are aggressively developing custom silicon to mitigate dependency on third-party chip suppliers and optimize for their specific AI workloads. This includes Google's TPU v7, Microsoft's Maia 200, Amazon's Trainium and Inferentia chips, and Meta's MTIA, all designed to reduce the cost of high-volume inference tasks. - This shift to in-house chips is creating a new competitive dynamic; while NVIDIA dominates the market for training frontier AI models, hyperscalers' custom ASICs are increasingly handling inference workloads to reduce operational costs. This "build vs. buy" decision is driven by the potential to lower AI query costs by as much as 50% compared to general-purpose hardware. - The HBM shortage is causing a ripple effect, forcing memory manufacturers to reallocate production away from conventional DRAM used in PCs and smartphones. This has led to DRAM price surges of 80-90% in a single quarter and is expected to increase the cost of consumer electronics. - Venture capital investment in the AI hardware sector is surging, with over $1 billion flowing into AI-specific hardware companies in Q4 2025. Startups are attracting significant funding for new chip architectures and AI-powered design tools that promise to accelerate the creation of custom silicon. - The upfront cost of developing a custom chip at the 3nm or 5nm node can exceed $500 million. However, at the scale of hyperscalers like Microsoft, which is investing over $100 billion in new AI data centers, the long-term total cost of ownership makes vertical integration a mandatory strategy. - The go-to-market strategy for new AI chip companies is adapting to this constrained environment. Toronto-based startup Taalas, which raised $169 million, is marketing its custom silicon as being 20 times less expensive to build and 10 times more power-efficient by physically hard-wiring a specific AI model onto a chip. - This strategic focus on vertical integration could lead to hardware "vendor lock-in," where AI models optimized for one hyperscaler's custom silicon cannot be easily migrated to a competitor's infrastructure. This presents a challenge for AI startups and enterprise ML teams who need to consider the trade-offs of different compute providers.

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.