Nvidia Preps New Inference-Focused Chip

Nvidia is reportedly set to unveil a game-changing AI chip at its upcoming GTC event. The new hardware is said to be focused on rapid query processing, directly addressing the growing market for inference acceleration where competitors are gaining ground.

The market for AI inference is projected to surge from approximately $106 billion in 2025 to over $250 billion by 2030, shifting the primary battleground from model training to real-time query processing. This explosive growth is forcing a strategic pivot from general-purpose GPUs toward more specialized, efficient hardware. Nvidia's move comes amid rising pressure from competitors who are gaining traction in the inference space. Hyperscalers like Google with its TPUs and Amazon with Inferentia are developing custom silicon, while startups like Groq have demonstrated significant performance gains with specialized architectures like their Language Processing Unit (LPU). The upcoming chip, expected to be unveiled at GTC 2026, may incorporate technology from Groq, a startup whose team was largely "acqui-hired" by Nvidia. This talent acquisition is a key move in the Silicon Valley ecosystem, aimed at solving the memory and latency bottlenecks that currently plague large language model responses. This new hardware is rumored to be part of the next-generation architecture, codenamed Blackwell. The Blackwell B100 chip, manufactured on a custom TSMC 4NP process, features a dual-die design with 192GB of HBM3e memory, a significant jump from the 80GB of HBM3 in the current H100 generation. OpenAI has reportedly committed to becoming a lead customer for the new processor, securing a massive purchase of "dedicated inference capacity." This signals a strong industry push for deploying more complex and autonomous "agentic AI" systems, which require faster and more efficient underlying hardware.

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.