AI compute is tightening
Spot prices for recent Nvidia GPUs have risen sharply over the last two months, signalling growing scarcity and higher run costs for model inference and training (the-decoder.com). At the same time, U.S. export approvals for chips to China are slowing because the licensing agency has lost roughly a fifth of its staff, and companies report AI is already compressing traditionally long engineering tasks — Nvidia says an internal model cut an 80‑person‑month chip design task to an overnight run on one GPU (tomshardware.com) (videocardz.com).
Prices for the newest Nvidia artificial intelligence chips have climbed in recent weeks, while the United States is taking longer to approve some China-bound exports and companies are finding new ways to squeeze more work out of each processor. (the-decoder.com) The Decoder reported that spot prices for Nvidia graphics processing units, or GPUs, rose sharply over roughly the last two months, with demand for recent accelerators outpacing immediate supply in secondary markets. Those chips are the workhorses that train large models and serve answers to users, so tighter supply raises the cost of both training and inference. (the-decoder.com) Tom’s Hardware reported on April 11 that the Bureau of Industry and Security, the Commerce Department office that reviews export licenses, has lost nearly one-fifth of its licensing staff. Companies told the outlet that the staffing drop is slowing approvals for chip shipments to China under existing United States export controls. (tomshardware.com) At the same time, Nvidia is arguing that better software can offset some of the hardware squeeze. VideoCardz reported that Nvidia said an internal artificial intelligence model reduced an 80-person-month chip-design task to an overnight run on a single GPU, a claim the company presented as evidence that more engineering work can be automated. (videocardz.com) A GPU is a specialized processor built to do many calculations at once, which makes it useful for both graphics and artificial intelligence. The current bottleneck is not just making enough chips, but also packaging them with high-bandwidth memory and fitting them into servers, power systems, and data centers fast enough to meet demand. ( ) That constraint has been building for more than a year. Nvidia said in February 2025 that demand for Blackwell systems was “amazing,” and major cloud providers including Microsoft, Amazon, Google, and Meta have kept raising capital spending tied to artificial intelligence infrastructure. ( ) Export controls add a second pressure point. Since October 2022, Washington has tightened rules on advanced chips and chipmaking tools for China several times, and companies including Nvidia have redesigned products to stay within the rules before later restrictions narrowed those paths again. ( ) Nvidia has said repeatedly that restrictions on sales to China can hurt its business while also pushing Chinese customers toward domestic alternatives. United States officials have said the rules are meant to limit China’s access to advanced computing for military modernization. ( ) The result is a market where every available processor matters more. If GPUs are scarcer, approvals are slower, and software keeps making each chip more productive, the near-term contest in artificial intelligence shifts from who has the best model to who can secure, route, and fully use the compute they already have. ( )