GPU clouds are sold out but margin‑strained
Cloud GPU providers are reportedly 'completely sold out' even as they wrestle with tight margins, creating procurement pain for startups stuck balancing cost and capacity. That dynamic is driving more conversations about hybrid and on‑prem options. (x.com)
NVIDIA’s Q3 fiscal 2026 earnings release quoted CEO Jensen Huang saying “Blackwell sales are off the charts, and cloud GPUs are sold out.” (investor.nvidia.com) Industry coverage and analyst write-ups report the newest datacenter GPUs are sold out through 2026 with cloud-provider waitlists stretching into the next quarter. (digitaldigest.com) Internal cloud documents and reporting show renting out Nvidia-powered GPU capacity yields unusually thin gross margins for some providers, complicating aggressive capacity expansion. (msn.com) A proliferation of “GPU‑first” and specialty clouds is advertising steep price gaps — industry comparisons put some providers at roughly 50–70% lower cost than hyperscalers for comparable GPU instances. (gpu.fm) Cost math is pushing startups to re-evaluate: market analyses note example cloud bills such as ~$35,000/month for eight H100s and model on‑prem break‑even windows of roughly 7–12 months under sustained utilization. (introl.com) Consulting and vendor analyses say enterprises and growing AI teams are increasingly opting for hybrid architectures and on‑prem GPU clusters to control recurring cloud spend and avoid multi‑quarter procurement queues driven by HBM and advanced packaging constraints. (deloitte.com)