Gcore to Offer NVIDIA AI Inference as a Service
Infrastructure provider Gcore announced the integration of NVIDIA's open-source Dynamo framework into its platform. The company will now deliver high-performance AI inference as a fully managed service, with one-click deployment of the framework across cloud and on-premises environments.
- AI inference, the process of running a trained AI model, is an ongoing operational cost that can grow significantly with user engagement, in contrast to the one-time capital expense of training the model. Services like Gcore's are designed to manage these recurring costs, making large-scale AI applications more financially predictable.
- The AI inference market is projected to grow to $169 billion by 2032, indicating a massive industry shift toward deploying and running AI models as a core business function.
- For fashion, this technology powers applications like AI-driven trend forecasting used by Zara, personalized styling recommendations pioneered by Stitch Fix, and the creation of AI-generated campaign imagery.
- The NVIDIA Dynamo framework is open-source software that optimizes how AI models run, specifically by reducing latency and efficiently scheduling tasks across multiple processors, which is critical for real-time customer interactions like virtual try-ons.
- Gcore's service leverages a global network of more than 180 edge nodes, which are data centers located closer to end users to ensure faster response times of under 30 milliseconds.
- This service is built on powerful, industry-leading NVIDIA hardware, including A100 and H100 Tensor Core GPUs, which are designed to handle the demanding computational loads of AI and machine learning tasks.
- In October 2025, Gcore launched a related product, the AI Cloud Stack, in partnership with VAST Data and Nokia, aimed at allowing other companies to build their own private AI clouds using Gcore's software and NVIDIA hardware.
- Key competitors in the broader cloud and AI infrastructure space include major players like Amazon Web Services (AWS), Google, Microsoft, and Cloudflare.