Zero Latency launches Blackwell edge

- Zero Latency said on May 11 it has standardized its U.S. edge AI network on Red Hat AI Factory with NVIDIA Blackwell GPUs. - The system, called Zerogrid, pushes inference into local industrial hubs to cut the “latency tax” for millisecond-sensitive automation and transactions. - It matters because AI compute is starting to spread beyond giant campuses into distributed sites, with operations and cooling now part of the product.

AI infrastructure usually means giant data centers in remote places. But that model breaks when the job needs an answer in milliseconds, not after a round trip to a faraway cloud region. That is the gap Zero Latency is trying to fill. On May 11, at Red Hat Summit, the company said it has built its U.S.-wide network on Red Hat AI Factory with NVIDIA and is using NVIDIA Blackwell GPUs in edge nodes it calls Zerogrid. ### What actually launched? Not a new chip, and not a consumer product. Zero Latency — branded as 0.lat — launched a distributed AI inference network that places GPU capacity in decentralized edge data centers near industrial centers around the U.S., then routes workloads across them with a common Kubernetes-based control layer. Red Hat is the orchestration layer. NVIDIA Blackwell is the compute layer. (redhat.com) ### Why put GPUs at the edge? Because some AI jobs are allergic to delay. Industrial automation, machine vision, and real-time transactions can need responses on millisecond timescales. If the model sits in a distant hyperscale region, network travel becomes part of the problem. Zero Latency’s whole pitch is that centralized cloud creates a “latency tax,” and local inference can remove enough of that delay to make these use cases practical. (redhat.com) ### Why does Blackwell matter here? Blackwell is NVIDIA’s current architecture for high-performance AI workloads, and NVIDIA is pitching it as a step up in performance and efficiency for generative AI and accelerated computing. In plain English, that means the edge nodes are not lightweight toys — they are serious inference boxes. That matters because edge compute used to imply compromise. The new version is closer to “small, local AI factory” than “weaker mini-server.” (redhat.com) ### So is this just another cloud? Basically, it is a neocloud pitch. Instead of one giant pool of GPUs in one place, the company aggregates many smaller pools spread across locations and dispatches jobs to the right site. The trick is making that feel like one network to customers. That is where Red Hat’s cluster management matters — one workflow, many physical locations. (nvidia.com) ### Why is this showing up now? Because AI demand has collided with power, land, and latency limits. One response is to build ever bigger campuses. Another is to distribute compute physically closer to users and data. You can see the same logic in a much stranger place too — Ars Technica reported today on SPAN’s plan to test mini data-center units at homes, with a 100-home trial this year using liquid-cooled NVIDIA Blackwell-based systems. That is a more radical version of the same decentralization trend. (redhat.com) ### Does this change the job mix? Yes — a bit. If compute spreads out, the work spreads out too. You still need software people, but you also need people who can deploy hardware in many places, run edge operations, manage fleets, and deal with heat and power in tighter footprints. The catch is that distributed infrastructure is operationally messier than one big building. But that operational mess is increasingly the product. This last point is an inference from how these systems are being built and marketed. (arstechnica.com) ### What is the real takeaway? Zero Latency’s announcement is a sign that AI infrastructure is splitting into two layers. One layer stays in giant centralized campuses. The other moves outward — into factories, local hubs, and maybe even neighborhoods. If your workload cares about milliseconds, data locality, or both, the edge layer is where the action is moving. (redhat.com)

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.