Google Cloud Rolls Out HPC VMs

Google Cloud has released its H4D virtual machines, which are optimized for high-performance computing (HPC) workloads. The move gives startups working on AI, complex simulations, or heavy data science on-demand access to near-supercomputer-level resources.

The H4D instances are built around AMD's 5th Gen "Turin" EPYC processors, packing 192 single-threaded vCPUs into a single virtual machine. This high core density is paired with up to 1,488 GB of DDR5 memory and is specifically optimized for tightly-coupled workloads where processes need to communicate frequently. Performance benchmarks show significant generational leaps. Compared to Google's previous C3D instances, the H4D offers 1.8 times higher performance per VM on the High-Performance Linpack (HPL) benchmark. Against the older C2D generation, the performance jump is even more dramatic, with specific applications like the GROMACS molecular dynamics simulation running up to 5.8 times faster. A key enabler for this performance is the use of Google's custom Titanium offload processors. This is the first CPU-based VM in Google's lineup to leverage Titanium for Cloud Remote Direct Memory Access (RDMA), which provides high-bandwidth, low-latency networking at up to 200 Gbps, minimizing communication bottlenecks between nodes. These VMs are aimed squarely at workloads like computational fluid dynamics, weather forecasting, genomics, and complex financial modeling. For instance, in manufacturing simulations using Ansys Fluent, the H4D delivered a 4.1x speedup over previous generation C2D virtual machines. For managing these large-scale jobs, H4D integrates with Google Kubernetes Engine (GKE) and the cloud-native Batch service for scheduling. It also supports the Dynamic Workload Scheduler, allowing startups to procure capacity for as low as 3 cents per core-hour without long-term commitments, offering a flexible consumption model for bursty workloads. This move places Google in direct competition with other major cloud providers like AWS and Microsoft Azure, which also offer specialized HPC instances. While AWS has touted networking speeds of up to 400 Gbps on some instances, Google's use of its custom Titanium offload processors for RDMA is its strategic play to optimize for both performance and cost-efficiency in large-scale scientific and engineering computations.

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.