Apple Deploys M5 Chips for Private Cloud Compute

Apple is now using its M5-series chips in the Private Cloud Compute servers that power its "Apple Intelligence" features. This infrastructure, revealed in recent software releases, is designed to process sensitive user queries in a secure, Apple-controlled environment, reinforcing the company's focus on privacy-by-design for its AI services.

- The M5 chip is built on a third-generation 3-nanometer process and features a 10-core CPU, a 10-core GPU, and a 16-core Neural Engine. Apple claims the CPU delivers up to 15% faster multithreaded performance and over 4x the peak GPU compute performance for AI compared to the M4 chip. - This move represents a significant upgrade, as Private Cloud Compute servers have historically used M2 Ultra chips, with Apple largely skipping the M3 and M4 generations for this infrastructure. - The Private Cloud Compute architecture is designed to be "stateless," meaning user data is only used to fulfill a request, is never stored or made accessible to Apple, and is deleted after processing. Apple allows independent experts to inspect the server code to verify this privacy promise. - The hardware upgrade is accompanied by a new "Private Cloud Compute Agent Worker" software component. This system runs a specialized version of iOS with a new "agentic architecture" designed to serve AI requests, and code to interface with it is present in iOS 26.4. - These M5-powered servers are being assembled at a factory in Houston, Texas, as part of Apple's $600 billion investment in domestic infrastructure. - This M5 deployment is considered an intermediate step before Apple deploys fully custom-designed AI server chips. Analyst Ming-Chi Kuo reports that those dedicated chips are slated for mass production in the second half of 2026 for deployment in 2027.

Apple Deploys M5 Chips for Private Cloud Compute

Get your own daily briefing