Apple's Cloud AI Strategy in Flux

Apple is reportedly in talks to use Google servers for its upgraded AI Siri, a move to balance complex workloads with privacy. Meanwhile, some of Apple's own AI servers are said to be sitting unused due to low adoption of Apple Intelligence, highlighting significant uncertainty in its cloud infrastructure strategy.

The multi-year partnership with Google to integrate Gemini models is a strategic pivot, aiming to enhance Siri and other AI features after its own development faced delays. This move outsources the foundational reasoning layer to Google, allowing Apple to focus on user experience and privacy while trying to catch up with competitors in the AI race. This hybrid cloud approach, combining Apple's private cloud with services from Google and AWS, is a significant shift from Apple's traditional strategy of controlling every layer of its stack. The decision is influenced by a need to manage the massive capital expenditure required to build out a competitive AI infrastructure independently. Google is reportedly setting up dedicated servers in its data centers that must comply with Apple's stringent privacy standards. Contradictory reports paint a confusing picture of Apple Intelligence adoption. A Morgan Stanley survey from March 2025 indicated stronger-than-expected engagement, with nearly 80% of eligible U.S. iPhone owners having used the features. However, other surveys from late 2024 and mid-2025 suggest a majority of users find the new AI features add little value, prioritizing battery life and storage over AI capabilities when upgrading. The underutilization of Apple's own AI servers, powered by M2 Ultra chips, is a direct consequence of these adoption challenges and delays in deploying a next-generation Siri. The custom chips, designed for consumer devices, are reportedly not optimized for the large-scale AI workflows required for advanced models like Gemini, further complicating the in-house strategy. This infrastructure pivot comes as Apple reorganizes its AI leadership, with Amar Subramanya now reporting to software chief Craig Federighi, who had previously vetoed using Google Cloud over privacy concerns. The company is also quietly acquiring smaller AI startups like WhyLabs and Pointable AI to bring more specialized talent and technology in-house. While the Google partnership is a significant step, Apple continues to develop its own server-class AI chips, though large-scale deployment isn't expected until 2027. This long-term silicon roadmap aims to eventually reduce reliance on external providers by creating hardware specifically tailored for neural network workloads and inference at a massive scale, aligning chip design with software and security needs.

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.