Apple Pursues Hybrid AI Strategy with Google Gemini
Apple is deepening its commitment to on-device AI by leveraging the unified memory and neural engines in Apple Silicon for privacy and performance. However, the company is also pursuing a hybrid approach, reportedly integrating Google's Gemini AI for certain cloud-augmented features in upcoming Siri enhancements. This strategy aims to balance the benefits of local processing with the power of larger, cloud-based models from competitors.
- The multi-year collaboration with Google provides Apple access to a 1.2 trillion-parameter Gemini model, internally known as "Apple Foundation Models Version 10," which has been modified to run on Apple's servers. A more powerful version is expected to follow, which may run on Google's infrastructure. - Siri's overhaul is planned in two main stages: an initial update, expected in iOS 26.4, will introduce better understanding of on-screen context and personal data, while a more significant chatbot-style assistant, internally codenamed "Campo," is slated for iOS 27 and will leverage the more advanced custom Gemini models. - To handle cloud-based tasks without compromising privacy, Apple developed "Private Cloud Compute," a system that uses data centers running entirely on custom Apple Silicon servers. This vertical integration strategy avoids reliance on third-party cloud infrastructure and chip vendors for server-side AI processing. - Apple's on-device AI relies on its own smaller, ~3 billion parameter language model, which is optimized for efficiency on Apple Silicon's Neural Engine and handles tasks directly on the device. The system is designed to escalate more complex queries to the larger server-based models from Apple or its partners. - The pivot towards partnerships follows internal challenges; Apple was reportedly caught off guard by the rapid advancement of generative AI with ChatGPT's release, forcing a refocus of its own long-running internal LLM project, codenamed "Ajax." - Apple is opening its on-device models to third-party developers through a "Foundation Models" framework, allowing them to build AI features directly into their apps. This strategy aims to deepen the integration of AI within the Apple ecosystem and leverage its developer community.