Apple’s AI pivot: device first
Apple is retooling its AI strategy around hardware‑software‑App Store integration instead of chasing cloud model scale — offering rare AI team bonuses and reportedly leaning on third‑party ‘Extensions’ for Siri in iOS 27. The move signals Apple will monetize orchestration and on‑device intelligence rather than raw model leadership. ( )
Apple confirmed WWDC 2026 runs June 8–12 and the keynote on June 8 is the likely venue for the iOS 27 reveal. (apple.com) Reports say Apple issued out‑of‑cycle restricted stock bonuses worth “several hundred thousand dollars,” with outlets citing amounts up to $400,000 provided as RSUs that vest over four years. (macrumors.com) Sources close to the company told Bloomberg the awards targeted members of the iPhone Product Design organization as Apple seeks to retain hardware talent amid recruiting from AI device startups. (bloomberg.com) Apple’s own machine‑learning blog describes a roughly 3‑billion‑parameter on‑device language model alongside larger server‑based models served via its Private Cloud Compute infrastructure running on custom Apple silicon. (machinelearning.apple.com) Reporting from The Information, summarized by MacRumors, says Google gave Apple “complete access” to Gemini for distillation work so Apple can produce smaller, on‑device derivatives of Gemini for Siri and other features. (macrumors.com) MacObserver and multiple outlets say iOS 27 is being tested with a new Siri app and an “Extensions” system to let third‑party chatbots like Gemini, Claude and ChatGPT plug into Siri, with Apple positioning the App Store as the commercial path for third‑party subscriptions. (macobserver.com) Apple’s recent M5 silicon announcement lists a refreshed 16‑core Neural Engine and a near‑30% uplift in unified memory bandwidth to 153GB/s—specs Apple links directly to faster on‑device AI performance. (apple.com) Separate coverage notes Apple has begun shipping custom, Houston‑built server hardware to power Private Cloud Compute, confirming the company’s dual investment in edge silicon and its own server infrastructure for larger model workloads. (tomshardware.com)