Apple's device‑first AI push
Apple is leaning into “invisible,” device‑first AI—prioritizing on‑device, privacy‑focused intelligence rather than cloud‑heavy generative models, a strategy analysts say is deliberate and sustainable. The approach hints at new system‑level AI services and APIs that abstract capabilities away from explicit “AI” features and favor local inference. (macworld.com)
WWDC on June 9, 2025 opened developer access to Apple’s on‑device foundation model so third‑party apps can call the same local model powering Apple Intelligence. (apple.com/newsroom/2025/06/apple-intelligence-gets-even-more-powerful-with-new-capabilities-across-apple-devices/) The Foundation Models framework is shipped for iOS 26/iPadOS 26/macOS 26/visionOS 26 and exposes features such as tool calling, guided generation, streaming, and session‑based context management to apps. (developer.apple.com/documentation/foundationmodels) (youtube.com) Multiple reports describe Apple’s on‑device model as a compact, efficiency‑focused model in the ~3 billion‑parameter class for local inference, while larger reasoning tasks are routed to Apple’s cloud layer. (libertify.com/interactive-library/apple-intelligence-foundation-models-2025/) (beebom.com) Private Cloud Compute (PCC) hosts Apple’s larger server models on custom Apple silicon inside a “hardened” OS image and is designed so user data used for inference isn’t retained or accessible to Apple after the task completes. (security.apple.com/blog/private-cloud-compute/) (technologyreview.com) Apple’s silicon already powers local inference — the M4’s 16‑core Neural Engine was specified at 38 TOPS for iPad Pro devices, and coverage of Apple’s M5 roadmap highlights a multi‑fold neural‑performance uplift and dedicated neural accelerators per GPU core. (tomshardware.com/pc-components/cpus/apple-debuts-m4-processor-in-new-ipad-pros-with-38-trillion-operations-per-second-on-neural-engine/) (techradar.com) (9to5mac.com) Apple positioned the Foundation Models framework as an on‑device, low‑latency API with no per‑token inference billing and optimizations for energy and responsiveness, enabling offline workflows and tool automation inside apps. (adtmag.com/articles/2025/06/10/apple-launches-ondevice-ai-framework-and-tools.aspx) (macobserver.com) Apple reported an installed base exceeding 2.5 billion active devices in its January 29, 2026 earnings update, a scale that underpins wide distribution of on‑device AI across iPhone, iPad, Mac, and other platforms. (macrumors.com/2026/01/29/apple-2-5-billion-active-devices/)