Apple's On-Device AI Poses Nvidia Challenge

Apple's integrated strategy of on-device AI, custom silicon, and tight hardware-software integration could position it as a major competitor to Nvidia, particularly if scaled to data centers. The M-series Neural Engine already delivers over 15 TOPS, enabling high-quality, whisper-class models to run locally with high privacy. This on-device power is a key part of Apple's competitive moat.

Apple's Neural Engine performance has seen exponential growth, starting at 11 trillion operations per second (TOPS) in the M1, increasing to nearly 16 TOPS in the M2, and jumping to roughly 133 TOPS with the M5 chip. This leap in the M5 is attributed to new dedicated Neural Accelerators within each GPU core, enabling the graphics hardware to directly handle AI workloads. While competitors are projected to spend a combined $700 billion on capex in 2026, much of it on AI data centers, Apple's planned expenditure is a fraction of that at around $14 billion. The company is increasing investment in its own private cloud infrastructure, powered by custom Apple Silicon under initiatives codenamed Project ACDC (Apple Chips in Data Centers). This follows a capital-efficient model, aiming to avoid the massive costs of cloud inference for its billion-plus user base. For model training, Apple has strategically used Google's Tensor Processing Units (TPUs) instead of relying solely on Nvidia, employing large clusters of both TPUv5p and TPUv4 chips. This hybrid approach, combining in-house server silicon with third-party resources, allows Apple to diversify its hardware strategy while avoiding dependency on a single provider for large-scale model development. The on-device execution is managed by frameworks like Core ML, which is optimized to leverage the CPU, GPU, and Neural Engine for maximum performance with minimal power consumption. There are indications of a shift toward a new "Core AI" framework, intended to better integrate third-party AI models and emphasize generative capabilities directly within developers' apps. This focus on custom silicon extends into Apple's supply chain, where AI and predictive analytics are used to optimize demand forecasting, inventory management, and logistics. However, the industry's center of gravity is shifting; suppliers like TSMC and Foxconn now see greater revenue and margin growth from AI server clients like Nvidia than from consumer electronics. This creates new competition for critical components, altering a supply chain dynamic Apple once dominated.

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.