NPUs push edge AI into phones

- Qualcomm, Apple, Google, and Samsung have turned phone AI into a silicon race, pushing NPUs and on-device models as the selling point. - The hard numbers matter: Apple’s A18 Pro has a 16-core Neural Engine, Qualcomm touts a faster NPU, and GenAI phones topped 400 million shipments in 2025. - That shifts smartphone competition from megapixels alone to who can run useful AI locally, quickly, and without burning battery.

Smartphone AI has stopped being a cloud story. It’s turning into a chip story. The big change is the NPU — the part of the phone’s main processor built to run neural networks fast and with less power than a CPU or GPU. That matters because the features people actually notice now — photo cleanup, live translation, call summaries, scam detection, offline writing help — work best when they run on the phone itself, not after a round trip to a server. (qualcomm.com) ### What is an NPU, really? An NPU is basically the phone’s dedicated AI engine. Qualcomm’s Hexagon block, Apple’s Neural Engine, and the AI blocks inside Google Tensor and MediaTek Dimensity all do the same broad job — accelerate inference, which is the step where a trained model actually does something useful for you. The reason phone makers care is simple: local inference is(qualcomm.com). (qualcomm.com) ### Why is this suddenly a phone-buying feature? Because AI features have moved from demo bait to everyday utilities. Apple built the iPhone 16 Pro line around the A18 Pro and Apple Intelligence. Google has been pushing Gemini Nano on Pixel, including offline and low-latency features. Samsung has made Galaxy AI central to translation, writing, and photo tools, then wrapped that(qualcomm.com)eatures become habitual, the chip running them stops being invisible. (apple.com) ### Why not just do all this in the cloud? Because cloud AI is slower, costlier, and less dependable in the moments people actually use phones. If you want live call translation, scam detection during a call, or camera processing the instant you hit the shutter, latency kills the magic. The privacy angle matters too — keeping sensitive (apple.com) server. (store.google.com) ### So what are the companies actually competing on? Not just raw TOPS. TOPS is the easy number to market, but it’s only part of the story. The real contest is model efficiency, memory bandwidth, thermal limits, software tooling, and whether developers can actually access the accelerator cleanly. Qualcomm is selling a broader on-device AI stack through(store.google.com)le still wins points for tight hardware-software integration. Basically — the chip matters, but the stack matters more. (qualcomm.com) ### Why do cameras keep coming up? Because cameras are where AI feels concrete. Every flagship already has “good enough” raw sensors. The next jump comes from local image understanding — subject separation, denoising, generative edits, zoom cleanup, scene optimization, and video processing that happens without lag. Qualcomm is even pitching triple 20-bit ISPs and more dynamic r(qualcomm.com) race and the NPU race have become. (qualcomm.com) ### Is this only a flagship thing? Not anymore. Counterpoint expects GenAI-capable smartphones to top 400 million units in 2025, about one-third of global shipments, up from one in five in 2024. It also expects the next 500 million cumulative units to arrive much faster as lighter models and cheaper AI chipsets move downmarket. That’s the real market shift — edge AI is escaping the ultra-premium tier. (counterpointresearch.com) ### What’s the catch? A lot of “AI phone” marketing still outruns reality. Some features are partly cloud-backed. Some chips post huge benchmark numbers but don’t translate that into better apps. And battery remains the hard governor — a phone can’t feel smart if the AI drains it by lunch. The winners won’t be the brands with the loudest NPU claims. They’ll be the ones that make local AI feel instant, useful, and boringly reliable. (qualcomm.com) ### Bottom line Phones are becoming edge AI devices first and camera slabs second. The NPU is the part making that possible — but the real product is the whole on-device AI system wrapped around it. (qualcomm.com)

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.