Apple Intelligence prompt‑injection

Researchers found a way to trick Apple’s on‑device AI into executing attacker‑controlled prompts, undermining the privacy and trust benefits Apple advertises for local models. The weakness let researchers bypass protections and could have exposed sensitive actions or data before Apple shipped fixes in recent OS updates. The vulnerability was reported across multiple outlets and Apple’s protections in iOS 26.4/macOS 26.4 were noted as the corrective step. (appleinsider.com) (9to5mac.com) (theregister.com)

Apple built Apple Intelligence so some requests run on your iPhone or Mac instead of being sent to a cloud server, which is supposed to keep private data closer to the device. Researchers now say that same local model could be tricked into following attacker-written instructions instead of Apple’s own safety rules. (rsaconference.com) (9to5mac.com) A prompt injection attack is the artificial intelligence version of hiding a fake note inside a real instruction manual. The model sees text that looks meaningless to a person, but treats it like a secret command and changes its behavior. (securityweek.com) (rsaconference.com) The RSAC researchers said they combined two methods. One was “Neural Execs,” which uses reusable nonsense-looking trigger text, and the other was Unicode manipulation, which swaps in lookalike characters to slip past filters that check what goes in and what comes out. (securityweek.com) (rsaconference.com) Apple had multiple layers in front of the model. The researchers said their attack got around input filters, output filters, and internal guardrails on the local large language model, which is the text engine inside Apple Intelligence. (rsaconference.com) (appleinsider.com) In demos described by the researchers and reported by outlets, they could force the model to produce attacker-directed results, including profanity that Apple normally blocks. The point was not the bad words themselves; it was that the model was obeying the attacker’s hidden prompt instead of Apple’s policy. (theregister.com) (9to5mac.com) The more serious risk was not rude text but connected actions. RSAC said the local model is exposed to third-party apps through system application programming interfaces, which means a successful prompt injection could matter anywhere developers rely on Apple Intelligence to summarize, rewrite, classify, or handle sensitive app content. (rsaconference.com) (appleinsider.com) The researchers said they disclosed the issue to Apple on October 15, 2025. Apple’s fixes were then noted in iOS 26.4 and macOS 26.4, the software updates cited by RSAC and follow-up coverage as the corrective step. (apfelpatient.de) (9to5mac.com) This lands in an awkward spot for Apple’s pitch. Running a model on-device can reduce exposure to outside servers, but it does not stop the model from being fooled by hostile inputs sitting on the device itself. (rsaconference.com) (securityweek.com) RSAC estimated there were more than 200 million Apple Intelligence-capable devices in use by December 2025, which is why a bug in a local model is still a mass-scale security story. No public reports of criminal exploitation were cited in the coverage, but the research shows that “private on your device” and “safe from manipulation” are two different promises. (cyberwebspider.com) (securityweek.com)

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.