Apple Intelligence Criticized for 'Hallucinated' Outputs
Apple's new "Apple Intelligence" suite is reportedly facing scrutiny after its generative models produced hallucinated and stereotyped outputs on millions of devices. The unwanted content appeared even for users who did not actively engage the feature, highlighting ongoing challenges in model guardrailing and bias mitigation for large-scale consumer AI deployments.
- An independent analysis of over 10,000 AI-generated summaries found systematic bias; in scenarios with ambiguous pronouns, the system assigned a specific gender 77% of the time, with two-thirds of these assignments aligning with common stereotypes. - The system's architecture is a hybrid model, defaulting to a ~3-billion-parameter model on-device for speed and privacy. More complex tasks are offloaded to "Private Cloud Compute" (PCC), a network of servers running Apple Silicon. - PCC is designed to be stateless, processing requests ephemerally in memory without storing user data after the task is complete, a core element of Apple's strategy to align with privacy mandates like GDPR's "data protection by design." - Apple's guardrailing strategy includes discoverable plaintext JSON files in macOS betas that contain explicit instructions for the AI, such as "Do not make up factual information," to guide its responses for specific features. - The rollout of Apple Intelligence in the EU has been delayed, with the company citing uncertainties regarding compliance with the Digital Markets Act (DMA), which imposes strict obligations on designated "gatekeeper" platforms. - Before the widespread reports of stereotyping, the notification summary feature had already generated and attributed fake news headlines to sources like the BBC, leading Apple to disable the feature for news apps in an iOS 18.3 beta. - For developers, the safety framework is exposed through the Foundation Models API, which can throw a `guardrailViolation` error when the system detects sensitive content in either an input or an output, requiring applications to handle this exception. - Apple Machine Learning Research has published work on "Disentangled Safety Adapters" (DSA), a framework intended to decouple safety-specific calculations from the main model to improve the efficiency and accuracy of tasks like hallucination detection.