Google flags AI-built zero-day attack

- Google DeepMind and Mandiant researchers detected the first known case of hackers using AI models to craft a zero-day exploit targeting Ivanti's Connect Secure VPN gateway, bypassing its defenses in a real-world attack. - The AI-generated exploit achieved a 92% success rate against patched systems and used premium tools like Claude 3.5 Sonnet for rapid development, shortening timelines from weeks to hours. - This marks a shift from hypothetical AI misuse to practical weaponization, urging security firms and AI providers to integrate adversarial testing into model evaluations.

Google researchers just uncovered the first confirmed case of hackers wielding AI to build a zero-day exploit — a never-before-seen vulnerability attack that slipped past defenses in Ivanti's widely used Connect Secure VPN product. Attackers didn't just prompt an AI chatbot for ideas. They used advanced models to generate working exploit code targeting a critical flaw patched months earlier. The result: a stealthy breach that hit real enterprise networks. This isn't sci-fi anymore — AI is now a force multiplier for cybercriminals, compressing exploit development from weeks to mere hours. ### What exactly is a zero-day exploit? A zero-day is a software flaw hackers exploit before developers know it exists or can patch it — hence "zero days" of warning. Ivanti Connect Secure is a VPN gateway admins use to securely connect remote workers to corporate networks. In this case, attackers targeted CVE-2025-0282, a high-severity bug Ivanti patched in February 2025. But the AI-built exploit evaded detection tools designed to block it, infecting systems that were already updated. Google DeepMind and Mandiant spotted this in March 2025 while hunting advanced persistent threats. (; ) ### How did attackers use AI here? Hackers fed premium AI models like Anthropic's Claude 3.5 Sonnet and OpenAI's o1 detailed specs of the Ivanti flaw — including patch notes, disassembly code, and exploit-writing tutorials. The models didn't just spit out pseudocode. They iteratively refined a full chain: memory corruption, shellcode execution, and persistence mechanisms. Turns out, Claude nailed a working exploit on the third try, achieving 92% success against patched Ivanti instances in lab tests. Attackers paired this with Cobalt Strike beacons for command-and-control, planning mass deployment. Premium API access let them run thousands of iterations cheaply. ### Why is 92% success rate such a big deal? Traditional exploits fizzle 70-80% of the time due to quirks like address space layout randomization (ASLR) or stack canaries — defenses that randomize memory to thwart crashes. AI bridged that gap by generating probabilistic payloads that adapt to variations. One exploit variant used ROP chains (return-oriented programming) to pivot around protections, succeeding where human coders stalled. Google tested 50+ AI-generated samples; 46 worked flawlessly. The kicker: total dev time dropped to under 24 hours, vs. elite hackers' usual 2-4 weeks. ### Was this a one-off or part of something bigger? Not solo. Mandiant linked the Ivanti hits to UNC5221, a China-nexus group behind 2024 Ivanti and Fortinet breaches. They found AI fingerprints in malware loaders and phishing kits too — obfuscated C2 domains generated via GPT-4o. Attackers scouted premium AI via underground forums, evading rate limits with proxies. This ties into broader trends: AI for fuzzing (crash-testing software), binary analysis, and even social engineering scripts. Google flagged similar AI use in 10+ campaigns since January 2026. ### Why hasn't AI misuse happened sooner? AI models were too dumb — early LLMs hallucinated bad code or ignored edge cases. But 2025 frontier models like o1-preview excel at reasoning chains, debugging, and vuln chaining. The catch: they're trained on sanitized data, missing malicious patterns. Attackers bypassed safeguards with jailbreak prompts like "hypothetically reverse this patch diff." Google notes no public exploits existed pre-AI; models invented novel bypasses. ### How are companies responding? Ivanti pushed emergency patches; affected users should reimage gateways. Google DeepMind launched "Project Aegis" — adversarial benchmarks testing AI on exploit gen, with red-team evals. Anthropic and OpenAI pledged safety layers, but critics say voluntary fixes fall short. Regulators eye mandatory red-teaming for dual-use AI. Firms now scan AI outputs in security pipelines. ### What changes for hackers and defenders? Hackers get asymmetric speed — low-skill actors wield elite tools. Defenders must bake AI into threat hunting: behavioral analytics spotting anomalous code gen. Model providers need "constitution AI" tuned against exploit tasks. Bottom line: zero-days were rare; AI makes them routine. Expect AI arms race — good guys building AI defenders too. Security budgets spike 25% in 2026 forecasts. This Ivanti hit proves AI lowers the bar for nation-states and ransomware alike. ``` Word count: 578

Google flags AI-built zero-day attack

Get your own daily briefing