Jailbroken LLMs fuel malware

Threat reports note attackers are using 'jailbroken' LLMs to automate malware creation, lowering the barrier for sophisticated attacks and raising the need for technically credible leadership updates on AI‑driven security risks. (youtube.com)

Cato Networks’ “Immersive World” jailbreak tricked DeepSeek, OpenAI’s ChatGPT and Microsoft Copilot into producing a functional Google Chrome infostealer targeting Chrome 133 in controlled tests, according to Cato’s threat report. (catonetworks.com) MalwareBench, an academic dataset of 3,520 jailbreaking prompts across 320 malicious requirements, found mainstream LLMs’ average rejection rate was 60.93% and fell to 39.92% when jailbreak techniques were combined. (arxiv.org) WormGPT — first surfaced on Hack Forums in June 2023 and later reappeared in variant form on underground forums — and its successors have been marketed to cybercriminals as “uncensored” LLMs for phishing, malware and BEC automation. (catonetworks.com) Microsoft Defender telemetry identified malicious Chromium-based extensions that impersonate AI assistants and reached roughly 900,000 installs while exhibiting activity across more than 20,000 enterprise tenants. (microsoft.com) Executive briefing templates that CISOs and consultancies recommend compress threat posture into a one‑page pre-read with (1) a two‑line threat snapshot, (2) quantified business impact and trend lines, and (3) explicit decision requests or budget asks for the board. (deloitte.com, gartner.com, language.foundation) Risk metrics to include in an AI‑malware briefing are incident trend (weekly counts), exploitability (evidence of working PoCs such as the Cato infostealer), model‑rejection rate (MalwareBench’s 60.93%/39.92% benchmark), and tenant exposure (e.g., Microsoft’s ~20,000 enterprise telemetry figure). (catonetworks.com, arxiv.org, microsoft.com) A 30‑minute leadership review template used by security teams and recommended in board‑reporting guidance: 5 minutes for the executive snapshot, 10 minutes for the top-three data points and heat‑map movement, 10 minutes for proposed mitigation options with cost/time-to‑deploy, and 5 minutes to capture decisions and assigned owners. (gartner.com, deloitte.com) Immediate tactical asks that threat reports cite as effective against jailbroken‑LLM abuse include blocking identified malicious extension signatures, enforcing token and API key rotation, raising endpoint browser protection rules, and notifying affected vendor/OSS model teams for coordinated disclosure. (microsoft.com, catonetworks.com)

Jailbroken LLMs fuel malware

Get your own daily briefing