AI Model Finds Major Firefox Bugs

Anthropic's Claude Opus 4.6 AI model, in partnership with Mozilla, uncovered 22 vulnerabilities in the Firefox web browser in just two weeks. Fourteen of the bugs were rated as high-severity, representing 20% of all high-severity fixes Mozilla has planned for 2025, showcasing AI's growing power in cybersecurity.

This collaboration was born from a need for a more challenging test for Anthropic's AI. After its previous model, Opus 4.5, nearly mastered a standard cybersecurity benchmark, researchers sought a more complex and heavily-scrutinized target, selecting Firefox's open-source codebase. The initial goal was to see if the AI could reproduce known, historical vulnerabilities before they tasked it with finding entirely new bugs. Within just twenty minutes of analyzing Firefox's JavaScript engine, Claude Opus 4.6 identified a severe memory corruption flaw known as a Use-After-Free vulnerability. This initial success prompted a wider scan of nearly 6,000 C++ files, ultimately uncovering classes of logic errors that decades of traditional automated testing, known as fuzzing, had failed to catch. Mozilla received 112 unique crash reports in total from the effort. The early detection of such flaws represents a significant economic advantage, as the cost to fix a software bug after a product's release can be up to 100 times more than fixing it during the design phase. The total cost of poor software quality in the U.S. has been estimated at $2.41 trillion annually, a figure that includes everything from cybersecurity failures to operational glitches and unsuccessful projects. This partnership is a high-profile example of a rapidly growing market. The global AI in cybersecurity market was valued at over $25 billion in 2024 and is projected to grow at a compound annual growth rate of over 24%, reaching nearly $94 billion by 2030. This growth is driven by the increasing complexity of cyber threats that outpace the capabilities of traditional, rule-based defense systems. Beyond just identifying bugs, Anthropic tested the AI's ability to create functional exploits for them. In an experiment costing approximately $4,000 in API credits, Claude Opus 4.6 succeeded in only two out of several hundred attempts, creating crude exploits that worked only in a test environment with key security features disabled. This highlights that, for now, finding vulnerabilities is significantly cheaper and more reliable for AI than exploiting them. Following the project's success, Mozilla has announced it will begin integrating AI-assisted analysis into its internal security workflows. This move signals a broader industry shift, where AI is used not just as a one-time research tool but as a continuous part of the software development lifecycle to proactively harden defenses.

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.