Pentagon Pressures Anthropic on AI Safety
The U.S. Department of Defense is in a standoff with AI developer Anthropic over military applications for the company's technology. The Pentagon has reportedly pressured the $380 billion firm to remove certain safety guardrails from its advanced AI models. This conflict highlights the growing tension between national security imperatives and the ethical boundaries of using AI in warfare.
The dispute centers on two specific "red lines" drawn by Anthropic: a ban on using its AI for the mass surveillance of U.S. citizens and a prohibition on its integration into fully autonomous weapons systems that operate without human control. The Pentagon is demanding the removal of these restrictions, insisting on the ability to use the technology for "any lawful use." The Department of Defense gave Anthropic a deadline of 5:01 p.m. on Friday, February 27, to comply with its demands. Officials have threatened to invoke the Defense Production Act to force compliance, cancel contracts worth up to $200 million, and designate the company a "supply chain risk"—a label typically reserved for foreign adversaries. In a public statement, Anthropic CEO Dario Amodei said the company "cannot in good conscience accede to their request." He argued that current AI systems are not yet reliable enough to power lethal weapons without human oversight and that mass domestic surveillance is "incompatible with democratic values." Pentagon officials, including Chief Technology Officer Emil Michael, argue that the military must be trusted to do the right thing and that a private company should not dictate national security policy. The department maintains it has no interest in mass domestic surveillance or fully autonomous weapons but needs to be prepared for adversaries' advancements in AI. The conflict puts Anthropic's foundational principles to the test. The company was established in 2021 by former OpenAI employees who left over safety concerns, and it has consistently marketed itself as a "safety-first" AI lab. Discussions between the two parties have reportedly included high-stakes hypothetical scenarios, such as using AI to counter a nuclear missile attack on the United States. Coinciding with the dispute, Anthropic recently relaxed a key safety policy that pledged to halt AI development if it was deemed potentially dangerous, though the company stated the change was unrelated to the Pentagon's pressure.