AI legal fight is widening
Publishers’ litigation over training data is spilling into access to long‑running archives like the Internet Archive, and major AI firms are funding think‑tanks and policy papers as public approval softens. The debate is also extending into new legal territory such as whether interactions with health chatbots could receive special confidentiality protections in court. (San Francisco Examiner, The Guardian, (mashable.com))
The fight over artificial intelligence training data is spreading beyond copyright lawsuits into web archives, Washington policy shops, and even courtroom rules for chatbot conversations. (sfexaminer.com) News publishers have started limiting what the Internet Archive can capture or expose after concluding that archived pages could become an easy back door for artificial intelligence crawlers. Nieman Journalism Lab reported on January 28 that The Guardian restricted its article pages in the Wayback Machine’s URLs interface and excluded itself from the Archive’s application programming interfaces. (niemanlab.org) The Internet Archive says it is preserving history, not building chatbots. The nonprofit’s site says it offers access to more than 624 billion archived web pages, and the Electronic Frontier Foundation said on March 16 that the Wayback Machine now holds more than one trillion archived pages used by journalists, researchers, and courts. (archive.org, eff.org) That archive fight grew out of a broader publisher backlash. The San Francisco Examiner reported on April 12 that rights holders have already beaten the Internet Archive in a book-publishers case and that the nonprofit also settled with major record labels last year, leaving it exposed as publishers harden defenses against artificial intelligence scraping. (sfexaminer.com) At the same time, major artificial intelligence companies are trying to shape the policy debate themselves. OpenAI published a 13-page paper on April 6 called *Industrial Policy for the Intelligence Age* and said it will pair the effort with research grants of up to $100,000, up to $1 million in application programming interface credits, and a new OpenAI Workshop opening in Washington in May. (openai.com, cdn.openai.com) Anthropic made a parallel move on March 11, announcing the Anthropic Institute under co-founder Jack Clark. The company said the institute will study jobs, economics, forecasting, and how powerful artificial intelligence could interact with the legal system. (anthropic.com) Those moves come as public skepticism remains high in the United States. Pew Research Center said on March 12 that, in a June 2025 survey, 50 percent of U.S. adults said wider artificial intelligence use made them feel more concerned than excited, while 10 percent said they were more excited than concerned. (pewresearch.org) The legal frontier is widening again in health care chatbots. Mashable reported on April 11 that lawyers are debating whether conversations with artificial intelligence health bots could someday receive a court shield similar to attorney-client or therapist-patient privilege, even though those protections are defined today by existing state rules and Federal Rule of Evidence 501. (mashable.com) That debate is not only about user privacy. Mashable reported that legal experts said stronger privilege for chatbot conversations could also keep company chat logs out of discovery in lawsuits, a point raised as OpenAI has faced court demands to preserve and potentially hand over user chats. (mashable.com) So the artificial intelligence legal fight now has at least three fronts at once: who can copy the internet, who gets to write the rules, and what users can expect to stay confidential. The next court rulings may decide less about one chatbot than about the record of the web itself. (sfexaminer.com, mashable.com, openai.com)