OpenAI Accuses DeepSeek of Model Distillation
OpenAI has accused rival AI firm DeepSeek of distilling proprietary U.S. models to improve its own offerings. The accusation highlights growing concerns around model security, intellectual property, and the competitive practices within the foundation model industry. The issue of model distillation, where a smaller model is trained on the outputs of a larger one, is a key point of contention.
- In a memo to the U.S. House Select Committee on China, OpenAI alleged that DeepSeek engaged in "free-riding" by using "new, obfuscated methods" to circumvent access restrictions and distill knowledge from U.S. foundation models. This included routing queries through third-party services to mask their origin. - The legal standing of this accusation is complex; while OpenAI's terms of service explicitly prohibit using their models to develop competing products, the copyrightability of AI-generated outputs is not well-established in U.S. law. This forces the core of the dispute into the realm of contract law rather than intellectual property infringement. - Some users have anecdotally reported that DeepSeek's model, when prompted about its identity, would claim to be a large language model trained by OpenAI, suggesting its training data was heavily influenced by ChatGPT's outputs. - The controversy highlights a significant business model threat for companies like OpenAI and Anthropic, who invest heavily in compute and charge for premium access. Chinese AI firms like DeepSeek often offer their models for free, and distillation offers a low-cost shortcut to competitive performance. - This accusation is unfolding while OpenAI itself faces multiple lawsuits from entities like The New York Times, alleging that its own models were trained on copyrighted material without permission, creating a complicated public narrative around data usage and intellectual property in AI. - OpenAI has also framed the issue as a matter of national security, warning lawmakers that safeguards and ethical filters built into U.S. models may not be transferred during the distillation process, potentially allowing for misuse in sensitive areas like biology or chemistry. - Model distillation is a widely used technique in machine learning for creating smaller, more efficient "student" models by training them on the outputs of a larger "teacher" model. The core of the issue is not the technique itself, but its application to proprietary models in violation of terms of service. - In response to the perceived threat, OpenAI has stated it is taking "aggressive, proactive countermeasures" to protect its models and is working with the U.S. government to safeguard American AI development.