OpenAI Faces Scrutiny Under California AI Safety Law

OpenAI's recently released GPT-5.3-Codex is facing allegations from a watchdog group of violating California’s new AI safety law. The complaint claims the high-risk coding model has insufficient safeguards, signaling increased regulatory pressure on AI labs to provide auditable documentation of data sourcing and validation.

- California's SB 1047, the "Safe and Secure Innovation for Frontier Artificial Intelligence Models Act," applies to models trained using over 10^26 floating-point operations (FLOPS) and costing more than $100 million to train. It mandates developers to implement a "full shutdown" capability, conduct annual third-party audits starting January 1, 2026, and report any "AI safety incidents" that risk mass casualties or over $500 million in damages within 72 hours. - Reinforcement Learning from Human Feedback (RLHF) is a core technique used by labs like OpenAI to align models, but sourcing high-quality human preference data is a significant operational expense and bottleneck. The process involves training a separate reward model on human-ranked outputs, which then guides the main model's behavior through reinforcement learning. - Anthropic's Constitutional AI is an alternative alignment method that reduces reliance on extensive human labeling. The model learns to critique and revise its own outputs based on a predefined set of principles (a "constitution"), a process called Reinforcement Learning from AI Feedback (RLAIF), which Anthropic claims is more efficient and transparent than standard RLHF. - Evaluating agentic AI, which can use tools and make multi-step decisions, requires new methods beyond traditional metrics. Frameworks like Microsoft's Azure AI Evaluation library now include agent-specific metrics such as Task Adherence (did the agent solve the right problem?) and Tool Call Accuracy (did it use its tools correctly?). Benchmarks like GAIA and ToolBench are emerging to test for general intelligence and correct tool usage. - For tasks requiring nuanced understanding, cultural context, or domain-specific expertise, human-labeled data consistently outperforms synthetic data. While synthetic data offers scalability and can be generated much faster, models trained on it can underperform by 12-18% on complex reasoning tasks compared to those trained on human-labeled data. A hybrid approach is often optimal, using synthetic data for scale and human data for refining critical or subjective capabilities. - The fundraising climate for AI infrastructure startups remains strong, with AI companies raising a third of all venture capital in 2024. At the seed stage, AI startups saw a median pre-money valuation of $17.9 million, 42% higher than non-AI companies. However, investors are increasingly scrutinizing claims of "AI-washing" and expect founders to articulate clearly how AI enhances economic value rather than just using it as a buzzword. - Go-to-market strategies for AI startups are shifting towards hyper-targeted, outbound sales motions, with 86% of startups focusing strategic efforts there. Startups using AI in their GTM strategies report achieving success 2.3 times faster and raising 15-20% more funding than those with traditional approaches. For early-stage B2B sales to technical buyers, investors prioritize founders who have deep domain expertise and have "lived the problem" they are solving. - The future of work in data annotation will likely involve a workforce skilled in providing more nuanced, subjective feedback to train models on qualities like tone and empathy, areas where synthetic data falls short. As AI takes over more routine labeling, the human workforce's value shifts to handling ambiguity, mitigating bias, and providing the "ground truth" for complex, high-stakes applications like medical or legal analysis.

OpenAI Faces Scrutiny Under California AI Safety Law

Get your own daily briefing