OpenAI Debuts Real-Time Coding AI
OpenAI has introduced a new version of its Codex model capable of real-time coding. The model is reportedly powered by a new, dedicated AI chip, signaling a trend toward more specialized and high-performance AI infrastructure.
- The new model, named GPT-5.3-Codex-Spark, is a smaller, faster version of the larger GPT-5.3-Codex released earlier in the month and features a 128k context window. - The custom hardware is the "Wafer Scale Engine 3" (WSE-3) from Cerebras Systems, which boasts over 4 trillion transistors. This is the first milestone in a multi-year, $10 billion partnership between OpenAI and Cerebras, marking a significant move to diversify its AI hardware beyond Nvidia. - "Real-time" performance is quantified as the model's ability to generate over 1,000 tokens per second, enabling developers to interactively edit, reshape logic, and refine interfaces with near-instant feedback. - The original Codex, which powered the first version of GitHub Copilot, was introduced in 2021 but later deprecated. The name was revived for a new generation of more autonomous "agentic" AI systems that can handle entire software development tasks. - This release is part of a dual-mode strategy: Codex-Spark is optimized for rapid, low-latency collaboration, while the larger GPT-5.3-Codex model is designed for more complex, long-running tasks that require deeper reasoning. - An internal OpenAI team recently shipped a beta product where every line of code was generated by Codex agents, reportedly building it in one-tenth of the time it would have taken human engineers. - Beyond just writing code, the new model is designed to be a "daily productivity driver" for tasks across the entire software lifecycle, including debugging, deployment, monitoring systems, and writing documentation.