OpenAI Debuts Real-Time Coding AI

OpenAI has introduced a new version of its Codex model capable of real-time coding. The model is reportedly powered by a new, dedicated AI chip, signaling a trend toward more specialized and high-performance AI infrastructure.

- The new model, named GPT-5.3-Codex-Spark, is a smaller, faster version of the larger GPT-5.3-Codex released earlier in the month and features a 128k context window. - The custom hardware is the "Wafer Scale Engine 3" (WSE-3) from Cerebras Systems, which boasts over 4 trillion transistors. This is the first milestone in a multi-year, $10 billion partnership between OpenAI and Cerebras, marking a significant move to diversify its AI hardware beyond Nvidia. - "Real-time" performance is quantified as the model's ability to generate over 1,000 tokens per second, enabling developers to interactively edit, reshape logic, and refine interfaces with near-instant feedback. - The original Codex, which powered the first version of GitHub Copilot, was introduced in 2021 but later deprecated. The name was revived for a new generation of more autonomous "agentic" AI systems that can handle entire software development tasks. - This release is part of a dual-mode strategy: Codex-Spark is optimized for rapid, low-latency collaboration, while the larger GPT-5.3-Codex model is designed for more complex, long-running tasks that require deeper reasoning. - An internal OpenAI team recently shipped a beta product where every line of code was generated by Codex agents, reportedly building it in one-tenth of the time it would have taken human engineers. - Beyond just writing code, the new model is designed to be a "daily productivity driver" for tasks across the entire software lifecycle, including debugging, deployment, monitoring systems, and writing documentation.

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.