OpenAI Launches GPT-5.4 with 1M Token Context
OpenAI just released GPT-5.4, a major upgrade featuring a 1 million token context window and native computer control via its API. The new model is also more efficient, using 47% fewer tokens in tool-heavy workflows and making 33% fewer errors. Developers are already noting some breaking changes, particularly with thread compatibility in older apps.
The jump to a 1 million token context window is a massive leap from previous models like GPT-4 Turbo, which offered a 128,000 token window, and the original GPT-4 that maxed out at 32,768 tokens. This allows developers to process the equivalent of an entire novel or a complete codebase in a single prompt, a task that previously required complex chunking and text management. Native computer control is a paradigm shift, moving beyond structured APIs to allow the model to interact with graphical user interfaces (GUIs) directly. This "computer use" capability enables automation of legacy systems and multi-application workflows by having the AI see the screen and operate a mouse and keyboard, just as a human would. The introduction of model variants like "Standard," "Thinking," and "Pro" allows developers to tailor their API calls to specific needs. This follows a trend of offering specialized models, providing a balance between peak performance for complex reasoning and cost-efficiency for more general-purpose tasks. Pricing for the standard GPT-5.4 model is set at $2.50 per million input tokens and $15.00 per million output tokens. However, OpenAI is offering significant discounts for cached inputs, which could lower costs by as much as 90%, a crucial factor for developers building applications that repeatedly reference the same large documents. The reported 33% reduction in single-claim errors is a direct response to enterprise demand for higher reliability in high-stakes fields like finance and law. For developers, this translates to more trustworthy outputs and less need to build complex error-checking and validation layers around the API. While the feature set is expanding, the API is also introducing complexities. The mention of breaking changes to thread compatibility points to shifts in how conversational state is managed. This is a critical detail for developers maintaining applications with long-running, multi-turn interactions, as it may require significant refactoring to avoid disrupting user experience.