OpenAI ships GPT‑5.4

Published April 3, 2026 by The Daily Scout

OpenAI released GPT‑5.4 and related GPT‑5.1 endpoints to enterprise API customers, framing the models as enterprise‑grade tools for heavier automation and reasoning tasks. The stack brings native computer control, an up to 1‑million token context window, and embedded code/codex capabilities while claiming a roughly 33% reduction in hallucinations — plus “Instant” and “Thinking” API modes for speed vs. depth tradeoffs (help.openai.com). Analysts and explainers are already mapping the practical impacts on developer workflows and long multi‑step orchestrations (help.apiyi.com).

Why it matters

OpenAI rolled out GPT‑5.4 to its ChatGPT and API surfaces on March 5, 2026, positioning it as a professional‑grade model with built‑in ability to operate software, longer memory, and faster/deeper modes for different tasks. (openai.com) The new model can directly interact with applications and the desktop — for example, taking screenshots, driving a browser, and controlling mouse and keyboard to carry out multi‑step workflows — and it can hold roughly one million pieces of text in a single session, which is enough to load a full codebase or the equivalent of a 750‑page document. (openai.com) (datacamp.com) Technically, OpenAI exposes GPT‑5.4 in multiple variants (Standard, Thinking, and Pro) and documents a context capacity of about 1,050,000 tokens with a maximum single response output cap around 128,000 tokens; the “Thinking” variant can produce an explicit plan of its reasoning while it works so orchestration can be adjusted mid‑run. (developers.openai.com) (help.openai.com) The API pricing and session model contain concrete cost and performance tradeoffs: input beyond a 272,000‑token threshold is charged at a higher rate for the whole request, cached input tokens are billed at a lower cached rate, and regional processing carries a 10% uplift — all of which affect how long, stateful sessions should be architected. (developers.openai.com) (byteiota.com) Early third‑party evaluations and OpenAI’s materials both report measurable accuracy improvements: OpenAI says the family produces higher‑quality answers, and independent analyses are reporting roughly a one‑third drop in factual errors compared with prior GPT‑5 releases; those numbers are already shaping expectations for using the model as a component in mission‑critical automation. (openai.com) (help.apiyi.com) For platform engineering and product teams, the immediate operational implications are concrete: session and context management must account for a very large but price‑tiered context window (use streaming, state checkpoints, and cached inputs to control cost), computer‑control features require sandboxing and privileged‑access governance at the platform edge, and the new variants mean offering mode choices (speed vs. depth) in SDKs and developer tooling will matter for enterprise SLAs. (developers.openai.com) (datacamp.com) OpenAI also continued rolling out related GPT‑5.1 endpoints for coding and agentic tasks with configurable reasoning effort, and it has stated retirement timelines for some older models in ChatGPT that enterprise customers should track when planning migrations and custom‑GPT continuity. (developers.openai.com) (help.openai.com)

Key numbers

OpenAI released GPT‑5.4 and related GPT‑5.1 endpoints to enterprise API customers, framing the models as enterprise‑grade tools for heavier automation and reasoning tasks.
The stack brings native computer control, an up to 1‑million token context window, and embedded code/codex capabilities while claiming a roughly 33% reduction in hallucinations — plus “Instant” and “Thinking” API modes for speed vs.
OpenAI rolled out GPT‑5.4 to its ChatGPT and API surfaces on March 5, 2026, positioning it as a professional‑grade model with built‑in ability to operate software, longer memory, and faster/deeper modes for different tasks.

What happens next

depth) in SDKs and developer tooling will matter for enterprise SLAs.

Sources

Quick answers

What happened in OpenAI ships GPT‑5.4?

OpenAI released GPT‑5.4 and related GPT‑5.1 endpoints to enterprise API customers, framing the models as enterprise‑grade tools for heavier automation and reasoning tasks. The stack brings native computer control, an up to 1‑million token context window, and embedded code/codex capabilities while claiming a roughly 33% reduction in hallucinations — plus “Instant” and “Thinking” API modes for speed vs. depth tradeoffs (help.openai.com). Analysts and explainers are already mapping the practical impacts on developer workflows and long multi‑step orchestrations (help.apiyi.com).

Why does OpenAI ships GPT‑5.4 matter?

OpenAI rolled out GPT‑5.4 to its ChatGPT and API surfaces on March 5, 2026, positioning it as a professional‑grade model with built‑in ability to operate software, longer memory, and faster/deeper modes for different tasks. (openai.com) The new model can directly interact with applications and the desktop — for example, taking screenshots, driving a browser, and controlling mouse and keyboard to carry out multi‑step workflows — and it can hold roughly one million pieces of text in a single session, which is enough to load a full codebase or the equivalent of a 750‑page document. (openai.com) (datacamp.com) Technically, OpenAI exposes GPT‑5.4 in multiple variants (Standard, Thinking, and Pro) and documents a context capacity of about 1,050,000 tokens with a maximum single response output cap around 128,000 tokens; the “Thinking” variant can produce an explicit plan of its reasoning while it works so orchestration can be adjusted mid‑run. (developers.openai.com) (help.openai.com) The API pricing and session model contain concrete cost and performance tradeoffs: input beyond a 272,000‑token threshold is charged at a higher rate for the whole request, cached input tokens are billed at a lower cached rate, and regional processing carries a 10% uplift — all of which affect how long, stateful sessions should be architected. (developers.openai.com) (byteiota.com) Early third‑party evaluations and OpenAI’s materials both report measurable accuracy improvements: OpenAI says the family produces higher‑quality answers, and independent analyses are reporting roughly a one‑third drop in factual errors compared with prior GPT‑5 releases; those numbers are already shaping expectations for using the model as a component in mission‑critical automation. (openai.com) (help.apiyi.com) For platform engineering and product teams, the immediate operational implications are concrete: session and context management must account for a very large but price‑tiered context window (use streaming, state checkpoints, and cached inputs to control cost), computer‑control features require sandboxing and privileged‑access governance at the platform edge, and the new variants mean offering mode choices (speed vs. depth) in SDKs and developer tooling will matter for enterprise SLAs. (developers.openai.com) (datacamp.com) OpenAI also continued rolling out related GPT‑5.1 endpoints for coding and agentic tasks with configurable reasoning effort, and it has stated retirement timelines for some older models in ChatGPT that enterprise customers should track when planning migrations and custom‑GPT continuity. (developers.openai.com) (help.openai.com)