GPT‑5.4 Mini Rolls Out
OpenAI has rolled out a GPT‑5.4 mini variant, a quiet product change that shows the GPT family is evolving via incremental model variants rather than a single big launch. (help.openai.com) Benchmarks in an Android development test now put GPT‑5.4 tied with Google's Gemini 3.1 Pro Preview at the top (72.4%), while a GPT‑5.3-Codex variant scored 67.7% — the takeaway is steady, competitive iteration on coding tasks. ( )
OpenAI slipped a new model into ChatGPT without a big keynote: GPT‑5.4 mini showed up in the release notes on March 18, 2026, and the note says it is available to Free and Plus users in ChatGPT. (help.openai.com) That kind of release tells you how this market is moving now. Instead of one giant model launch every few months, OpenAI is shipping a family of versions like GPT‑5.3, GPT‑5.4, GPT‑5.4 Pro, and now GPT‑5.4 mini, each aimed at a different speed and cost tradeoff. (openai.com) A “mini” model is the smaller engine in the same car lineup. OpenAI’s own GPT‑5.4 launch described the main model as its most capable and efficient frontier model for professional work, while later reports on the mini said it was designed to run faster and closer to the flagship than older small models did. (openai.com; letsdatascience.com) The coding angle is where this quiet update gets easier to see. Google’s Android Bench is a test built around real Android app work like Jetpack Compose user interfaces, Coroutines and Flows for background tasks, Room database storage, and Hilt dependency injection. (9to5google.com; letsdatascience.com) In Google’s April 9, 2026 refresh of that ranking, GPT‑5.4 tied Google’s Gemini 3.1 Pro Preview at 72.4%. OpenAI’s GPT‑5.3‑Codex landed at 67.7%, ahead of Claude Opus 4.6 at 66.6% and GPT‑5.2 Codex at 62.5%. (9to5google.com; letsdatascience.com) That ranking matters because Android Bench is not a general trivia quiz. It tests whether a model can actually help build Android software, which is closer to the work that turns into shipped apps and fewer bugs. (9to5google.com) OpenAI has also been folding coding skill into its main models instead of keeping it in a separate lane. Its model notes say GPT‑5.4 incorporates the coding capabilities of GPT‑5.3‑Codex while improving how the model works across tools, software environments, spreadsheets, presentations, and documents. (help.openai.com) So the pattern is not “one model replaces all the others.” The pattern is that OpenAI is turning GPT into a product stack, where a flagship model handles maximum performance, a Pro version pushes harder, a Codex variant targets software work, and a mini version brings newer capabilities to cheaper or faster tiers. (openai.com; help.openai.com) That is why a quiet release-note update can matter more than a flashy demo. When a smaller GPT‑5.4 variant arrives at the same time the full GPT‑5.4 family is posting top-tier Android coding scores, it suggests the competition is now about steady weekly gains in useful work, not just headline model names. (help.openai.com; 9to5google.com; letsdatascience.com)