GPT-5.5 posts 82.7% on Terminal-Bench 2.0

- OpenAI said on April 23 it released GPT-5.5, a new model for coding and computer use, and began rolling it out in ChatGPT.
- OpenAI’s launch page says GPT-5.5 scored 82.7% on Terminal-Bench 2.0 and matched GPT-5.4 latency while using fewer tokens on Codex tasks.
- The API price starts at $5 per million input tokens, signaling a premium model aimed at agent-style work. (openai.com)

OpenAI released GPT-5.5 on April 23 and said the model is built to handle coding, research, data analysis, and software tasks across tools. (openai.com)

The company said GPT-5.5 is rolling out to Plus, Pro, Business, and Enterprise users in ChatGPT and Codex, with GPT-5.5 Pro going to Pro, Business, and Enterprise tiers. An April 24 update said GPT-5.5 and GPT-5.5 Pro are now available in the application programming interface, or API. (openai.com)

OpenAI’s launch page lists GPT-5.5 at 82.7% on Terminal-Bench 2.0, compared with 75.1% for GPT-5.4, 69.4% for Claude Opus 4.7, and 68.5% for Gemini 3.1 Pro. The same chart lists GPT-5.5 at 78.7% on OSWorld-Verified and 81.8% on CyberGym. (openai.com)

Terminal-Bench is a test for terminal-based computer work, the kind of command-line environment developers use to inspect files, run code, and fix systems. OpenAI framed GPT-5.5 as a model that can plan, use tools, check its work, and keep going through multi-step tasks. (openai.com)

OpenAI said GPT-5.5 matches GPT-5.4 on per-token latency in real-world serving while performing at a higher level and using fewer tokens on the same Codex tasks. That combination points to a model pitched for longer jobs where speed and cost both matter. (openai.com)

The API pricing page lists GPT-5.5 at $5.00 per 1 million input tokens, $0.50 per 1 million cached input tokens, and $30.00 per 1 million output tokens. GPT-5.4 is listed lower, at $2.50 per 1 million input tokens and $15.00 per 1 million output tokens. (openai.com)

OpenAI’s system card says the company ran GPT-5.5 through its predeployment safety evaluations and Preparedness Framework, including targeted red-teaming for advanced cybersecurity and biology capabilities. It also said it gathered feedback from nearly 200 early-access partners before release. (openai.com)

The public material does not mention a model named “GPT-5.4-Cyber,” and OpenAI’s launch page instead names GPT-5.5, GPT-5.5 Pro, GPT-5.4, and GPT-5.4 Pro. The benchmark claim at the center of the chatter traces back to OpenAI’s own product page, not just reposts on X. (openai.com)

The release turns a social-media rumor into a documented product launch: GPT-5.5 is real, the 82.7% Terminal-Bench 2.0 score is on OpenAI’s site, and the listed API entry price is $5 per million input tokens. (openai.com 1) (openai.com 2)
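
For readers who want to see what the listed prices mean for a single job, here is a rough Python sketch. It uses only the per-million-token list prices quoted in this article. The token counts are hypothetical, the `job_cost` helper and the lowercase model keys are illustrative names rather than anything from OpenAI, and cached-input pricing is left out because the article gives that figure for GPT-5.5 only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Price:
    """List price in USD per 1 million tokens."""
    input_per_m: float
    output_per_m: float


# Prices as quoted in the article; the dictionary keys are illustrative labels.
PRICES = {
    "gpt-5.5": Price(input_per_m=5.00, output_per_m=30.00),
    "gpt-5.4": Price(input_per_m=2.50, output_per_m=15.00),
}


def job_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one job from uncached list prices."""
    price = PRICES[model]
    total = input_tokens * price.input_per_m + output_tokens * price.output_per_m
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical job size, chosen only for illustration.
    tokens_in, tokens_out = 200_000, 20_000
    for name in PRICES:
        print(f"{name}: ${job_cost(name, tokens_in, tokens_out):.2f}")
```

Because both listed rates for GPT-5.5 are exactly double those for GPT-5.4, the same job costs twice as much at equal token counts. Under these list prices, GPT-5.5 would need to use about half the tokens on a task to cost the same, and the article does not say how large the token savings are.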
