Z.ai releases GLM‑5.1

Z.ai unveiled GLM‑5.1, an open‑weight, 754‑billion‑parameter model aimed at coding tasks that its backers say can run autonomously for hours and beat rivals on developer benchmarks. Reports claim GLM‑5.1 topped SWE‑Bench Pro and can sustain eight‑hour agentic execution, and it was released under an MIT licence that makes it a plausible substitute for specialised coding domains. That matters because open models with strong coding chops can narrow the moat around closed providers in enterprise software workflows. (computerworld.com, dataconomy.com)

A coding model is only useful if it can stay on task after the easy fixes run out, and Z.ai says its new GLM‑5.1 can keep working on one software job for up to eight hours instead of stalling after a few tool calls. The company published it on April 7, 2026 as an open‑weight release rather than a closed web service. (z.ai, computerworld.com) The basic idea here is a coding agent, which is a model that does more than answer a question once. It plans steps, edits files, runs tests, reads the errors, and tries again, which makes it closer to an intern with a terminal than a chatbot in a text box. (computerworld.com, z.ai) That kind of system usually breaks down over time because long jobs create drift. A model can forget the original goal, repeat bad fixes, or waste its context window, which is the running workspace where it keeps the code, logs, and instructions it still needs. (venturebeat.com, z.ai) Z.ai’s pitch is that GLM‑5.1 handles that long-horizon problem better than earlier coding models. The company says the model can keep improving over hundreds of iterations, which means repeated cycles of planning, editing, testing, and revising on the same task. (computerworld.com, z.ai) The number getting the most attention is 58.4 on SWE‑Bench Pro, which is a benchmark built from real GitHub issues and checks whether a model can actually patch the codebase correctly. Z.ai says that score put GLM‑5.1 ahead of GPT‑5.4, Claude Opus 4.6, and Gemini 3.1 Pro on that test. (z.ai, dataconomy.com) The model is large even by frontier standards at 754 billion parameters, which are the adjustable weights that store what the model learned during training. Z.ai also says GLM‑5.1 has a 202,752‑token context window, which is the amount of text and code it can keep in view at once during a job. (marktechpost.com, venturebeat.com) The licensing choice is a big part of the story. Z.ai says the weights are released under the Massachusetts Institute of Technology, or MIT, License, which is one of the most permissive software licenses and lets companies modify and ship the model with far fewer restrictions than many “open” AI releases. (z.ai, dataconomy.com) That changes the economics for software teams because a strong coding model no longer has to be rented only through a proprietary application programming interface, or API. If a company can run or fine-tune an MIT‑licensed coding model itself, it gets more control over cost, privacy, and the internal tools wrapped around the model. (z.ai, z.ai) There is still a catch in every benchmark story. SWE‑Bench Pro is useful because it tests real repository fixes, but benchmark wins do not guarantee that a model will behave reliably inside a messy corporate codebase with custom tools, security rules, and humans interrupting the workflow. (computerworld.com, venturebeat.com) Still, the old pattern in artificial intelligence was that the best coding systems stayed closed while open models lagged well behind. GLM‑5.1 is getting attention because it suggests that gap is narrowing in one of the most valuable parts of the market, which is software work that companies already know how to measure in bugs fixed, tests passed, and engineer hours saved. (dataconomy.com, computerworld.com)

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.