Z.ai's GLM-5 Model Tops Open-Source Benchmark

Published February 21, 2026 by The Daily Scout

Z.ai has released its GLM-5 model, which has achieved the top score on the open-weights Intelligence Index benchmark. The new model is more than double the size of its predecessor, positioning it as a leading contender in the open-source large language model ecosystem.

Why it matters

- GLM-5 is explicitly designed for "Agentic Engineering," targeting complex, long-horizon tasks that require autonomous planning, tool invocation, and multi-step execution. It has achieved state-of-the-art performance for open-weight models on agentic benchmarks such as BrowseComp (web navigation) and MCP-Atlas (multi-step task execution). - The model's architecture is a Mixture of Experts (MoE) with 744 billion total parameters and 40 billion active parameters per token. It incorporates DeepSeek Sparse Attention (DSA) to efficiently manage its 200K token context window, which is aimed at reducing deployment costs for long-context applications. - Post-training was conducted using a novel asynchronous reinforcement learning (

Key numbers

Z.ai has released its GLM-5 model, which has achieved the top score on the open-weights Intelligence Index benchmark.
- GLM-5 is explicitly designed for "Agentic Engineering," targeting complex, long-horizon tasks that require autonomous planning, tool invocation, and multi-step execution.
The model's architecture is a Mixture of Experts (MoE) with 744 billion total parameters and 40 billion active parameters per token.
It incorporates DeepSeek Sparse Attention (DSA) to efficiently manage its 200K token context window, which is aimed at reducing deployment costs for long-context applications.

Sources

Quick answers

What happened in Z.ai's GLM-5 Model Tops Open-Source Benchmark?

Z.ai has released its GLM-5 model, which has achieved the top score on the open-weights Intelligence Index benchmark. The new model is more than double the size of its predecessor, positioning it as a leading contender in the open-source large language model ecosystem.

Why does Z.ai's GLM-5 Model Tops Open-Source Benchmark matter?

GLM-5 is explicitly designed for "Agentic Engineering," targeting complex, long-horizon tasks that require autonomous planning, tool invocation, and multi-step execution. It has achieved state-of-the-art performance for open-weight models on agentic benchmarks such as BrowseComp (web navigation) and MCP-Atlas (multi-step task execution). The model's architecture is a Mixture of Experts (MoE) with 744 billion total parameters and 40 billion active parameters per token. It incorporates DeepSeek Sparse Attention (DSA) to efficiently manage its 200K token context window, which is aimed at reducing deployment costs for long-context applications. Post-training was conducted using a novel asynchronous reinforcement learning (

Z.ai's GLM-5 Model Tops Open-Source Benchmark

What happened

Why it matters

Key numbers

Sources

Quick answers

What happened in Z.ai's GLM-5 Model Tops Open-Source Benchmark?

Why does Z.ai's GLM-5 Model Tops Open-Source Benchmark matter?

Get your own daily briefing