Zhipu AI Releases 744B-Parameter Open-Source Model

Chinese AI firm Zhipu AI has released GLM-5, an open-source model with a 744B-parameter Mixture-of-Experts (MoE) architecture. The model, trained on 28.5T tokens using Huawei Ascend NPUs, is positioned as a competitor to frontier models like Claude Opus 4.5, demonstrating strong capabilities in agentic reasoning and workflow automation.

- Zhipu AI, known internationally as Z.ai, originated from Tsinghua University's Computer Science Department in 2019 and is considered one of China's "AI Tiger" startups. The company's valuation was reported at $2.8 billion in 2023, with investors including Alibaba, Tencent, Meituan, and Saudi Aramco's Prosperity7 Ventures. - The GLM-5's Mixture-of-Experts (MoE) architecture activates only a fraction of its total 744 billion parameters—roughly 40 billion—for any given token during inference. This approach is designed to provide the knowledge capacity of a very large model while maintaining the computational efficiency and lower inference costs of a smaller model. - A key feature of GLM-5 is its maximum output of 128,000 tokens, a significant increase from the typical 4,000 to 16,000 token limits of many other models. This allows it to generate entire software modules or long-form documents in a single pass. - For enterprise applications, GLM-5 is positioned for "agentic engineering," moving beyond simple code snippet generation to managing entire repositories and complex, multi-step business workflows. Its performance on benchmarks like SWE-Bench and Terminal-Bench 2.0 shows it approaching the capabilities of models like Claude 4.5 Opus in coding and reasoning tasks. - GLM-5's API is priced significantly lower than some competitors, with input tokens costing approximately 80% less and output tokens about 87% less than Claude 4.5 Opus. - The model integrates DeepSeek Sparse Attention (DSA), a technique intended to reduce memory overhead and deployment costs while preserving performance on tasks requiring a long context. It also utilizes a novel asynchronous reinforcement learning framework called "slime" for more efficient post-training. - Zhipu AI has been iterating on its GLM series, with GLM-4.5 released in March 2025 (355B parameters) and GLM-4.7 in December 2025, which showed significant gains in coding and reasoning benchmarks. The company also rebranded internationally from Zhipu AI to Z.ai in 2025. - In January 2025, the United States Commerce Department added Zhipu AI to its Entity List, citing national security concerns.

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.