Glean, Mistral push enterprise agents
- Glean launched Waldo on April 28, a search-specific model for enterprise agents, while Mistral opened Workflows in public preview to run agents reliably. - Waldo claims roughly 50% lower latency and 25% fewer tokens; Mistral says Workflows already powers millions of daily executions for firms like ASML. - The market is shifting from flashy demos to agent plumbing—tool use, retries, traces, and failure handling that enterprises actually need.
Enterprise AI agents keep running into the same wall. The model can sound smart, but the system around it is brittle. Search breaks, tools fail, context gets lost, and nobody can explain why an answer went wrong. This week, Glean and Mistral pushed at that exact problem from two different angles—Glean with a search-specialized model called Waldo, and Mistral with a workflow layer built for retries, observability, and fault tolerance. (glean.com) ### What did Glean actually launch? Glean launched Waldo on April 28 as an “agentic search model” built on NVIDIA Nemotron 3 Nano and tuned specifically for enterprise search. The idea is simple but important: a lot of enterprise agents do not start with reasoning, they start with finding the right internal information. Waldo plans queries, calls search tools, reads results iteratively, and then hands evidence to a larger model for the final answer. (glean.com) ### Why make a separate search model? Because general-purpose frontier models are expensive overkill for every step. Glean is basically splitting the job in two. Waldo handles the search loop—query decomposition, tool choice, evidence gathering—while a stronger model handles synthesis at the end. Glean says that setup delivered about 50% lower latency and roughly 25% fewer tokens in its internal harness, without a quality drop. (glean.com) ### What changed at Mistral? Mistral put Workflows into public preview on April 28. This is not another model release. It is an orchestration layer for enterprise AI processes—the software that keeps multi-step agent systems running when real work is on the line. Mistral frames it as the missing bridge between proof-of-concept agents and production systems that need durability, observability, and fault tolerance. (mistral.ai) ### What does “orchestration” mean here? Basically, all the boring stuff that turns out not to be boring at all. Workflows gives developers retry logic, human approvals, traceable execution, and deployment options across cloud, on-prem, and hybrid setups. Mistral’s docs also expose execution traces and OpenTelemetry-style observability, so teams can inspect spans, timings, activity counts, and errors instead of guessing where an agent failed. (mistral.ai) ### Why are those features the real story? Because enterprise agents usually fail in operational ways, not benchmark ways. A demo can survive one flaky API call or one bad retrieval step. A production workflow cannot. If an agent is processing customer support, logistics, or finance tasks, it needs retries, checkpoints, audit trails, and a way to pause for a human. That is why Mistral is highlighting users like A(mistral.ai)ue Postale—not just model scores. (mistral.ai) ### How do Glean and Mistral fit together? They are solving adjacent layers of the same stack. Glean is narrowing one hard subproblem—enterprise search—into a specialized model that can gather evidence faster and cheaper. Mistral is building the runtime that lets many such steps execute reliably over time. One is about better agent behavior inside a task. The other is about keeping the whole process alive across tasks. (glean.com) ### Why is this happening now? Because the market is maturing. Last year’s pitch was “look, the model can use tools.” This year’s pitch is “look, the system can survive production.” Even Glean’s own recent messaging leans toward unified enterprise platforms and operational context, not just raw model quality. And Mistral is explicitly saying Workflows is for moving AI processes from prototype to production. (glean.com) ### What should you take away? The center of gravity is shifting from model magic to agent infrastructure. Better reasoning still matters, but the harder commercial problem is making agents dependable, inspectable, and cheap enough to run at scale. Glean and Mistral are both betting that enterprises will pay for that layer first. (glean.com)