Alibaba Paper Details Adaptive Scheduling for Agent Teams

A new research paper from Alibaba's Orbit has detailed AQESF, an adaptive scheduling framework for managing multi-agent systems. The architecture dynamically balances quality-of-service requirements for online task batches, adjusting to changing workloads and reliability constraints. The patterns are reportedly influencing internal Alibaba deployments, offering a template for scaling reliable agent platforms.

While public details on Alibaba's AQESF scheduling framework remain sparse, it points to a critical trend in scaling multi-agent systems: the need for dynamic, quality-of-service-aware orchestration. Alibaba's broader strategy leans heavily on a data-centric approach, viewing effective agent collaboration as the most practical path to scaling complex AI applications, rather than pursuing a single "super agent." This philosophy is materialized in their open-source frameworks like AgentScope and Spring AI Alibaba, which provide the architectural backbone for their enterprise deployments. A core component of Alibaba's strategy is the "data flywheel," a continuous feedback loop for improving agent performance. This involves collecting data from customer interactions, combining it with domain-specific knowledge, creating evaluation datasets to meet service-level agreements, and then using feedback from live deployments to refine the system. This data-first approach is evident in the integration of agents into high-traffic platforms like Amap for multi-day trip planning and DingTalk, which handles over 200 million daily interactions with its Qwen-powered copilots. The challenges of reliability and handoffs in multi-agent systems are significant, with many pilot projects failing when they move to production. Common failure points include state synchronization issues, where agents work with outdated information, and communication bottlenecks that can cascade and cause significant latency. Production-grade systems often compound the probabilistic nature of LLMs, turning minor uncertainties into major deviations from the intended workflow. To combat these issues, leading teams are implementing several architectural patterns. One common approach is to build a clean "tooling layer" of APIs, preventing agents from directly interacting with complex and sometimes legacy backend systems. Another is to enforce deterministic behavior by constraining agents to use predefined JSON outputs for actions rather than allowing them to improvise. For orchestration, patterns like a hierarchical structure with a planner agent delegating tasks to specialized worker agents, and a validator to check the quality of work, are gaining traction. In China's competitive landscape, the battle is less about individual model performance and more about platform-level integration. Companies like Tencent, with its Hunyuan AI backbone in WeChat managing billions of daily agent tool calls, and Baidu with its ERNIE AI Agent platform, are building comprehensive ecosystems. This contrasts with the Western market's more fragmented approach, highlighting a strategy of deep integration into super-apps to drive user engagement and monetization through ecosystem lock-in rather than direct AI sales. The open-source community is rapidly evolving to address the complexities of multi-agent systems. Frameworks like LangGraph offer precise, graph-based control over workflows, which is beneficial for creating deterministic and stateful agent interactions. Microsoft's AutoGen focuses on multi-agent conversations, while CrewAI simplifies the orchestration of role-playing agents for collaborative tasks. These tools provide a range of options for developers to manage the intricacies of agent communication and collaboration. For consumer-facing AI products, the user experience of complex agent behavior is paramount. The trend is moving from simple conversational interfaces to fully autonomous task execution loops. This requires a deep understanding of user intent and the ability to break down complex goals into a series of reliable actions. The success of consumer AI agents will depend on making this intricate behind-the-scenes orchestration feel simple and seamless to the end-user. From a regulatory standpoint, the Chinese government is actively encouraging the development and adoption of AI agents through policy measures and standardization efforts. This support, combined with the massive domestic market, is fueling rapid growth in the AI agent sector. For companies operating in this space, staying attuned to these regulatory signals is as crucial as the technical development of their agent platforms.

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.