Research Advances Multi-Agent and Embodied AI
Recent research is pushing the frontiers of embodied AI. The CaPo framework optimizes cooperative planning for multiple physical agents, while a new method from Oxford and Microsoft proposes a scalable approach to measuring agent autonomy via code inspection. Another study reveals how language models develop emergent planning abilities, crucial for complex task execution.
- The Department of Defense is significantly increasing investment in AI and autonomy, with a recent budget request seeking funds for drone swarms, autonomous "wingman" fighters, and undersea drones. Specific allocations include $789 million for the Air Force's collaborative combat aircraft program, $486 million for the Army's basic AI research, and $185 million for the Navy's large unmanned undersea program. A joint House and Senate Armed Services Committee bill proposed adding an estimated $150B to the DOD's budget for fiscal year 2026, with large sums earmarked for AI and autonomy applications. - Venture capital funding for robotics startups is robust, with startups raising $6.4 billion so far in 2024, on pace to exceed the $6.9 billion raised in 2023. However, the number of funding rounds has decreased from 671 in 2023 to 473 in 2024, indicating that larger checks are going to fewer, more established companies. Notable recent funding rounds include Figure AI's $675 million raise and Physical Intelligence's $400 million round. - In the growing humanoid robot market, which is projected to reach $7.9 billion by 2025, Chinese manufacturers currently lead in shipment volume, accounting for 87% of deliveries in 2025. Key players include Chinese companies Agibot and Unitree, while U.S.-based companies like Figure AI, Agility Robotics, and Boston Dynamics are focusing on innovation and functionality over volume. - Multi-agent systems are a critical area of development for defense applications, enhancing strategy and operations in areas like coastal defense, drone swarms, and cybersecurity. These systems use networks of autonomous AI agents to process vast amounts of data, coordinate responses, and adapt to dynamic environments more effectively than centralized systems. In cybersecurity, for example, multi-agent systems can simulate thousands of attack vectors to identify vulnerabilities and develop countermeasures in minutes. - The concept of "emergent abilities," where large language models develop unexpected capabilities as they scale, is a key research area for creating more advanced autonomous agents. These abilities, which are not present in smaller models and cannot be predicted by extrapolation, include advanced reasoning and problem-solving. There is ongoing debate about whether these are true emergent properties or artifacts of the evaluation metrics used. - Scalable methods for evaluating the autonomy of AI agents are being developed to reduce the risks and costs associated with traditional run-time evaluations. One novel approach involves a code-based assessment that scores an agent's orchestration code based on a taxonomy of autonomy attributes, such as impact and oversight, without needing to run the agent. - In industrial and logistics automation, there is a major trend toward "physical AI," which focuses on making robots more adaptive to variable and unstructured environments like warehouses and factory floors. This involves retrofitting existing hardware with AI and developing robotics foundation models. Key applications include automated guided vehicles (AGVs) and autonomous mobile robots (AMRs) for material transport, and robotic arms for picking and packing. - For leaders of engineering teams in the autonomy space, a key challenge is transitioning technically ready embodied AI into widespread commercial adoption. This requires a focus on usability for non-roboticists, transparency through explainable AI, and demonstrating clear ROI through metrics like cycle-time reductions and throughput gains.