NVIDIA Releases 'DreamDojo' Open-Source Robot Brain

NVIDIA has released DreamDojo, an open-source world model for scalable robot training. The system uses over 44,000 hours of egocentric human video to create action models and a real-time simulation environment. This foundation model approach is designed to enable zero-shot planning and accelerate sim-to-real transfer for embodied AI.

- The DreamDojo-HV training dataset is the largest of its kind, containing 44,711 hours of first-person human video, which is 15 times longer and covers 2,000 times more unique scenes than previous datasets used for world model pre-training. - To learn from human videos that lack robot-specific motor commands, DreamDojo uses "latent actions," a hardware-agnostic proxy that infers the physics of intent from pixels, which can then be fine-tuned to a specific robot's hardware. - This technology is part of NVIDIA's broader Project GR00T (Generalist Robot 00 Technology), a foundation model intended to be a generalized "brain" for humanoid robots from partners like Figure AI, Boston Dynamics, and Agility Robotics. - A distilled version of the model can generate physically accurate simulations at over 10 frames per second, enabling real-time applications like live VR teleoperation and policy evaluation with a 0.995 correlation to real-world performance. - The models are designed to run on Jetson Thor, a new computer for humanoid robots featuring a next-generation GPU based on the NVIDIA Blackwell architecture, delivering 800 teraflops of 8-bit floating point AI performance. - DreamDojo is part of NVIDIA's full-stack robotics strategy, which combines edge computers (Jetson), simulation platforms (Isaac Sim on Omniverse), and foundational AI models to create a comprehensive development ecosystem. - The release is part of an industry-wide "World Model arms race" against similar generative simulation projects from competitors like Google DeepMind (Genie 3) and robotics firm 1X (1XWM). - This approach is meant to solve the "sim-to-real gap," a major challenge in robotics where behaviors learned in simulation fail to transfer perfectly to the real world due to discrepancies in physics and visual rendering.

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.