Logo RoboOS: A Hierarchical Embodied Framework for Cross-Embodiment and Multi-Agent Collaboration

Abstract

The rise of embodied intelligence has intensified the need for robust multi-agent collaboration in industrial automation, service robotics, and smart manufacturing. However, current robotic systems struggle with critical limitations, including poor cross-embodiment adaptability, inefficient task scheduling, and inadequate dynamic error correction. While end-to-end vision-language-action (VLA) models (e.g., OpenVLA, RDT, Pi-0) exhibit weak long-horizon planning and task generalization, hierarchical VLA models (e.g., Helix, Gemini-Robotics, GR00T-N1) lack cross-embodiment compatibility and multi-agent coordination capabilities. To address these challenges, we present RoboOS, the first open-source embodied cross-embodiment and multi-agent collaboration Framework for based on a Brain-Cerebellum hierarchical architecture, facilitating a paradigm shift from single-agent to swarm intelligence. Specifically, RoboOS comprises three key components: (1) the Embodied Cloud Model, a multimodal large language model (MLLM) for global perception and high-level decision-making; (2) the Cerebellum Skill Library, a modular, plug-and-play toolkit for seamless multi-skill execution; and (3) Real-Time Shared Memory, a spatiotemporal synchronization mechanism for multi-agent state coordination. By integrating hierarchical information flow, RoboOS bridges the Embodied Brain and Cerebellum Skill Library, enabling robust planning, scheduling, and error correction for long-horizon tasks while ensuring efficient multi-agent collaboration by Real-Time Shared Memory. Moreover, we optimize edge-cloud communication and cloud-based distributed inference to support high-frequency interactions and scalable deployment.Extensive real-world experiments across diverse scenarios (e.g., restaurant, household, supermarket) demonstrate RoboOS's versatility, supporting heterogeneous embodiments (single-arm, dual-arm, humanoid, wheeled), which provides a scalable and practical solution for cross-embodiment collaboration, pushing the boundaries of embodied intelligence.

Framework of RoboOS

teaser
Framework of RoboOS 1.0. RoboOS is a Brain-Cerebellum hierarchical architecture for multi-robot collaboration that comprises three core components: (a) Cloud-based Embodied Brain model for high-level cognition and multi-agent coordination; (b) Distributed Cerebellum Modules for executing robot-specific skills; and (c) Real-Time Shared Memory for enhanced environmental awareness.
teaser
Framework of RoboOS 2.0 (SaaS + MCP). RoboOS is a Brain-Cerebellum hierarchical architecture for multi-robot collaboration that comprises three core components: (a) Cloud-based Embodied Brain model for high-level cognition and multi-agent coordination; (b) Distributed Cerebellum Modules for executing robot-specific skills; and (c) Real-Time Shared Memory for enhanced environmental awareness.

Pipeline of RoboOS

teaser
Pipeline of RoboOS. The RoboOS framework implements a workflow pipeline for multi-robot collaboration, consisting of four key phases: (1) hierarchical task decomposition, (2) topology-aware subtask allocation, (3) distributed agent-based execution, and (4) dynamic memory updating. This integrated workflow enables coordinated task completion while maintaining adaptability to environmental and operational constraints.

Real-world Demos of RoboOS

teaser
Real-world RoboOS Demonstrations. We showcase multi-robot collaboration in three scenarios: (a) Restaurant: Unitree G1 and Agilex robots prepare burgers. (b) Household: Realman and Agilex robots fetch items. (c) Supermarket: Robots coordinate gift selection and packaging.

Citation

If you find our work helpful, feel free to cite it:


@article{tan2025roboos,
    title={RoboOS: A Hierarchical Embodied Framework for Cross-Embodiment and Multi-Agent Collaboration}, 
    author={Tan, Huajie and Hao, Xiaoshuai and Lin, Minglan and Wang, Pengwei and Lyu, Yaoxu and Cao, Mingyu and Wang, Zhongyuan and Zhang, Shanghang},
    journal={arXiv preprint arXiv:2505.03673},
    year={2025}
}

@article{RoboBrain2.0TechnicalReport,
    title={RoboBrain 2.0 Technical Report},
    author={BAAI RoboBrain Team},
    journal={arXiv preprint arXiv:2507.02029},
    year={2025}
}

@article{RoboBrain1.0,
    title={Robobrain: A unified brain model for robotic manipulation from abstract to concrete},
    author={Ji, Yuheng and Tan, Huajie and Shi, Jiayu and Hao, Xiaoshuai and Zhang, Yuan and Zhang, Hengyuan and Wang, Pengwei and Zhao, Mengdi and Mu, Yao and An, Pengju and others},
    journal={arXiv preprint arXiv:2502.21257},
    year={2025}
}