Abstract
Recent works have been exploring the scaling laws in the field of EmbodiedAI. Given the prohibitive costs of collecting real-world data, we believe theSimulation-to-Real (Sim2Real) paradigm is a crucial step for scaling thelearning of embodied models. This paper introduces project GRUtopia, the firstsimulated interactive 3D society designed for various robots. It featuresseveral advancements: (a) The scene dataset, GRScenes, includes 100kinteractive, finely annotated scenes, which can be freely combined intocity-scale environments. In contrast to previous works mainly focusing on home,GRScenes covers 89 diverse scene categories, bridging the gap ofservice-oriented environments where general robots would be initially deployed.(b) GRResidents, a Large Language Model (LLM) driven Non-Player Character (NPC)system that is responsible for social interaction, task generation, and taskassignment, thus simulating social scenarios for embodied AI applications. (c)The benchmark, GRBench, supports various robots but focuses on legged robots asprimary agents and poses moderately challenging tasks involving ObjectLoco-Navigation, Social Loco-Navigation, and Loco-Manipulation. We hope thatthis work can alleviate the scarcity of high-quality data in this field andprovide a more comprehensive assessment of Embodied AI research. The project isavailable at https://github.com/OpenRobotLab/GRUtopia.