Abstract
Embodied Artificial Intelligence (Embodied AI) is crucial for achieving Artificial General Intelligence (AGI) and serves as a foundation for various applications that bridge cyberspace and the physical world. Recently, the emergence of Multi-modal Large Models (MLMs) and World Models (WMs) has attracted significant attention due to their remarkable perception, interaction, and reasoning capabilities, making them a promising architecture for the brain of embodied agents. However, there is no comprehensive survey of Embodied AI in the era of MLMs. In this survey, we give a comprehensive exploration of the latest advancements in Embodied AI. Our analysis first reviews representative works on embodied robots and simulators to understand their research focuses and limitations. Then, we analyze four main research targets: 1) embodied perception, 2) embodied interaction, 3) embodied agent, and 4) sim-to-real adaptation, covering the state-of-the-art methods, essential paradigms, and comprehensive datasets. Additionally, we explore the complexities of MLMs in virtual and real embodied agents, highlighting their significance in facilitating interactions in dynamic digital and physical environments. Finally, we summarize the challenges and limitations of Embodied AI and discuss potential future directions. We hope this survey will serve as a foundational reference for the research community and inspire continued innovation. The associated project can be found at https://github.com/HCPLab-SYSU/Embodied_AI_Paper_List.