Abstract
Recent years have seen significant advancements in humanoid control, largelydue to the availability of large-scale motion capture data and the applicationof reinforcement learning methodologies. However, many real-world tasks, suchas moving large and heavy furniture, require multi-character collaboration.Given the scarcity of data on multi-character collaboration and the efficiencychallenges associated with multi-agent learning, these tasks cannot bestraightforwardly addressed using training paradigms designed for single-agentscenarios. In this paper, we introduce Cooperative Human-Object Interaction(CooHOI), a novel framework that addresses multi-character objects transportingthrough a two-phase learning paradigm: individual skill acquisition andsubsequent transfer. Initially, a single agent learns to perform tasks usingthe Adversarial Motion Priors (AMP) framework. Following this, the agent learnsto collaborate with others by considering the shared dynamics of themanipulated object during parallel training using Multi Agent Proximal PolicyOptimization (MAPPO). When one agent interacts with the object, resulting inspecific object dynamics changes, the other agents learn to respondappropriately, thereby achieving implicit communication and coordinationbetween teammates. Unlike previous approaches that relied on tracking-basedmethods for multi-character HOI, CooHOI is inherently efficient, does notdepend on motion capture data of multi-character interactions, and can beseamlessly extended to include more participants and a wide range of objecttypes