Abstract
Humanoid robots derive much of their dexterity from hyper-dexterouswhole-body movements, enabling tasks that require a large operationalworkspace: such as picking objects off the ground. However, achieving thesecapabilities on real humanoids remains challenging due to their high degrees offreedom (DoF) and nonlinear dynamics. We propose Adaptive Motion Optimization(AMO), a framework that integrates sim-to-real reinforcement learning (RL) withtrajectory optimization for real-time, adaptive whole-body control. To mitigatedistribution bias in motion imitation RL, we construct a hybrid AMO dataset andtrain a network capable of robust, on-demand adaptation to potentially O.O.D.commands. We validate AMO in simulation and on a 29-DoF Unitree G1 humanoidrobot, demonstrating superior stability and an expanded workspace compared tostrong baselines. Finally, we show that AMO's consistent performance supportsautonomous task execution via imitation learning, underscoring the system'sversatility and robustness.