Learning to Brachiate via Simplified Model Imitation

Abstract

Brachiation is the primary form of locomotion for gibbons and siamangs, in which these primates swing from tree limb to tree limb using only their arms. It is challenging to control because of the limited control authority, the required advance planning, and the precision of the required grasps. We present a novel approach to this problem using reinforcement learning, demonstrated on a finger-less 14-link planar model that learns to brachiate across challenging handhold sequences. Key to our method is the use of a simplified model, a point mass with a virtual arm, for which we first learn a policy that can brachiate across handhold sequences with a prescribed order. This facilitates the learning of the policy for the full model, which it guides by providing an overall center-of-mass trajectory to imitate, as well as the timing of the holds. Lastly, the simplified model can also readily be used for planning suitable sequences of handholds in a given environment. Our results demonstrate brachiation motions with a variety of durations for the flight and hold phases, as well as emergent extra back-and-forth swings when this proves useful. The system is evaluated with a variety of ablations. The method enables future work towards more general 3D brachiation, as well as using simplified model imitation in other settings.
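
As a rough illustration of what imitating a center-of-mass trajectory could look like, the sketch below computes the planar center of mass of a multi-link model and scores it against a reference point from a simplified model. It is a minimal sketch under stated assumptions, not the reward used in the paper: the function names, the exponential kernel, and the `scale` value are all hypothetical choices made for illustration.

```python
import math


def center_of_mass(link_positions, link_masses):
    """Mass-weighted average of 2D link positions (planar model)."""
    total = sum(link_masses)
    x = sum(m * p[0] for p, m in zip(link_positions, link_masses)) / total
    y = sum(m * p[1] for p, m in zip(link_positions, link_masses)) / total
    return (x, y)


def com_imitation_reward(full_com, ref_com, scale=5.0):
    """Hypothetical tracking reward in (0, 1].

    Equals 1 when the full model's center of mass coincides with the
    reference point from the simplified model, and decays exponentially
    with the Euclidean distance between them. The kernel shape and the
    `scale` value are assumptions, not taken from the paper.
    """
    dx = full_com[0] - ref_com[0]
    dy = full_com[1] - ref_com[1]
    return math.exp(-scale * math.hypot(dx, dy))


# Example: two equal-mass links, compared against a reference point.
com = center_of_mass([(0.0, 0.0), (2.0, 0.0)], [1.0, 1.0])
reward = com_imitation_reward(com, (1.0, 0.5))
```

In a reinforcement learning setup, a term like this would typically be evaluated at every time step against the simplified model's trajectory and combined with task rewards; how the paper weights or combines such terms is not stated in the abstract.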
