Time Reversal Symmetry for Efficient Robotic Manipulations in Deep Reinforcement Learning

Abstract

Symmetry is pervasive in robotics and has been widely exploited to improvesample efficiency in deep reinforcement learning (DRL). However, existingapproaches primarily focus on spatial symmetries, such as reflection, rotation,and translation, while largely neglecting temporal symmetries. To address thisgap, we explore time reversal symmetry, a form of temporal symmetry commonlyfound in robotics tasks such as door opening and closing. We propose TimeReversal symmetry enhanced Deep Reinforcement Learning (TR-DRL), a frameworkthat combines trajectory reversal augmentation and time reversal guided rewardshaping to efficiently solve temporally symmetric tasks. Our method generatesreversed transitions from fully reversible transitions, identified by aproposed dynamics-consistent filter, to augment the training data. Forpartially reversible transitions, we apply reward shaping to guide learning,according to successful trajectories from the reversed task. Extensiveexperiments on the Robosuite and MetaWorld benchmarks demonstrate that TR-DRLis effective in both single-task and multi-task settings, achieving highersample efficiency and stronger final performance compared to baseline methods.

Quick Read (beta)

loading the full paper ...