Funnel-based Reward Shaping for Signal Temporal Logic Tasks in Reinforcement Learning

Abstract

Signal Temporal Logic (STL) is a powerful framework for describing the complex temporal and logical behaviour of dynamical systems. Numerous studies have attempted to employ reinforcement learning to learn a controller that enforces STL specifications; however, they have been unable to effectively tackle the challenges of ensuring robust satisfaction in continuous state spaces while maintaining tractability. In this paper, leveraging the concept of funnel functions, we propose a tractable reinforcement learning algorithm that learns a time-dependent policy for robust satisfaction of STL specifications in continuous state spaces. We demonstrate the utility of our approach on several STL tasks in different environments.
