Abstract
Inverse Reinforcement Learning (IRL) is the problem of finding a rewardfunction which describes observed/known expert behavior. IRL is useful forautomated control in situations where the reward function is difficult tospecify manually, which impedes reinforcement learning. We provide a new IRLalgorithm for the continuous state space setting with unknown transitiondynamics by modeling the system using a basis of orthonormal functions. Weprovide a proof of correctness and formal guarantees on the sample and timecomplexity of our algorithm.
Quick Read (beta)
loading the full paper ...