Abstract
Interactive reinforcement learning speeds up the learning process of autonomous agents by including a human trainer who provides extra information to the agent in real time. Current interactive reinforcement learning research has been limited to real-time interactions that offer user advice relevant to the current state only. Additionally, the information provided by each interaction is not retained; the agent discards it after a single use. In this work, we propose a persistent rule-based interactive reinforcement learning approach, i.e., a method for retaining and reusing provided knowledge, allowing trainers to give general advice relevant to more than just the current state. Our experimental results show that persistent advice substantially improves the performance of the agent while reducing the number of interactions required from the trainer. Moreover, rule-based advice has a performance impact similar to that of state-based advice, but with a substantially reduced interaction count.