Abstract
Reinforcement learning (RL) agents aim at learning by interacting with anenvironment, and are not designed for representing or reasoning withdeclarative knowledge. Knowledge representation and reasoning (KRR) paradigmsare strong in declarative KRR tasks, but are ill-equipped to learn from suchexperiences. In this work, we integrate logical-probabilistic KRR withmodel-based RL, enabling agents to simultaneously reason with declarativeknowledge and learn from interaction experiences. The knowledge from humans andRL is unified and used for dynamically computing task-specific planning modelsunder potentially new environments. Experiments were conducted using a mobilerobot working on dialog, navigation, and delivery tasks. Results showsignificant improvements, in comparison to existing model-based RL methods.