Representation-Driven Reinforcement Learning

  • 2023-05-31 15:59:12
  • Ofir Nabati, Guy Tennenholtz, Shie Mannor
  • 0


We present a representation-driven framework for reinforcement learning. Byrepresenting policies as estimates of their expected values, we leveragetechniques from contextual bandits to guide exploration and exploitation.Particularly, embedding a policy network into a linear feature space allows usto reframe the exploration-exploitation problem as arepresentation-exploitation problem, where good policy representations enableoptimal exploration. We demonstrate the effectiveness of this framework throughits application to evolutionary and policy gradient-based approaches, leadingto significantly improved performance compared to traditional methods. Ourframework provides a new perspective on reinforcement learning, highlightingthe importance of policy representation in determining optimalexploration-exploitation strategies.


Quick Read (beta)

loading the full paper ...