Inverse Reinforcement Learning for Marketing

  • 2017-12-13 05:46:22
  • Igor Halperin

Abstract

Learning customer preferences from observed behaviour is an important topic in the marketing literature. Structural models typically treat forward-looking customers or firms as utility-maximizing agents whose utility is estimated using methods of Stochastic Optimal Control. We suggest an alternative approach to studying dynamic consumer demand, based on Inverse Reinforcement Learning (IRL). We develop a version of Maximum Entropy IRL that leads to a highly tractable model formulation, in which the search for optimal model parameters amounts to low-dimensional convex optimization. Using simulations of consumer demand, we show that observational noise for identical customers can easily be confused with apparent consumer heterogeneity.

 
