Hierarchically Structured Reinforcement Learning for Topically Coherent Visual Story Generation

  • 2018-05-21 17:23:31
  • Qiuyuan Huang, Zhe Gan, Asli Celikyilmaz, Dapeng Wu, Jianfeng Wang, Xiaodong He
  • 11

Abstract

We propose a hierarchically structured reinforcement learning approach toaddress the challenges of planning for generating coherent multi-sentencestories for the visual storytelling task. Within our framework, the task ofgenerating a story given a sequence of images is divided across a two-levelhierarchical decoder. The high-level decoder constructs a plan by generating asemantic concept (i.e., topic) for each image in sequence. The low-leveldecoder generates a sentence for each image using a semantic compositionalnetwork, which effectively grounds the sentence generation conditioned on thetopic. The two decoders are jointly trained end-to-end using reinforcementlearning. We evaluate our model on the visual storytelling (VIST) dataset.Empirical results demonstrate that the proposed hierarchically structuredreinforced training achieves significantly better performance compared to aflat deep reinforcement learning baseline.

 

Quick Read (beta)

loading the full paper ...