Learning List-wise Representation in Reinforcement Learning for Ads Allocation with Multiple Auxiliary Tasks

Abstract

With the recent prevalence of reinforcement learning (RL), there have beentremendous interests in utilizing RL for ads allocation in recommendationplatforms (e.g., e-commerce and news feed sites). To achieve better allocation,the input of recent RL-based ads allocation methods is upgraded from point-wisesingle item to list-wise item arrangement. However, this also results in ahigh-dimensional space of state-action pairs, making it difficult to learnlist-wise representations with good generalization ability. This furtherhinders the exploration of RL agents and causes poor sample efficiency. Toaddress this problem, we propose a novel RL-based approach for ads allocationwhich learns better list-wise representations by leveraging task-specificsignals on Meituan food delivery platform. Specifically, we propose threedifferent auxiliary tasks based on reconstruction, prediction, and contrastivelearning respectively according to prior domain knowledge on ads allocation. Weconduct extensive experiments on Meituan food delivery platform to evaluate theeffectiveness of the proposed auxiliary tasks. Both offline and onlineexperimental results show that the proposed method can learn better list-wiserepresentations and achieve higher revenue for the platform compared to thestate-of-the-art baselines.

Quick Read (beta)

loading the full paper ...