Abstract
While several high profile video games have served as testbeds for DeepReinforcement Learning (DRL), this technique has rarely been employed by thegame industry for crafting authentic AI behaviors. Previous research focuses ontraining super-human agents with large models, which is impractical for gamestudios with limited resources aiming for human-like agents. This paperproposes a sample-efficient DRL method tailored for training and fine-tuningagents in industrial settings such as the video game industry. Our methodimproves sample efficiency of value-based DRL by leveraging pre-collected dataand increasing network plasticity. We evaluate our method training a goalkeeperagent in EA SPORTS FC 25, one of the best-selling football simulations today.Our agent outperforms the game's built-in AI by 10% in ball saving rate.Ablation studies show that our method trains agents 50% faster compared tostandard DRL methods. Finally, qualitative evaluation from domain expertsindicates that our approach creates more human-like gameplay compared tohand-crafted agents. As a testimony of the impact of the approach, the methodis intended to replace the hand-crafted counterpart in next iterations of theseries.