Countering Language Drift with Seeded Iterated Learning

Abstract

Supervised learning methods excel at capturing statistical properties oflanguage when trained over large text corpora. Yet, these models often produceinconsistent outputs in goal-oriented language settings as they are not trainedto complete the underlying task. Moreover, as soon as the agents are finetunedto maximize task completion, they suffer from the so-called language driftphenomenon: they slowly lose syntactic and semantic properties of language asthey only focus on solving the task. In this paper, we propose a genericapproach to counter language drift by using iterated learning. We iteratebetween fine-tuning agents with interactive training steps, and periodicallyreplacing them with new agents that are seeded from last iteration and trainedto imitate the latest finetuned models. Iterated learning does not requireexternal syntactic constraint nor semantic knowledge, making it a valuabletask-agnostic finetuning protocol. We first explore iterated learning in theLewis Game. We then scale-up the approach in the translation game. In bothsettings, our results show that iterated learn-ing drastically counterslanguage drift as well as it improves the task completion metric.

Quick Read (beta)

loading the full paper ...