Rearranging the Familiar: Testing Compositional Generalization in Recurrent Networks

Abstract

Systematic compositionality is the ability to recombine meaningful units withregular and predictable outcomes, and it's seen as key to humans' capacity forgeneralization in language. Recent work has studied systematic compositionalityin modern seq2seq models using generalization to novel navigation instructionsin a grounded environment as a probing tool, requiring models to quicklybootstrap the meaning of new words. We extend this framework here to settingswhere the model needs only to recombine well-trained functional words (such as"around" and "right") in novel contexts. Our findings confirm and strengthenthe earlier ones: seq2seq models can be impressively good at generalizing tonovel combinations of previously-seen input, but only when they receiveextensive training on the specific pattern to be generalized (e.g.,generalizing from many examples of "X around right" to "jump around right"),while failing when generalization requires novel application of compositionalrules (e.g., inferring the meaning of "around right" from those of "right" and"around").

Quick Read (beta)

loading the full paper ...