Transparency Helps Reveal When Language Models Learn Meaning

Abstract

Many current NLP systems are built from language models trained to optimizeunsupervised objectives on large amounts of raw text. Under what conditionsmight such a procedure acquire meaning? Our systematic experiments withsynthetic data reveal that, with languages where all expressions havecontext-independent denotations (i.e., languages with strong transparency),both autoregressive and masked language models successfully learn to emulatesemantic relations between expressions. However, when denotations are changedto be context-dependent with the language otherwise unmodified, this abilitydegrades. Turning to natural language, our experiments with a specificphenomenon -- referential opacity -- add to the growing body of evidence thatcurrent language models do not represent natural language semantics well. Weshow this failure relates to the context-dependent nature of natural languageform-meaning mappings.

Quick Read (beta)

loading the full paper ...