Transparency Helps Reveal When Language Models Learn Meaning

  • 2023-03-04 19:58:34
  • Zhaofeng Wu, William Merrill, Hao Peng, Iz Beltagy, Noah A. Smith
  • 0

Abstract

Many current NLP systems are built from language models trained to optimizeunsupervised objectives on large amounts of raw text. Under what conditionsmight such a procedure acquire meaning? Our systematic experiments withsynthetic data reveal that, with languages where all expressions havecontext-independent denotations (i.e., languages with strong transparency),both autoregressive and masked language models successfully learn to emulatesemantic relations between expressions. However, when denotations are changedto be context-dependent with the language otherwise unmodified, this abilitydegrades. Turning to natural language, our experiments with a specificphenomenon -- referential opacity -- add to the growing body of evidence thatcurrent language models do not represent natural language semantics well. Weshow this failure relates to the context-dependent nature of natural languageform-meaning mappings.

 

Quick Read (beta)

loading the full paper ...