Limits of Detecting Text Generated by Large-Scale Language Models

Abstract

Some consider large-scale language models that can generate long and coherentpieces of text as dangerous, since they may be used in misinformationcampaigns. Here we formulate large-scale language model output detection as ahypothesis testing problem to classify text as genuine or generated. We showthat error exponents for particular language models are bounded in terms oftheir perplexity, a standard measure of language generation performance. Underthe assumption that human language is stationary and ergodic, the formulationis extended from considering specific language models to considering maximumlikelihood language models, among the class of k-order Markov approximations;error probabilities are characterized. Some discussion of incorporatingsemantic side information is also given.

Quick Read (beta)

loading the full paper ...