Exploring the Limitations of Detecting Machine-Generated Text

Abstract

Recent improvements in the quality of the generations by large language models have spurred research into identifying machine-generated text. Such work often presents high-performing detectors. However, humans and machines can produce text in different styles and domains, and the impact of such variation on the performance of machine-generated text detection systems remains unclear. In this paper, we audit the classification performance for detecting machine-generated text by evaluating on texts with varying writing styles. We find that classifiers are highly sensitive to stylistic changes and differences in text complexity, and in some cases degrade entirely to random classifiers. We further find that detection systems are particularly susceptible to misclassifying easy-to-read texts while they have high performance for complex texts, leading to concerns about the reliability of detection systems. We recommend that future work attends to stylistic factors and reading difficulty levels of human-written and machine-generated text.
