Comparing human and LLM proofreading in L2 writing: Impact on lexical and syntactic features

Abstract

This study examines the lexical and syntactic interventions of human and LLMproofreading aimed at improving overall intelligibility in identical secondlanguage writings, and evaluates the consistency of outcomes across three LLMs(ChatGPT-4o, Llama3.1-8b, Deepseek-r1-8b). Findings show that both human andLLM proofreading enhance bigram lexical features, which may contribute tobetter coherence and contextual connectedness between adjacent words. However,LLM proofreading exhibits a more generative approach, extensively reworkingvocabulary and sentence structures, such as employing more diverse andsophisticated vocabulary and incorporating a greater number of adjectivemodifiers in noun phrases. The proofreading outcomes are highly consistent inmajor lexical and syntactic features across the three models.

Quick Read (beta)

loading the full paper ...