Multilingual Grammatical Error Annotation: Combining Language-Agnostic Framework with Language-Specific Flexibility

  • 2025-06-09 14:01:19
  • Mengyang Qiu, Tran Minh Nguyen, Zihao Huang, Zelong Li, Yang Gu, Qingyu Gao, Siliang Liu, Jungyeul Park

Abstract

Grammatical Error Correction (GEC) relies on accurate error annotation and evaluation, yet existing frameworks, such as $\texttt{errant}$, face limitations when extended to typologically diverse languages. In this paper, we introduce a standardized, modular framework for multilingual grammatical error annotation. Our approach combines a language-agnostic foundation with structured language-specific extensions, enabling both consistency and flexibility across languages. We reimplement $\texttt{errant}$ using $\texttt{stanza}$ to support broader multilingual coverage, and demonstrate the framework's adaptability through applications to English, German, Czech, Korean, and Chinese, ranging from general-purpose annotation to more customized linguistic refinements. This work supports scalable and interpretable GEC annotation across languages and promotes more consistent evaluation in multilingual settings. The complete codebase and annotation tools can be accessed at https://github.com/open-writing-evaluation/jp_errant_bea.

 
