Step-by-step Instructions and a Simple Tabular Output Format Improve the Dependency Parsing Accuracy of LLMs

Abstract

Recent advances in large language models (LLMs) have enabled impressiveperformance in various tasks. However, standard prompting often struggles toproduce structurally valid and accurate outputs, especially in dependencyparsing. We propose a novel step-by-step instruction strategy, where universalpart-of-speech tagging precedes the prediction of syntactic heads anddependency labels, and a simplified CoNLL-U like output format, our methodachieves state-of-the-art accuracy on Universal Dependencies datasets across 17languages without hallucination or contamination. We further show thatmultilingual fine-tuning simultaneously improves cross-language generalizationperformance. Our results highlight the effectiveness of explicit reasoningsteps in LLM-based parsing and offer a scalable, format-consistent alternativeto bracket-based approaches.

Quick Read (beta)

loading the full paper ...