Step-by-step Instructions and a Simple Tabular Output Format Improve the Dependency Parsing Accuracy of LLMs

  • 2025-06-16 07:09:38
  • Hiroshi Matsuda, Chunpeng Ma, Masayuki Asahara
  • 0

Abstract

Recent advances in large language models (LLMs) have enabled impressiveperformance in various tasks. However, standard prompting often struggles toproduce structurally valid and accurate outputs, especially in dependencyparsing. We propose a novel step-by-step instruction strategy, where universalpart-of-speech tagging precedes the prediction of syntactic heads anddependency labels, and a simplified CoNLL-U like output format, our methodachieves state-of-the-art accuracy on Universal Dependencies datasets across 17languages without hallucination or contamination. We further show thatmultilingual fine-tuning simultaneously improves cross-language generalizationperformance. Our results highlight the effectiveness of explicit reasoningsteps in LLM-based parsing and offer a scalable, format-consistent alternativeto bracket-based approaches.

 

Quick Read (beta)

loading the full paper ...