Post-training for Efficient Communication via Convention Formation

  • 2025-08-08 17:42:16
  • Yilun Hua, Evan Wang, Yoav Artzi
  • 0

Abstract

Humans communicate with increasing efficiency in multi-turn interactions, byadapting their language and forming ad-hoc conventions. In contrast, prior workshows that LLMs do not naturally show this behavior. We develop a post-trainingprocess to develop this ability through targeted fine-tuning on heuristicallyidentified demonstrations of convention formation. We evaluate with two newbenchmarks focused on this capability. First, we design a focused,cognitively-motivated interaction benchmark that consistently elicits strongconvention formation trends in humans. Second, we create a newdocument-grounded reference completion task that reflects in-the-wildconvention formation behavior. Our studies show significantly improvedconvention formation abilities in post-trained LLMs across the two evaluationmethods.

 

Quick Read (beta)

loading the full paper ...