Language Model Decoding as Likelihood-Utility Alignment

  • 2023-03-16 18:54:53
  • Martin Josifoski, Maxime Peyrard, Frano Rajic, Jiheng Wei, Debjit Paul, Valentin Hartmann, Barun Patra, Vishrav Chaudhary, Emre Kıcıman, Boi Faltings, Robert West
  • 0

Abstract

A critical component of a successful language generation pipeline is thedecoding algorithm. However, the general principles that should guide thechoice of a decoding algorithm remain unclear. Previous works only comparedecoding algorithms in narrow scenarios, and their findings do not generalizeacross tasks. We argue that the misalignment between the model's likelihood andthe task-specific notion of utility is the key factor to understanding theeffectiveness of decoding algorithms. To structure the discussion, we introducea taxonomy of misalignment mitigation strategies (MMSs), providing a unifyingview of decoding as a tool for alignment. The MMS taxonomy groups decodingalgorithms based on their implicit assumptions about likelihood--utilitymisalignment, yielding general statements about their applicability acrosstasks. Specifically, by analyzing the correlation between the likelihood andthe utility of predictions across a diverse set of tasks, we provide empiricalevidence supporting the proposed taxonomy and a set of principles to structurereasoning when choosing a decoding algorithm. Crucially, our analysis is thefirst to relate likelihood-based decoding algorithms with algorithms that relyon external information, such as value-guided methods and prompting, and coversthe most diverse set of tasks to date. Code, data, and models are available athttps://github.com/epfl-dlab/understanding-decoding.

 

Quick Read (beta)

loading the full paper ...