Abstract
Personalized preference alignment for large language models (LLMs), theprocess of tailoring LLMs to individual users' preferences, is an emergingresearch direction spanning the area of NLP and personalization. In thissurvey, we present an analysis of works on personalized alignment and modelingfor LLMs. We introduce a taxonomy of preference alignment techniques, includingtraining time, inference time, and additionally, user-modeling based methods.We provide analysis and discussion on the strengths and limitations of eachgroup of techniques and then cover evaluation, benchmarks, as well as openproblems in the field.
Quick Read (beta)
loading the full paper ...