Abstract
Goal-oriented proactive dialogue systems are designed to guide user conversations seamlessly towards specific objectives by planning a goal-oriented path. However, previous research has focused predominantly on optimizing these paths while neglecting the inconsistencies that may arise between generated responses and dialogue contexts, including user profiles, dialogue history, domain knowledge, and subgoals. To address this issue, we introduce a model-agnostic two-stage Consistency Reflection and Correction (CRC) framework. Specifically, in the consistency reflection stage, the model is prompted to reflect on the discrepancies between generated responses and dialogue contexts, identifying inconsistencies and suggesting possible corrections. In the consistency correction stage, the model generates responses that are more consistent with the dialogue context based on these reflection results. We conducted experiments on various model architectures with different parameter sizes, including encoder-decoder models (BART, T5) and decoder-only models (GPT-2, DialoGPT, Phi3, Mistral, and LLaMA3), and the experimental results on three datasets demonstrate that our CRC framework significantly improves the consistency between generated responses and dialogue contexts.
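The two-stage flow described above can be illustrated with a minimal sketch. It is not the implementation used in this work: the `DialogueContext` container, the `crc_respond` function, and all prompt wordings are hypothetical, and `generate` stands in for any text-generation model (encoder-decoder or decoder-only) exposed as a string-to-string callable.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class DialogueContext:
    """Hypothetical container for the four context elements named in the abstract."""
    user_profile: str
    history: List[str]
    knowledge: str
    subgoal: str

    def render(self) -> str:
        # Flatten the context into one prompt prefix (format is an assumption).
        return (
            f"Profile: {self.user_profile}\n"
            f"History: {' | '.join(self.history)}\n"
            f"Knowledge: {self.knowledge}\n"
            f"Subgoal: {self.subgoal}"
        )


def crc_respond(generate: Callable[[str], str], ctx: DialogueContext) -> str:
    """Draft a response, reflect on its consistency, then correct it."""
    context = ctx.render()

    # Initial draft response from the underlying model.
    draft = generate(f"{context}\nRespond to the user.")

    # Stage 1 (consistency reflection): ask the model to identify
    # inconsistencies between the draft and the context and to suggest fixes.
    reflection = generate(
        f"{context}\nDraft: {draft}\n"
        "List inconsistencies between the draft and the context, "
        "and suggest corrections."
    )

    # Stage 2 (consistency correction): regenerate the response
    # conditioned on the reflection result.
    return generate(
        f"{context}\nDraft: {draft}\nReflection: {reflection}\n"
        "Rewrite the draft so it is consistent with the context."
    )
```

Because the sketch only depends on a callable, the same control flow could wrap any of the model families listed above; how the reflection and correction stages are actually trained or prompted is not specified here and is left open.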