VERITAS: A Unified Approach to Reliability Evaluation

Abstract

Large language models (LLMs) often fail to synthesize information from theircontext to generate an accurate response. This renders them unreliable inknowledge intensive settings where reliability of the output is key. A criticalcomponent for reliable LLMs is the integration of a robust fact-checking systemthat can detect hallucinations across various formats. While severalopen-access fact-checking models are available, their functionality is oftenlimited to specific tasks, such as grounded question-answering or entailmentverification, and they perform less effectively in conversational settings. Onthe other hand, closed-access models like GPT-4 and Claude offer greaterflexibility across different contexts, including grounded dialogueverification, but are hindered by high costs and latency. In this work, weintroduce VERITAS, a family of hallucination detection models designed tooperate flexibly across diverse contexts while minimizing latency and costs.VERITAS achieves state-of-the-art results considering average performance onall major hallucination detection benchmarks, with $10\%$ increase in averageperformance when compared to similar-sized models and get close to theperformance of GPT4 turbo with LLM-as-a-judge setting.

Quick Read (beta)

loading the full paper ...