HALO: An Ontology for Representing Hallucinations in Generative Models

Abstract

Recent progress in generative AI, including large language models (LLMs) likeChatGPT, has opened up significant opportunities in fields ranging from naturallanguage processing to knowledge discovery and data mining. However, there isalso a growing awareness that the models can be prone to problems such asmaking information up or `hallucinations', and faulty reasoning on seeminglysimple problems. Because of the popularity of models like ChatGPT, bothacademic scholars and citizen scientists have documented hallucinations ofseveral different types and severity. Despite this body of work, a formal modelfor describing and representing these hallucinations (with relevant meta-data)at a fine-grained level, is still lacking. In this paper, we address this gapby presenting the Hallucination Ontology or HALO, a formal, extensible ontologywritten in OWL that currently offers support for six different types ofhallucinations known to arise in LLMs, along with support for provenance andexperimental metadata. We also collect and publish a dataset containinghallucinations that we inductively gathered across multiple independent Websources, and show that HALO can be successfully used to model this dataset andanswer competency questions.

Quick Read (beta)

loading the full paper ...