Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey

Abstract

Detecting out-of-distribution (OOD) samples is crucial for ensuring thesafety of machine learning systems and has shaped the field of OOD detection.Meanwhile, several other problems are closely related to OOD detection,including anomaly detection (AD), novelty detection (ND), open set recognition(OSR), and outlier detection (OD). To unify these problems, a generalized OODdetection framework was proposed, taxonomically categorizing these fiveproblems. However, Vision Language Models (VLMs) such as CLIP havesignificantly changed the paradigm and blurred the boundaries between thesefields, again confusing researchers. In this survey, we first present ageneralized OOD detection v2, encapsulating the evolution of AD, ND, OSR, OODdetection, and OD in the VLM era. Our framework reveals that, with some fieldinactivity and integration, the demanding challenges have become OOD detectionand AD. In addition, we also highlight the significant shift in the definition,problem settings, and benchmarks; we thus feature a comprehensive review of themethodology for OOD detection, including the discussion over other relatedtasks to clarify their relationship to OOD detection. Finally, we explore theadvancements in the emerging Large Vision Language Model (LVLM) era, such asGPT-4V. We conclude this survey with open challenges and future directions.

Quick Read (beta)

loading the full paper ...