SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions

Abstract

Concept Bottleneck Models (CBMs) and other concept-based interpretable models show great promise for making AI applications more transparent, which is essential in fields like medicine. Despite their success, we demonstrate that CBMs struggle to reliably identify the correct concepts under distribution shifts. To assess the robustness of CBMs to concept variations, we introduce SUB: a fine-grained image and concept benchmark containing 38,400 synthetic images based on the CUB dataset. To create SUB, we select a CUB subset of 33 bird classes and 45 concepts to generate images which substitute a specific concept, such as wing color or belly pattern. We introduce a novel Tied Diffusion Guidance (TDG) method to precisely control generated images, where noise sharing for two parallel denoising processes ensures that both the correct bird class and the correct attribute are generated. This novel benchmark enables rigorous evaluation of CBMs and similar interpretable models, contributing to the development of more robust methods. Our code is available at https://github.com/ExplainableML/sub and the dataset at http://huggingface.co/datasets/Jessica-bader/SUB.
