Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating Representative and Affinity Bias in Large Language Models

Abstract

Research on Large Language Models (LLMs) has often neglected subtle biasesthat, although less apparent, can significantly influence the models' outputstoward particular social narratives. This study addresses two such biaseswithin LLMs: representative bias, which denotes a tendency of LLMs to generateoutputs that mirror the experiences of certain identity groups, and affinitybias, reflecting the models' evaluative preferences for specific narratives orviewpoints. We introduce two novel metrics to measure these biases: theRepresentative Bias Score (RBS) and the Affinity Bias Score (ABS), and presentthe Creativity-Oriented Generation Suite (CoGS), a collection of open-endedtasks such as short story writing and poetry composition, designed withcustomized rubrics to detect these subtle biases. Our analysis uncovers markedrepresentative biases in prominent LLMs, with a preference for identitiesassociated with being white, straight, and men. Furthermore, our investigationof affinity bias reveals distinctive evaluative patterns within each model,akin to `bias fingerprints'. This trend is also seen in human evaluators,highlighting a complex interplay between human and machine bias perceptions.

Quick Read (beta)

loading the full paper ...