OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

  • 2025-04-09 18:59:35
  • Jiacheng Liu, Taylor Blanton, Yanai Elazar, Sewon Min, YenSung Chen, Arnavi Chheda-Kothary, Huy Tran, Byron Bischoff, Eric Marsh, Michael Schmitz, Cassidy Trier, Aaron Sarnat, Jenna James, Jon Borchardt, Bailey Kuehl, Evie Cheng, Karen Farley, Sruthi Sreeram, Taira Anderson, David Albright, Carissa Schoenick, Luca Soldaini, Dirk Groeneveld, Rock Yuren Pang, Pang Wei Koh, Noah A. Smith, Sophie Lebrecht, Yejin Choi, Hannaneh Hajishirzi, Ali Farhadi, Jesse Dodge

Abstract

We present OLMoTrace, the first system that traces the outputs of language models back to their full, multi-trillion-token training data in real time. OLMoTrace finds and shows verbatim matches between segments of language model output and documents in the training text corpora. Powered by an extended version of infini-gram (Liu et al., 2024), our system returns tracing results within a few seconds. OLMoTrace can help users understand the behavior of language models through the lens of their training data. We showcase how it can be used to explore fact checking, hallucination, and the creativity of language models. OLMoTrace is publicly available and fully open-source.

 
