Knowledge Distillation Detection for Open-weights Models

  • 2025-10-02 17:59:14
  • Qin Shi, Amber Yijia Zheng, Qifan Song, Raymond A. Yeh
  • 0

Abstract

We propose the task of knowledge distillation detection, which aims todetermine whether a student model has been distilled from a given teacher,under a practical setting where only the student's weights and the teacher'sAPI are available. This problem is motivated by growing concerns about modelprovenance and unauthorized replication through distillation. To address thistask, we introduce a model-agnostic framework that combines data-free inputsynthesis and statistical score computation for detecting distillation. Ourapproach is applicable to both classification and generative models.Experiments on diverse architectures for image classification and text-to-imagegeneration show that our method improves detection accuracy over the strongestbaselines by 59.6% on CIFAR-10, 71.2% on ImageNet, and 20.0% for text-to-imagegeneration. The code is available athttps://github.com/shqii1j/distillation_detection.

 

Quick Read (beta)

loading the full paper ...