Abstract
With the rapid advancement of AI-generated videos, there is an urgent needfor effective detection tools to mitigate societal risks such as misinformationand reputational harm. In addition to accurate classification, it is essentialthat detection models provide interpretable explanations to ensure transparencyfor regulators and end users. To address these challenges, we introduceVidGuard-R1, the first video authenticity detector that fine-tunes amulti-modal large language model (MLLM) using group relative policyoptimization (GRPO). Our model delivers both highly accurate judgments andinsightful reasoning. We curate a challenging dataset of 140k real andAI-generated videos produced by state-of-the-art generation models, carefullydesigning the generation process to maximize discrimination difficulty. We thenfine-tune Qwen-VL using GRPO with two specialized reward models that targettemporal artifacts and generation complexity. Extensive experiments demonstratethat VidGuard-R1 achieves state-of-the-art zero-shot performance on existingbenchmarks, with additional training pushing accuracy above 95%. Case studiesfurther show that VidGuard-R1 produces precise and interpretable rationalesbehind its predictions. The code is publicly available athttps://VidGuard-R1.github.io.