RecGPT Technical Report

  • 2025-07-30 17:55:06
  • Chao Yi, Dian Chen, Gaoyang Guo, Jiakai Tang, Jian Wu, Jing Yu, Sunhao Dai, Wen Chen, Wenjun Yang, Yuning Jiang, Zhujin Gao, Bo Zheng, Chi Li, Dimin Wang, Dixuan Wang, Fan Li, Fan Zhang, Haibin Chen, Haozhuang Liu, Jialin Zhu, Jiamang Wang, Jiawei Wu, Jin Cui, Ju Huang, Kai Zhang, Kan Liu, Lang Tian, Liang Rao, Longbin Li, Lulu Zhao, Mao Zhang, Na He, Peiyang Wang, Qiqi Huang, Tao Luo, Wenbo Su, Xiaoxiao He, Xin Tong, Xu Chen, Xunke Xi, Yang Li, Yaxuan Wu, Yeqiu Yang, Yi Hu, Yinnan Song, Yuchen Li, Yujie Luo, Yujin Yuan, Yuliang Yan, Zhengyang Wang, Zhibo Xiao, Zhixin Ma, Zile Zhou
  • 0

Abstract

Recommender systems are among the most impactful applications of artificialintelligence, serving as critical infrastructure connecting users, merchants,and platforms. However, most current industrial systems remain heavily relianton historical co-occurrence patterns and log-fitting objectives, i.e.,optimizing for past user interactions without explicitly modeling user intent.This log-fitting approach often leads to overfitting to narrow historicalpreferences, failing to capture users' evolving and latent interests. As aresult, it reinforces filter bubbles and long-tail phenomena, ultimatelyharming user experience and threatening the sustainability of the wholerecommendation ecosystem. To address these challenges, we rethink the overall design paradigm ofrecommender systems and propose RecGPT, a next-generation framework that placesuser intent at the center of the recommendation pipeline. By integrating largelanguage models (LLMs) into key stages of user interest mining, item retrieval,and explanation generation, RecGPT transforms log-fitting recommendation intoan intent-centric process. To effectively align general-purpose LLMs to theabove domain-specific recommendation tasks at scale, RecGPT incorporates amulti-stage training paradigm, which integrates reasoning-enhancedpre-alignment and self-training evolution, guided by a Human-LLM cooperativejudge system. Currently, RecGPT has been fully deployed on the Taobao App.Online experiments demonstrate that RecGPT achieves consistent performancegains across stakeholders: users benefit from increased content diversity andsatisfaction, merchants and the platform gain greater exposure and conversions.These comprehensive improvement results across all stakeholders validates thatLLM-driven, intent-centric design can foster a more sustainable and mutuallybeneficial recommendation ecosystem.

 

Quick Read (beta)

loading the full paper ...