Abstract
Sequential Recommendation (SR) plays a pivotal role in recommender systems by tailoring recommendations to user preferences based on their non-stationary historical interactions. Achieving high-quality performance in SR requires attention to both item representation and diversity. However, designing an SR method that simultaneously optimizes these merits remains a long-standing challenge. In this study, we address this issue by integrating recent generative Diffusion Models (DM) into SR. DM has demonstrated utility in representation learning and diverse image generation. Nevertheless, a straightforward combination of SR and DM leads to sub-optimal performance due to discrepancies in learning objectives (recommendation vs. noise reconstruction) and the respective learning spaces (non-stationary vs. stationary). To overcome this, we propose a novel framework called DimeRec (\textbf{Di}ffusion with \textbf{m}ulti-interest \textbf{e}nhanced \textbf{Rec}ommender). DimeRec synergistically combines a guidance extraction module (GEM) and a generative diffusion aggregation module (DAM). The GEM extracts crucial stationary guidance signals from the user's non-stationary interaction history, while the DAM employs a generative diffusion process conditioned on GEM's outputs to reconstruct and generate consistent recommendations. Our numerical experiments demonstrate that DimeRec significantly outperforms established baseline methods across three publicly available datasets. Furthermore, we have successfully deployed DimeRec on a large-scale short video recommendation platform, serving hundreds of millions of users. Live A/B testing confirms that our method improves both users' time spent and result diversification.