MiniOneRec: An Open-Source Framework for Scaling Generative Recommendation

Abstract

The recent success of large language models (LLMs) has renewed interest in whether recommender systems can achieve similar scaling benefits. Conventional recommenders, dominated by massive embedding tables, tend to plateau as embedding dimensions grow. In contrast, the emerging generative paradigm replaces embeddings with compact Semantic ID (SID) sequences produced by autoregressive Transformers. Yet most industrial deployments remain proprietary, leaving two fundamental questions open: (1) Do the expected scaling laws hold on public benchmarks? (2) What is the minimal post-training recipe that enables competitive performance? We present MiniOneRec, to the best of our knowledge, the first fully open-source generative recommendation framework, which provides an end-to-end workflow spanning SID construction, supervised fine-tuning, and recommendation-oriented reinforcement learning. We generate SIDs via a Residual Quantized VAE and post-train Qwen backbones ranging from 0.5B to 7B parameters on the Amazon Review dataset. Our experiments reveal a consistent downward trend in both training and evaluation losses with increasing model size, validating the parameter efficiency of the generative approach. To further enhance performance, we propose a lightweight yet effective post-training pipeline that (1) enforces full-process SID alignment and (2) applies reinforcement learning with constrained decoding and hybrid rewards. Together, these techniques yield significant improvements in both ranking accuracy and candidate diversity.
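
To make the notion of a Semantic ID concrete, the following is a minimal sketch of the residual quantization step alone, assuming the codebooks are already fixed. It is not the MiniOneRec implementation: a Residual Quantized VAE also learns an encoder, a decoder, and the codebooks themselves, none of which is shown here. The function names `nearest_code` and `residual_quantize` and the toy codebooks are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence

Vector = Sequence[float]


def nearest_code(codebook: Sequence[Vector], vector: Vector) -> int:
    """Return the index of the codebook entry closest to `vector` (squared L2)."""
    best_index = 0
    best_distance = float("inf")
    for index, code in enumerate(codebook):
        distance = sum((a - b) ** 2 for a, b in zip(vector, code))
        if distance < best_distance:
            best_index, best_distance = index, distance
    return best_index


def residual_quantize(vector: Vector, codebooks: Sequence[Sequence[Vector]]) -> List[int]:
    """Map an item embedding to one code index per level (its SID, in this sketch).

    Each level quantizes what the previous levels left unexplained.
    """
    residual = list(vector)
    sid = []
    for codebook in codebooks:
        index = nearest_code(codebook, residual)
        sid.append(index)
        # Subtract the chosen code so the next level sees only the remainder.
        residual = [r - c for r, c in zip(residual, codebook[index])]
    return sid


# Hypothetical two-level codebooks over 2-D embeddings (assumption, not from the paper).
toy_codebooks = [
    [[0.0, 0.0], [1.0, 1.0]],
    [[0.0, 0.0], [0.5, -0.5]],
]
print(residual_quantize([1.4, 0.6], toy_codebooks))  # prints [1, 1]
```

In this toy setting each item becomes a short sequence of small integers, which is the kind of compact token sequence an autoregressive model can be trained to generate in place of looking up a row in a large embedding table.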
