Abstract
Recent developments in LLMs offer new opportunities for assisting authors inimproving their work. In this paper, we envision a use case where authors canreceive LLM-generated reviews that uncover weak points in the current draft.While initial methods for automated review generation already exist, thesemethods tend to produce reviews that lack detail, and they do not cover therange of opinions that human reviewers produce. To address this shortcoming, wepropose an efficient two-stage review generation framework called Reviewer2.Unlike prior work, this approach explicitly models the distribution of possibleaspects that the review may address. We show that this leads to more detailedreviews that better cover the range of aspects that human reviewers identify inthe draft. As part of the research, we generate a large-scale review dataset of27k papers and 99k reviews that we annotate with aspect prompts, which we makeavailable as a resource for future research.