Abstract
Deep learning-based computational methods have achieved promising results inpredicting protein-protein interactions (PPIs). However, existing benchmarkspredominantly focus on isolated pairwise evaluations, overlooking a model'scapability to reconstruct biologically meaningful PPI networks, which iscrucial for biology research. To address this gap, we introduce PRING, thefirst comprehensive benchmark that evaluates protein-protein interactionprediction from a graph-level perspective. PRING curates a high-quality,multi-species PPI network dataset comprising 21,484 proteins and 186,818interactions, with well-designed strategies to address both data redundancy andleakage. Building on this golden-standard dataset, we establish twocomplementary evaluation paradigms: (1) topology-oriented tasks, which assessintra and cross-species PPI network construction, and (2) function-orientedtasks, including protein complex pathway prediction, GO module analysis, andessential protein justification. These evaluations not only reflect the model'scapability to understand the network topology but also facilitate proteinfunction annotation, biological module detection, and even disease mechanismanalysis. Extensive experiments on four representative model categories,consisting of sequence similarity-based, naive sequence-based, protein languagemodel-based, and structure-based approaches, demonstrate that current PPImodels have potential limitations in recovering both structural and functionalproperties of PPI networks, highlighting the gap in supporting real-worldbiological applications. We believe PRING provides a reliable platform to guidethe development of more effective PPI prediction models for the community. Thedataset and source code of PRING are available athttps://github.com/SophieSarceau/PRING.