Reasoning-Driven Retrosynthesis Prediction with Large Language Models via Reinforcement Learning

  • 2025-07-23 12:13:06
  • Situo Zhang, Hanqi Li, Lu Chen, Zihan Zhao, Xuanze Lin, Zichen Zhu, Bo Chen, Xin Chen, Kai Yu
  • 0

Abstract

Retrosynthesis planning, essential in organic synthesis and drug discovery,has greatly benefited from recent AI-driven advancements. Nevertheless,existing methods frequently face limitations in both applicability andexplainability. Traditional graph-based and sequence-to-sequence models oftenlack generalized chemical knowledge, leading to predictions that are neitherconsistently accurate nor easily explainable. To address these challenges, weintroduce RetroDFM-R, a reasoning-based large language model (LLM) designedspecifically for chemical retrosynthesis. Leveraging large-scale reinforcementlearning guided by chemically verifiable rewards, RetroDFM-R significantlyenhances prediction accuracy and explainability. Comprehensive evaluationsdemonstrate that RetroDFM-R significantly outperforms state-of-the-art methods,achieving a top-1 accuracy of 65.0% on the USPTO-50K benchmark. Double-blindhuman assessments further validate the chemical plausibility and practicalutility of RetroDFM-R's predictions. RetroDFM-R also accurately predictsmultistep retrosynthetic routes reported in the literature for both real-worlddrug molecules and perovskite materials. Crucially, the model's explicitreasoning process provides human-interpretable insights, thereby enhancingtrust and practical value in real-world retrosynthesis applications.

 

Quick Read (beta)

loading the full paper ...