Abstract
Retrosynthesis planning, essential in organic synthesis and drug discovery,has greatly benefited from recent AI-driven advancements. Nevertheless,existing methods frequently face limitations in both applicability andexplainability. Traditional graph-based and sequence-to-sequence models oftenlack generalized chemical knowledge, leading to predictions that are neitherconsistently accurate nor easily explainable. To address these challenges, weintroduce RetroDFM-R, a reasoning-based large language model (LLM) designedspecifically for chemical retrosynthesis. Leveraging large-scale reinforcementlearning guided by chemically verifiable rewards, RetroDFM-R significantlyenhances prediction accuracy and explainability. Comprehensive evaluationsdemonstrate that RetroDFM-R significantly outperforms state-of-the-art methods,achieving a top-1 accuracy of 65.0% on the USPTO-50K benchmark. Double-blindhuman assessments further validate the chemical plausibility and practicalutility of RetroDFM-R's predictions. RetroDFM-R also accurately predictsmultistep retrosynthetic routes reported in the literature for both real-worlddrug molecules and perovskite materials. Crucially, the model's explicitreasoning process provides human-interpretable insights, thereby enhancingtrust and practical value in real-world retrosynthesis applications.