Abstract
Chemical synthesis remains a critical bottleneck in the discovery and manufacture of functional small molecules. AI-based synthesis planning models could help find effective syntheses, and have made progress in recent years. However, they still struggle with reactions that are less frequent yet critical for synthetic strategy, and they produce hallucinated, incorrect predictions. This hampers multi-step search algorithms that rely on such models, and leads to misalignment with chemists' expectations. Here we propose RetroChimera, a frontier retrosynthesis model built upon two newly developed components with complementary inductive biases, which we fuse together using a new framework for integrating predictions from multiple sources via a learning-based ensembling strategy. Through experiments spanning several orders of magnitude in data scale and multiple splitting strategies, we show that RetroChimera outperforms all major models by a large margin, demonstrating robustness outside the training data and, for the first time, the ability to learn from even a very small number of examples per reaction class. Moreover, industrial organic chemists prefer predictions from RetroChimera over the reactions it was trained on in terms of quality, revealing high levels of alignment. Finally, we demonstrate zero-shot transfer to an internal dataset from a major pharmaceutical company, showing robust generalization under distribution shift. With the new dimension that our ensembling framework unlocks, we anticipate further acceleration in the development of even more accurate models.