Abstract
Current LLM-driven approaches that use test-time computing to generate PDE solvers execute a large number of solver samples to identify high-accuracy solvers. These paradigms are especially costly for complex PDEs requiring substantial computational resources for numerical evaluation. We introduce PDE-SHARP, a framework that reduces computational costs by replacing expensive scientific computation with cheaper LLM inference, achieving superior solver accuracy with 60-75% fewer computational evaluations. PDE-SHARP employs three stages: (1) Analysis: mathematical chain-of-thought analysis including PDE classification, solution type detection, and stability analysis; (2) Genesis: solver generation based on mathematical insights from the previous stage; and (3) Synthesis: collaborative selection-hybridization tournaments in which LLM judges iteratively refine implementations through flexible performance feedback. To generate high-quality solvers, PDE-SHARP requires fewer than 13 solver evaluations on average, compared to 30+ for baseline methods, while improving accuracy uniformly across tested PDEs by $4\times$ on average. It also demonstrates robust performance across LLM architectures, from general-purpose to specialized reasoning models.
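The three-stage flow described above can be illustrated with a minimal sketch. It is not the PDE-SHARP implementation: the function `pde_sharp_sketch`, the `Candidate` type, the parameters `n_candidates`, `n_rounds`, and `n_finalists`, and all `toy_*` stubs are hypothetical names introduced here for illustration. The stubs stand in for LLM calls and for the numerical solver run. The sketch only shows how a judge that selects and hybridizes candidates before anything is executed can keep the number of expensive evaluations below the number of generated solvers.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Candidate:
    code: str                     # solver source text (hypothetical representation)
    error: float = float("inf")   # measured error; inf until the solver is evaluated


def pde_sharp_sketch(
    pde: str,
    analyze: Callable[[str], str],
    generate: Callable[[str, str, int], List[str]],
    judge: Callable[[List[Candidate], List[Tuple[str, float]], int], List[Candidate]],
    evaluate: Callable[[str], float],
    n_candidates: int = 8,
    n_rounds: int = 3,
    n_finalists: int = 2,
):
    # Stage 1 (Analysis): cheap LLM reasoning about the PDE, no solver runs.
    insights = analyze(pde)
    # Stage 2 (Genesis): generate candidate solvers informed by the analysis.
    pool = [Candidate(code=c) for c in generate(pde, insights, n_candidates)]

    best = None
    evaluations = 0
    feedback: List[Tuple[str, float]] = []
    # Stage 3 (Synthesis): judge-driven selection and hybridization rounds.
    for _ in range(n_rounds):
        finalists = judge(pool, feedback, n_finalists)
        for cand in finalists:
            # Only finalists incur the expensive numerical evaluation.
            cand.error = evaluate(cand.code)
            evaluations += 1
            if best is None or cand.error < best.error:
                best = cand
        # Performance feedback is passed to the judge in the next round.
        feedback = [(c.code, c.error) for c in finalists]
        pool = finalists
    return best, evaluations


# Toy stand-ins (assumptions): a real system would call an LLM and run solvers.
def toy_analyze(pde: str) -> str:
    return f"classification and stability notes for: {pde}"


def toy_generate(pde: str, insights: str, n: int) -> List[str]:
    return [f"solver_{i}" for i in range(n)]


def toy_judge(pool, feedback, k):
    if not feedback:
        return pool[:k]  # first round: select without running anything
    ranked = sorted(feedback, key=lambda pair: pair[1])
    # Later rounds: "hybridize" by refining the best evaluated candidates.
    return [Candidate(code=code + "+refined") for code, _ in ranked[:k]]


def toy_evaluate(code: str) -> float:
    # Deterministic toy error that shrinks with each refinement.
    return 1.0 / (1 + code.count("+refined"))


if __name__ == "__main__":
    best, n_evals = pde_sharp_sketch(
        "1D advection", toy_analyze, toy_generate, toy_judge, toy_evaluate
    )
    print(best.code, best.error, n_evals)
```

With these toy stubs, eight candidates are generated but only six evaluations are performed (two finalists in each of three rounds). The actual budgets and judging strategy used by PDE-SHARP are not specified in this abstract.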