Abstract
Prompt optimization is essential for effective utilization of large languagemodels (LLMs) across diverse tasks. While existing optimization methods areeffective in optimizing short prompts, they struggle with longer, more complexones, often risking information loss and being sensitive to smallperturbations. To address these challenges, we propose SCULPT (SystematicTuning of Long Prompts), a framework that treats prompt optimization as ahierarchical tree refinement problem. SCULPT represents prompts as treestructures, enabling targeted modifications while preserving contextualintegrity. It employs a Critic-Actor framework that generates reflections andapplies actions to refine the prompt. Evaluations demonstrate SCULPT'seffectiveness on long prompts, its robustness to adversarial perturbations, andits ability to generate high-performing prompts even without any initialhuman-written prompt. Compared to existing state of the art methods, SCULPTconsistently improves LLM performance by preserving essential task informationwhile applying structured refinements. Both qualitative and quantitativeanalyses show that SCULPT produces more stable and interpretable promptmodifications, ensuring better generalization across tasks.