Abstract
LLM test-time compute (or LLM inference) via search has emerged as apromising research area with rapid developments. However, current frameworksoften adopt distinct perspectives on three key aspects (task definition, LLMprofiling, and search procedures), making direct comparisons challenging.Moreover, the search algorithms employed often diverge from standardimplementations, and their specific characteristics are not thoroughlyspecified. In this survey, we provide a comprehensive technical review thatunifies task definitions and provides modular definitions of LLM profiling andsearch procedures. The definitions enable precise comparisons of various LLMinference frameworks while highlighting their departures from conventionalsearch algorithms. We also discuss the applicability, performance, andefficiency of these methods. We have updated our content to include the latestpapers, and the differences between versions are highlighted in the appendix.For further details and ongoing updates, please refer to our GitHub repository:https://github.com/xinzhel/LLM-Agent-Survey/blob/main/search.md