Abstract
Large Language Models (LLMs) are leading a new technological revolution as one of the most promising research streams toward artificial general intelligence. The scaling of these models, accomplished by increasing the number of parameters and the magnitude of the training datasets, has been linked to various so-called emergent abilities that were previously unobserved. These emergent abilities, ranging from advanced reasoning and in-context learning to coding and problem-solving, have sparked an intense scientific debate: Are they truly emergent, or do they simply depend on external factors, such as training dynamics, the type of problems, or the chosen metric? What underlying mechanism causes them? Despite their transformative potential, emergent abilities remain poorly understood, leading to misconceptions about their definition, nature, predictability, and implications. In this work, we shed light on emergent abilities by conducting a comprehensive review of the phenomenon, addressing both its scientific underpinnings and real-world consequences. We first critically analyze existing definitions, exposing inconsistencies in conceptualizing emergent abilities. We then explore the conditions under which these abilities appear, evaluating the role of scaling laws, task complexity, pre-training loss, quantization, and prompting strategies. Our review extends beyond traditional LLMs and includes Large Reasoning Models (LRMs), which leverage reinforcement learning and inference-time search to amplify reasoning and self-reflection. However, emergence is not inherently positive. As AI systems gain autonomous reasoning capabilities, they also develop harmful behaviors, including deception, manipulation, and reward hacking. We highlight growing concerns about safety and governance, emphasizing the need for better evaluation frameworks and regulatory oversight.