Abstract
Advances in natural language processing and large language models are drivinga major transformation in Human Capital Management, with a growing interest inbuilding smart systems based on language technologies for talent acquisition,upskilling strategies, and workforce planning. However, the adoption andprogress of these technologies critically depend on the development of reliableand fair models, properly evaluated on public data and open benchmarks, whichhave so far been unavailable in this domain. To address this gap, we present TalentCLEF 2025, the first evaluationcampaign focused on skill and job title intelligence. The lab consists of twotasks: Task A - Multilingual Job Title Matching, covering English, Spanish,German, and Chinese; and Task B - Job Title-Based Skill Prediction, in English.Both corpora were built from real job applications, carefully anonymized, andmanually annotated to reflect the complexity and diversity of real-world labormarket data, including linguistic variability and gender-marked expressions. The evaluations included monolingual and cross-lingual scenarios and coveredthe evaluation of gender bias. TalentCLEF attracted 76 registered teams with more than 280 submissions. Mostsystems relied on information retrieval techniques built with multilingualencoder-based models fine-tuned with contrastive learning, and several of themincorporated large language models for data augmentation or re-ranking. Theresults show that the training strategies have a larger effect than the size ofthe model alone. TalentCLEF provides the first public benchmark in this fieldand encourages the development of robust, fair, and transferable languagetechnologies for the labor market.