JobHop: A Large-Scale Dataset of Career Trajectories

  • 2025-11-03 17:13:50
  • Iman Johary, Raphael Romero, Alexandru C. Mara, Tijl De Bie
  • 0

Abstract

Understanding labor market dynamics is essential for policymakers, employers,and job seekers. However, comprehensive datasets that capture real-world careertrajectories are scarce. In this paper, we introduce JobHop, a large-scalepublic dataset derived from anonymized resumes provided by VDAB, the publicemployment service in Flanders, Belgium. Utilizing Large Language Models(LLMs), we process unstructured resume data to extract structured careerinformation, which is then normalized to standardized ESCO occupation codesusing a multi-label classification model. This results in a rich dataset ofover 1.67 million work experiences, extracted from and grouped into more than361,000 user resumes and mapped to standardized ESCO occupation codes, offeringvaluable insights into real-world occupational transitions. This datasetenables diverse applications, such as analyzing labor market mobility, jobstability, and the effects of career breaks on occupational transitions. Italso supports career path prediction and other data-driven decision-makingprocesses. To illustrate its potential, we explore key dataset characteristics,including job distributions, career breaks, and job transitions, demonstratingits value for advancing labor market research.

 

Quick Read (beta)

loading the full paper ...