figshare
Browse

CMap: a database for mapping job titles, sector specialization, and promotions across 24 sectors

Version 2 2025-06-09, 15:05
Version 1 2025-06-09, 15:04
dataset
posted on 2025-06-09, 15:05 authored by Shehryar SubhaniShehryar Subhani, Shahan Ali MemonShahan Ali Memon, Bedoor AlShebli

Understanding job titles, career trajectories, and promotions provides valuable insight into labor market dynamics and professional mobility. We present Career Map (CMap), a novel dataset spanning 24 industry sectors, systematically structured to study job specialization, sector concentration, and career advancements. Using advanced natural language processing techniques and large language models, we standardize 6.2 million job titles into 109 thousand unique titles and introduce a Specialization Index to quantify how specialized a title is within its sector. The dataset includes both a structured job titles dataset and a set of identified promotions—30 thousand validated promotions from the United States and the United Kingdom, and 72 thousand inferred promotions from a global context. It enables research on job hierarchies, workforce mobility and systemic inequalities in professional advancement. By providing insights into career progression patterns, labor market structures, and the impact of education and experience, this dataset serves as a valuable resource for economists, sociologists, and computational researchers studying employment trends across industries and regions.

This repository contains the code necessary to recreate Figure 4 and Table 4 from the original manuscript.

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC