figshare
Browse
1/1
6 files

HAM10000 expert-annotated explanations and pseudonymized reader study data.

Download all (5.25 MB) This item is shared privately
dataset
modified on 2024-02-28, 15:49

Reader study was conducted in 3 phases: no ai (phase 1), ai support (phase 2), xai support (phase 3). The data generated in each phase is available in metadata_phase1.csv, metadata_phase2.csv, and metadata_phase3.csv.

There are 113 unique participants in phase 1. Additional participants were added in phase 2, resulting in 116 unique clinicians in phases 2 and 3.

Important: The 3rd and 13th image in each group are identical. Be careful when performing table joins as the duplicate image_ids can affect them. In metadata_phase1.csv, the AI predictions for the 13th image in each group are null. Please take that into account when performing analysis. In metadata_phase2 and metadata_phase3, the AI predictions for the repeating images are not omitted.


participant: Each clinician was assigned a participant Id represented by the participant column.

group: Each clinician was randomly assigned to a group. Each group was assigned mutually exclusive sets of images.

mask: An internal identifier used for the images. Can be ignored.

benign_malignant: ground truth diagnosis.

prediction: Diagnosis chosen by the clinician. 1 represents melanoma, 0 represents nevus, 0.5 represents a nevus diagnosis but the clinician chose to excise.

confidence: Confidence value entered by the clinician.

trust: Trust value entered by the clinician.

AI_prediction: Diagnosis predicted by the AI. 1 represents melanoma and 0 represents nevus.

language: Language chosen by the clinician.