Additional file 1 of A new hybrid record linkage process to make epidemiological databases interoperable: application to the GEMO and GENEPSO studies involving BRCA1 and BRCA2 mutation carriers
Additional file 1: Table S1. Confusion matrix. Table S2. Score distribution for all record pairs comparisons between GEMO and GENEPSO in dataset 1. Table S3. Size of each dataset A after blocking. Table S4. List of matches identified by either PRL or RF. Table S5. Performance of the unsupervised machine learning models.
Funding
Institut National Du Cancer Fondation ARC pour la Recherche sur le Cancer