AUROC values for tissue-specific regulatory sequence prediction on validation sets.

journal contribution
posted on 01.12.2020, 18:30 by Louisa-Marie Krützfeldt, Max Schubach, Martin Kircher

Models were trained on DHS sequences (positives) with corresponding sets of negative sequences and evaluated on a tissue-specific chromosome 21 test set. For each classifier, two different negative training sets are compared: negative sequences were either sampled from the genomic background (tGC = 0.1) or generated by shuffling the positive sequences while preserving k-mer counts (k = 7). The AUROC was calculated for each model to compare performance.
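For readers unfamiliar with the metric, the following is a minimal sketch of how an AUROC can be computed from classifier scores, using the rank-based (Mann-Whitney U) formulation. It is not the authors' evaluation code; the function name `auroc` and the example labels and scores are hypothetical and chosen only for illustration.

```python
from typing import Sequence


def auroc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Compute AUROC from binary labels (1 = positive, 0 = negative)
    and prediction scores, using average ranks to handle tied scores."""
    n = len(labels)
    if n != len(scores):
        raise ValueError("labels and scores must have the same length")

    # Sort indices by ascending score and assign 1-based average ranks.
    order = sorted(range(n), key=lambda i: scores[i])
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        # Extend j over the run of scores tied with position i.
        while j + 1 < n and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1

    n_pos = sum(1 for y in labels if y == 1)
    n_neg = n - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("AUROC needs at least one positive and one negative")

    # Mann-Whitney U of the positives, normalised by the number of
    # positive/negative pairs, equals the area under the ROC curve.
    rank_sum_pos = sum(r for r, y in zip(ranks, labels) if y == 1)
    u_stat = rank_sum_pos - n_pos * (n_pos + 1) / 2
    return u_stat / (n_pos * n_neg)


# Hypothetical example: 1 = DHS sequence, 0 = negative sequence.
example_labels = [1, 0, 1, 0]
example_scores = [0.9, 0.8, 0.3, 0.1]
print(auroc(example_labels, example_scores))
```

An AUROC of 0.5 corresponds to random ranking and 1.0 to perfect separation of positives from negatives, which is why a single value per model allows the two negative-set strategies to be compared directly.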