Figure_4.tif (1021.31 kB)

Predictive accuracy of phylogenetic profiling when we control for the influence of the Open World Assumption.

Figure posted on 2015-02-13, 17:50, authored by Nives Škunca and Christophe Dessimoz

Colours distinguish two sets of experiments: those in which we include only the well-annotated proteins (purple) and those in which we randomly remove 60% of the available annotations (red). Line style distinguishes how the genomes were sub-selected: at random (full lines) or so as to keep maximum diversity among the selected genomes (dashed lines). Each dot represents the mean AUPRC across the GO terms used for annotation. The final point denotes the mean AUPRC when we include all the bacteria available in the OMA database release used (1078 bacteria).
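As a rough illustration of the quantities named in the caption, the sketch below computes a mean AUPRC over a set of GO terms and randomly removes a fraction of annotations. It is a minimal sketch under stated assumptions, not the authors' pipeline: AUPRC is approximated here by average precision, the function names (`average_precision`, `mean_auprc`, `remove_annotations`) and the data layout are hypothetical, and the toy scores and labels are made up for the example.

```python
import random


def average_precision(scores, labels):
    """Average precision, a step-wise estimate of the area under the PR curve.

    Assumption: the source does not say how AUPRC was computed; this is one
    common estimator. Tied scores keep their input order.
    """
    # Rank items from the highest to the lowest predicted score.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    total_positives = sum(1 for label in labels if label)
    if total_positives == 0:
        raise ValueError("need at least one positive label")
    hits = 0
    precision_sum = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label:
            hits += 1
            # Precision at the rank of each true positive.
            precision_sum += hits / rank
    return precision_sum / total_positives


def mean_auprc(per_term):
    """Mean of per-term AUPRC; per_term maps a term to (scores, labels)."""
    values = [average_precision(s, l) for s, l in per_term.values()]
    return sum(values) / len(values)


def remove_annotations(annotations, fraction, seed=0):
    """Randomly drop `fraction` of (protein, term) pairs (hypothetical layout)."""
    rng = random.Random(seed)
    pairs = sorted(annotations)  # sort for reproducibility with a fixed seed
    keep = len(pairs) - round(len(pairs) * fraction)
    return set(rng.sample(pairs, keep))


if __name__ == "__main__":
    # Toy data: two placeholder terms with made-up scores and labels.
    toy = {
        "term_a": ([0.9, 0.1], [1, 0]),
        "term_b": ([0.9, 0.8, 0.7, 0.6], [1, 0, 1, 0]),
    }
    print(mean_auprc(toy))

    toy_annotations = {(f"protein_{i}", "term_a") for i in range(10)}
    print(len(remove_annotations(toy_annotations, 0.6)))
```

In this sketch, each dot in the figure would correspond to one call to `mean_auprc` for a given number of genomes, and the red series to the same evaluation after `remove_annotations(..., 0.6)`.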
