Average precision, recall and <i>F</i><sub>1</sub> scores for all the methods and all the datasets (<i>T</i> = 2/3).

<p>Please note that DS results are averaged over only 3 datasets and thus cannot be taken into account for a fair comparison.</p>