Figure_3.tif (523.27 kB)

Comparison of species richness estimators.

posted on 2014-06-19, 03:55 authored by Daniel J. Laydon, Anat Melamed, Aaron Sim, Nicolas A. Gillet, Kathleen Sim, Sam Darko, J. Simon Kroll, Daniel C. Douek, David A. Price, Charles R. M. Bangham, Becca Asquith

A–D: The Chao1bc (blue), ACE (grey), Bootstrap (green), Good-Turing (black), and negative exponential (orange) estimators are applied to in silico random subsamples of observed data. Examples for HTLV-1, microbial, and TCR data are shown. Estimates systematically increase with sample size in datasets where rarefaction curves do not plateau (e.g. in I, J, K). Where rarefaction curves do plateau (e.g. in L), estimates are consistent across sample sizes. E–H: DivE (red) is applied to the same subsamples as the other estimators. Performance of DivE was evaluated by comparing each estimate (Ŝ_obs) with the (known) number of species S_obs in the full observed data (purple line), i.e. error = |S_obs - Ŝ_obs| / S_obs. In all datasets, DivE accurately estimates the species richness of the full observed data from subsamples of that data. I–L: Corresponding HTLV-1, microbial and TCR rarefaction curves; arrows denote the size of the subsample to which each estimator was applied.
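
The evaluation described in the caption (draw a random subsample, estimate richness from it, then compare with the known richness of the full data) can be sketched in Python as follows. This is a minimal illustration, not the authors' code: the function names, the toy community, and the subsample sizes are assumptions made for the example, and only the standard bias-corrected Chao1 formula is shown. DivE and the other estimators are not implemented here.

```python
import random
from collections import Counter


def chao1_bc(sample):
    """Bias-corrected Chao1: S_obs + f1*(f1 - 1) / (2*(f2 + 1)),
    where f1 and f2 are the numbers of species seen exactly once and twice."""
    abundances = Counter(sample)                # species -> count in this sample
    freq_of_freq = Counter(abundances.values())  # count -> number of species
    s_obs = len(abundances)
    f1 = freq_of_freq.get(1, 0)
    f2 = freq_of_freq.get(2, 0)
    return s_obs + f1 * (f1 - 1) / (2 * (f2 + 1))


def relative_error(s_obs_full, s_hat):
    """Error as defined in the caption: |S_obs - S_hat| / S_obs."""
    return abs(s_obs_full - s_hat) / s_obs_full


def subsample(individuals, size, rng):
    """In silico random subsample of individuals, without replacement."""
    return rng.sample(individuals, size)


# Hypothetical toy community (an assumption, not data from the figure):
# 100 species with a skewed abundance distribution.
rng = random.Random(0)
full_data = [f"sp{i}" for i in range(100) for _ in range(max(1, 200 // (i + 1)))]
s_obs_full = len(set(full_data))

for size in (100, 300):
    s_hat = chao1_bc(subsample(full_data, size, rng))
    print(size, round(s_hat, 1), round(relative_error(s_obs_full, s_hat), 3))
```

Running the loop over increasing subsample sizes is one way to see whether an estimator's output drifts with sample size, which is the behaviour the caption reports for datasets whose rarefaction curves do not plateau.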