figshare
Browse
uasa_a_1375931_sm9799.pdf (1.19 MB)

Controlling the FDR in Imperfect Matches to an Incomplete Database

Download (1.19 MB)
Version 2 2018-06-28, 20:06
Version 1 2017-09-29, 20:31
journal contribution
posted on 2018-06-28, 20:06 authored by Uri Keich, William Stafford Noble

We consider the problem of controlling the false discovery rate (FDR) among discoveries from searching an incomplete database. This problem differs from the classical multiple testing setting because there are two different types of false discoveries: those arising from objects that have no match in the database and those that are incorrectly matched. We show that commonly used FDR controlling procedures are inadequate for this setup, a special case of which is tandem mass spectrum identification. We then derive a novel FDR controlling approach which extensive simulations suggest is unbiased. We also compare its performance with problem-specific as well as general FDR controlling procedures using both simulated and real mass spectrometry data.

Funding

This work is supported in part by NIH award P41 GM103533.

History