figshare
Browse
13059_2020_2254_MOESM1_ESM.docx (2.49 MB)

Additional file 1 of SequencErr: measuring and suppressing sequencer errors in next-generation sequencing data

Download (2.49 MB)
journal contribution
posted on 2021-01-25, 14:40 authored by Eric M. Davis, Yu Sun, Yanling Liu, Pandurang Kolekar, Ying Shao, Karol Szlachta, Heather L. Mulder, Dongren Ren, Stephen V. Rice, Zhaoming Wang, Joy Nakitandwe, Alexander M. Gout, Bridget Shaner, Salina Hall, Leslie L. Robison, Stanley Pounds, Jeffery M. Klco, John Easton, Xiaotu Ma
Additional file 1: Supplementary Figure S1. Minimal sample effect for calculating error rates for flow cells. Supplementary Figure S2. Flow cell-level error rate of all sequencers analyzed in this study. Supplementary Figure S3. Physical location pattern of tile-level error rates across flow cells in an outlier sequencer. Supplementary Figure S4. Prevalence of outlier tiles at flow cell level. Supplementary Figure S5. Positional pattern of problematic tiles. Supplementary Figure S6. Comparison of sequencers by using a common reference DNA library. Supplementary Figure S7. Effect of removing outlier tiles on the overall sequencing error rate. Supplementary Figure S8. Effect of removing outlier tiles evaluated by fold change. Supplementary Figure S9. Benchmarking SequencErr with FastQC. Supplementary Figure S10. Effect of DNA sequencing features on sequencer error rate. Supplementary Figure S11. Comparison of SequencErr with error correction methods. Supplementary Figure S12. Application of SequencErr on non-human dataset (SARS-CoV-2). Supplementary Note 1. Illustration of flowcell architecture. Supplementary Note 2. Manually checking read name information from NCBI SRA. Supplementary Note 3. Manually checking NCBI SRA database for the suitability of public datasets for our analysis. Supplementary Note 4. Flow cell layout of HiSeq. Supplementary Note 5. Flow cell layout of NextSeq. Supplementary Note 6. Flow cell layout of NovaSeq.

Funding

Fund for Innovation in Cancer Informatics American Lebanese Syrian Associated Charities National Institutes of Health

History

Usage metrics

    Genome Biology

    Categories

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC