figshare
Browse

Data_Sheet_1_Fast and Accurate Classification of Meta-Genomics Long Reads With deSAMBA.docx

Download (587.84 kB)
dataset
posted on 2021-04-28, 04:40 authored by Gaoyang Li, Yongzhuang Liu, Deying Li, Bo Liu, Junyi Li, Yang Hu, Yadong Wang

There is still a lack of fast and accurate classification tools to identify the taxonomies of noisy long reads, which is a bottleneck to the use of the promising long-read metagenomic sequencing technologies. Herein, we propose de Bruijn graph-based Sparse Approximate Match Block Analyzer (deSAMBA), a tailored long-read classification approach that uses a novel pseudo alignment algorithm based on sparse approximate match block (SAMB). Benchmarks on real sequencing datasets demonstrate that deSAMBA enables to achieve high yields and fast speed simultaneously, which outperforms state-of-the-art tools and has many potentials to cutting-edge metagenomics studies.

History