35 files

agnostosDB_dbf02445-20200519

Version 24 2022-06-15, 06:59
Version 23 2021-12-06, 11:16
Version 22 2021-11-08, 16:12
Version 21 2021-10-19, 13:15
Version 20 2021-10-07, 08:39
Version 19 2021-10-07, 08:16
Version 18 2021-10-06, 15:44
Version 17 2021-10-06, 09:13
Version 16 2021-05-18, 10:34
Version 15 2021-03-25, 08:14
Version 14 2021-03-25, 07:36
Version 13 2021-03-10, 13:55
Version 12 2020-11-12, 10:15
Version 11 2020-09-22, 13:23
Version 10 2020-09-22, 13:06
Version 9 2020-08-07, 09:50
Version 8 2020-07-31, 16:54
Version 7 2020-07-30, 06:56
Version 6 2020-07-20, 09:30
Version 5 2020-07-09, 08:10
dataset
posted on 2022-06-15, 06:59, authored by Chiara Vanni, Antonio Fernandez-Guerra
The agnostosDB (dbf02445-20200519) is a comprehensive dataset of microbial gene clusters (GCs) from genomes and metagenomes. It contains 5,287,759 GCs and more than 280 million genes from the bacterial and archaeal genomes of the Genome Taxonomy Database (GTDB) and from five large-scale metagenomic projects: 583 marine metagenomes from the Tara Oceans expedition (TARA), the Malaspina expedition, Ocean Sampling Day (OSD), and the Global Ocean Sampling Expedition (GOS), complemented by 1,246 metagenomes from the Human Microbiome Project (HMP) phases I and II.
The dataset is described in Vanni et al. 2020.
More detailed information about the creation of the dataset and some of its applications can be found at https://dark.metagenomics.eu/.
The related agnostos-wf is a Snakemake workflow hosted in the GitHub repository https://github.com/functional-dark-side/agnostos-wf.
The agnostos-wf can be used to search the agnostosDB gene cluster HMM profiles and/or to integrate new sequence data (genes/contigs) into the database.
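
Because the record holds many files and has gone through many versions, it can help to pin a specific version when retrieving it programmatically. The following is a minimal sketch, assuming the public figshare v2 API and assuming its article response carries a "files" array with "name" and "size" fields. The numeric article ID is not given in this record, so `ARTICLE_ID` is a placeholder, and the helper names are hypothetical.

```python
import json
import urllib.request
from typing import Any, Dict, List, Optional, Tuple

FIGSHARE_API = "https://api.figshare.com/v2"
ARTICLE_ID = 0  # placeholder: replace with the numeric ID of the figshare record


def article_endpoint(article_id: int, version: Optional[int] = None) -> str:
    """Build the API URL for a record, optionally pinned to one version."""
    url = f"{FIGSHARE_API}/articles/{article_id}"
    if version is not None:
        url += f"/versions/{version}"
    return url


def summarize_files(article: Dict[str, Any]) -> Tuple[List[str], int]:
    """Return the file names and the total size in bytes.

    Assumes each entry of article["files"] has "name" and "size" keys.
    """
    files = article.get("files", [])
    names = [entry["name"] for entry in files]
    total_bytes = sum(entry.get("size", 0) for entry in files)
    return names, total_bytes


def fetch_article(article_id: int, version: Optional[int] = None) -> Dict[str, Any]:
    """Download and parse the record metadata (requires network access)."""
    with urllib.request.urlopen(article_endpoint(article_id, version)) as response:
        return json.load(response)
```

With a real article ID, `summarize_files(fetch_article(ARTICLE_ID, version=24))` would list the files of Version 24. Pinning the version keeps an analysis reproducible if the record is updated again.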

Funding

no funding
