figshare
Browse

sorry, we can't preview this file

gut_gene_catalog_filtered_sequences_translated_protein_50_perc.clstr.gz (95.2 MB)

Gut microbiome gene catalog cluster file, 50% amino acid identity

Download (95.2 MB)
dataset
posted on 2019-04-16, 15:07 authored by Braden TierneyBraden Tierney
Gene clusters for non-redundant microbial gene catalog (clustered at 50% sequence identity) from 2,182 gut microbiome shotgun metagenomic samples.

This is one of two raw output files from CD-HIT. Each cluster is marked in the left column with a ">," and there is one cluster per consensus genes in the gene catalog files. The ID's in the right column are the genes within a cluster, with the consensus gene marked with an asterisk. Percent identity to the consensus gene is also described per gene ID.

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC