sorry, we can't preview this file

...but you can still download gut_gene_catalog_filtered_sequences_translated_protein_50_perc.clstr.gz

gut_gene_catalog_filtered_sequences_translated_protein_50_perc.clstr.gz (95.2 MB)

Gut microbiome gene catalog cluster file, 50% amino acid identity

Download (95.2 MB)
dataset
posted on 16.04.2019 by Braden Tierney
Gene clusters for non-redundant microbial gene catalog (clustered at 50% sequence identity) from 2,182 gut microbiome shotgun metagenomic samples.

This is one of two raw output files from CD-HIT. Each cluster is marked in the left column with a ">," and there is one cluster per consensus genes in the gene catalog files. The ID's in the right column are the genes within a cluster, with the consensus gene marked with an asterisk. Percent identity to the consensus gene is also described per gene ID.

History

Licence

Exports

Categories

Licence

Exports