sorry, we can't preview this file

...but you can still download gut_gene_catalog_filtered_sequences_translated_protein_50_perc.gz

gut_gene_catalog_filtered_sequences_translated_protein_50_perc.gz (1.09 GB)

Gut microbiome gene catalog, 50% amino acid identity

Download (1.09 GB)
dataset
posted on 16.04.2019, 15:07 by Braden Tierney
Non-redundant microbial gene catalog (clustered at 50% sequence identity) from 2,182 gut microbiome shotgun metagenomic samples.

This is one of two raw output files from CD-HIT. It is a fasta file containing the sequences of the consensus genes – the longest genes in each cluster – as well as their ID's. These ID's map back to the second CD-HIT output file, the cluster (extension: .clstr) file.

History

Licence

Exports

Categories

Licence

Exports