Merged oral and gut gene catalog, cluster file (95% identity)

2019-05-23T22:01:46Z (GMT) by Braden Tierney
Non-redundant microbial gene catalog cluster file (clustered at 95% sequence identity) from 1,473 oral microbiome shotgun metagenomic samples and 2,182 gut microbiome samples.

This is one of two raw output files from CD-HIT. Each cluster begins with a line marked ">" in the left column, and there is one cluster per consensus gene in the gene catalog files. The IDs in the right column are the genes within a cluster, with the consensus gene marked by an asterisk. The percent identity to the consensus gene is also reported for each of the other gene IDs.
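
As an illustration of the layout described above, the following is a minimal Python sketch of how such a cluster file might be read. It assumes the usual CD-HIT .clstr conventions: a ">Cluster N" header line per cluster, followed by member lines such as "0	900nt, >geneA... *" for the consensus gene and "1	870nt, >geneB... at +/98.50%" for the other members. The function name parse_clstr, the sample gene IDs, and the returned dictionary layout are hypothetical and are not part of the deposited files; the exact member-line details should be checked against the actual file.

```python
import re
from typing import Dict, Iterable

# Member lines are assumed to look like:
#   0<TAB>900nt, >geneA... *
#   1<TAB>870nt, >geneB... at +/98.50%
# The trailing field is either "*" (consensus) or "at <...>NN.NN%".
MEMBER_RE = re.compile(r">(?P<gene>.+?)\.\.\.\s+(?P<tail>\*|at\s+\S+)\s*$")


def parse_clstr(lines: Iterable[str]) -> Dict[str, dict]:
    """Group gene IDs by cluster from CD-HIT .clstr-style lines (hypothetical helper)."""
    clusters: Dict[str, dict] = {}
    current = None
    for raw in lines:
        line = raw.rstrip("\n")
        if not line.strip():
            continue
        if line.startswith(">"):
            # Header line: a new cluster begins here.
            current = line[1:].strip()
            clusters[current] = {"consensus": None, "members": []}
            continue
        match = MEMBER_RE.search(line)
        if match is None or current is None:
            continue  # skip lines that do not fit the assumed layout
        gene = match.group("gene")
        tail = match.group("tail")
        if tail == "*":
            # The asterisk marks the consensus gene, which has no identity value.
            clusters[current]["consensus"] = gene
            identity = None
        else:
            # Keep only the final number, dropping any strand or
            # alignment prefix such as "+/".
            identity = float(tail.rstrip("%").split("/")[-1].split()[-1])
        clusters[current]["members"].append((gene, identity))
    return clusters


# Made-up sample lines, for demonstration only.
SAMPLE = [
    ">Cluster 0\n",
    "0\t900nt, >geneA... *\n",
    "1\t870nt, >geneB... at +/98.50%\n",
    ">Cluster 1\n",
    "0\t300nt, >geneC... *\n",
]

if __name__ == "__main__":
    for name, info in parse_clstr(SAMPLE).items():
        print(name, info["consensus"], info["members"])
```

For the real file, the same function could be given an open file handle in place of SAMPLE, since it only iterates over lines and does not load the whole file as a single string.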