Oral microbiome gene catalog, 95% nucleotide identity

2019-04-16T16:54:39Z (GMT) by Braden Tierney
Non-redundant microbial gene catalog (clustered at 95% sequence identity) from 1,473 oral microbiome shotgun metagenomic samples.

This is one of two raw output files from CD-HIT. It is a fasta file containing the sequences of the consensus genes – the longest genes in each cluster – as well as their ID's. These ID's map back to the second CD-HIT output file, the cluster (extension: .clstr) file.