sorry, we can't preview this file
...but you can still download gene_catalog_oral_gut_95_nucl.gz
Merged oral and gut gene catalog, consensus sequences (95% identity)
datasetposted on 23.05.2019 by Braden Tierney
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
Non-redundant microbial gene catalog (clustered at 95% sequence identity) from 1,473 oral microbiome and 2,182 gut microbiome shotgun metagenomic samples.
This is one of two raw output files from CD-HIT. It is a fasta file containing the sequences of the consensus genes – the longest genes in each cluster – as well as their ID's. These ID's map back to the second CD-HIT output file, the cluster (extension: .clstr) file.