File: oral_1473_gene_catalogue_filtered_sequences.gz
Oral microbiome gene catalog, 95% nucleotide identity
Dataset posted on 16.04.2019 by Braden Tierney
Non-redundant microbial gene catalog (clustered at 95% sequence identity) from 1,473 oral microbiome shotgun metagenomic samples.
This is one of two raw output files from CD-HIT. It is a FASTA file containing the sequences of the consensus genes (the longest gene in each cluster) together with their IDs. These IDs map back to the second CD-HIT output file, the cluster file (extension: .clstr).
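As a rough illustration of how the two CD-HIT outputs relate, the sketch below reads a gzipped FASTA file and parses a .clstr file into a mapping from each representative ID to the IDs of its cluster members. It assumes the usual CD-HIT .clstr layout, in which each cluster starts with a `>Cluster N` line and the representative's line ends in `*`. The function names and the `load_catalog` helper are hypothetical, and the name of the companion .clstr file is not given here. Depending on the `-d` setting used when CD-HIT was run, IDs in the .clstr file may be truncated, so an exact match against the FASTA IDs is not guaranteed.

```python
import gzip
import re

# Matches the sequence ID on a .clstr member line, e.g. "0\t1500nt, >gene_1... *"
MEMBER = re.compile(r">(\S+?)\.\.\.")


def read_fasta(handle):
    """Yield (sequence_id, sequence) pairs from an iterable of text lines."""
    seq_id, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if seq_id is not None:
                yield seq_id, "".join(chunks)
            # Keep only the ID, i.e. the header text up to the first whitespace.
            seq_id, chunks = line[1:].split()[0], []
        else:
            chunks.append(line)
    if seq_id is not None:
        yield seq_id, "".join(chunks)


def parse_clstr(lines):
    """Map each representative ID to the list of IDs in its cluster."""
    clusters = {}
    members, rep = [], None
    for line in lines:
        line = line.strip()
        if line.startswith(">Cluster"):
            # A new cluster begins: store the one just finished.
            if rep is not None:
                clusters[rep] = members
            members, rep = [], None
        elif line:
            match = MEMBER.search(line)
            if match is None:
                continue
            members.append(match.group(1))
            if line.endswith("*"):  # "*" marks the representative sequence
                rep = match.group(1)
    if rep is not None:
        clusters[rep] = members
    return clusters


def load_catalog(fasta_gz_path):
    """Hypothetical helper: load a gzipped FASTA file into an ID -> sequence dict."""
    with gzip.open(fasta_gz_path, "rt") as handle:
        return dict(read_fasta(handle))
```

Calling `load_catalog("oral_1473_gene_catalogue_filtered_sequences.gz")` would hold every sequence in memory at once; for a catalog of this size, iterating over `read_fasta` directly is likely the safer choice.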