sorry, we can't preview this file
...but you can still download gut_gene_catalog_95perc_filtered_sequences.gz
Gut microbiome gene catalog, 95% nucleotide identity
datasetposted on 22.04.2019 by Braden Tierney
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
Non-redundant microbial gene catalog (clustered at 95% sequence identity) from 2,182 gut microbiome shotgun metagenomic samples.
This is one of two raw output files from CD-HIT. It is a fasta file containing the sequences of the consensus genes – the longest genes in each cluster – as well as their ID's. These ID's map back to the second CD-HIT output file, the cluster (extension: .clstr) file.