figshare
Browse

sorry, we can't preview this file

gene_catalog_oral_gut_95_nucl.gz (7.65 GB)

Merged oral and gut gene catalog, consensus sequences (95% identity)

Download (7.65 GB)
Version 2 2019-05-23, 22:16
Version 1 2019-05-23, 21:57
dataset
posted on 2019-05-23, 22:16 authored by Braden TierneyBraden Tierney
Non-redundant microbial gene catalog (clustered at 95% sequence identity) from 1,473 oral microbiome and 2,182 gut microbiome shotgun metagenomic samples.

This is one of two raw output files from CD-HIT. It is a fasta file containing the sequences of the consensus genes – the longest genes in each cluster – as well as their ID's. These ID's map back to the second CD-HIT output file, the cluster (extension: .clstr) file.

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC