figshare
Browse
1/1
2 files

Bacterial genomic 16S gene copy number and sequences

dataset
posted on 2014-01-31, 13:45 authored by Steven KembelSteven Kembel

This data set includes estimates of genomic copy number of the 16S rRNA gene and corresponding 16S gene sequences for 484 bacterial taxa, as described in the paper:

Kembel SW, Wu M, Eisen JA, Green JL (2012) Incorporating 16S Gene Copy Number Information Improves Estimates of Microbial Diversity and Abundance. PLoS Comput Biol 8(10): e1002743. doi:10.1371/journal.pcbi.1002743

http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1002743

 

Two files are included:

reftaxa.16S.copynumber.csv: estimated number of copies of the 16S gene in the genome of the taxon (see article for description of methodology)

reftaxa.16S.alignment.fasta: 16S sequence for each taxon aligned to the GreenGenes core set using PyNAST

For each file, the reference taxa identifiers consist of a code of the format "t511693-Escherichia", where 511693 represents the NCBI taxonomic identifier of the taxon and Escherichia represents the taxonomic genus identity of the taxon.

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC