Bacterial genomic 16S gene copy number and sequences
This data set includes estimates of genomic copy number of the 16S rRNA gene and corresponding 16S gene sequences for 484 bacterial taxa, as described in the paper:
Kembel SW, Wu M, Eisen JA, Green JL (2012) Incorporating 16S Gene Copy Number Information Improves Estimates of Microbial Diversity and Abundance. PLoS Comput Biol 8(10): e1002743. doi:10.1371/journal.pcbi.1002743
http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1002743
Two files are included:
reftaxa.16S.copynumber.csv: estimated number of copies of the 16S gene in the genome of the taxon (see article for description of methodology)
reftaxa.16S.alignment.fasta: 16S sequence for each taxon aligned to the GreenGenes core set using PyNAST
For each file, the reference taxa identifiers consist of a code of the format "t511693-Escherichia", where 511693 represents the NCBI taxonomic identifier of the taxon and Escherichia represents the taxonomic genus identity of the taxon.