Nucleotide word frequency plots and phylogenetic analysis of metagenome assemblies.
Nucleotide word frequency principal component analysis (PCA) of assembled metagenome sequence data (contigs > 1500 bp) from five chemotrophic geothermal habitats in Yellowstone National Park (YNP). A. Metagenome sequence colored by site (Crater Hills = gold; Norris Geyser Basin = red; Joseph's Coat = blue; Mammoth Hot Springs = green; Calcite Springs = violet). B. Same PCA orientation as in Panel A, with colors designating phylogenetic affiliation at the order level (Sulfolobales = gold; Desulfurococcales = light blue; Thermoproteales = dark blue; Aquificales = green; Thermales = violet; Unassigned = black). C. Same PCA orientation as in Panel A, with colors designating phylogenetic classification at the domain level (Archaea = gold; Bacteria = violet).