12864_2023_9456_MOESM1_ESM.pptx (44.64 MB)
Download file

Additional file 1 of Homoeologous evolution of the allotetraploid genome of Poa annua L.

Download (44.64 MB)
posted on 2023-06-27, 03:38 authored by Christopher W. Benson, Matthew R. Sheltra, Peter J. Maughan, Eric N. Jellen, Matthew D. Robbins, B. Shaun Bushman, Eric L. Patterson, Nathan D. Hall, David R. Huff
Additional file 1: Supplementary Table 1. Features of the genome assemblies and annotations. Parental species, Poa infirma (PiA) and P. supina (PsB), relative to the subgenomes of P. annua (PaA and PaB). Values correspond to the seven pseudomolecules. BUSCOs are n=1,614. Supplementary Fig. 1. Linked density histograms of the Poa infirma and P. supina genomes. The Omni-C plot represents long-range cis information using proximity ligation and mapping position of paired-end data. Red indicates the number of read pair interactions within each bin. Supplementary Fig. 2. The homoeologous and orthologous sequences of Poa annua and its diploid parents, P. infirma and P. supina. (a) The evolutionary pathway and chromosomal relationships within and between the homologous sequences of P. annua and its diploid progenitors. (b) The distribution of sequence identity measured by the gap-compressed sequence identity of full-genome alignments. Percentages above violin plots indicate the median. (c) Chromosome lengths of the genome assemblies of all three species. Supplementary Fig. 3. Whole-genome sequence alignment depicts the primary mapping of parental chromosomes (PiA & PsB) to allotetraploid, Poa annua (PaA & PaB). The black arrow over Pa2B highlights 30 Mb of novel sequence in the tetraploid genome assembly, mostly composed of repetitive DNA (22.7Mb). Supplementary Fig. 4. The distribution of shared orthologous clusters (gene families) between the A and B genomes of Poa infirma (PiA), P. supina (PsB), and P. annua (PaA & PaB). Single-copy gene clusters are not depicted. Supplementary Fig. 5. Estimated molecular divergence of the PaA and PaB subgenomes of P. annua and the genomes of the diploid parents (PiA and PsB). Arrows and corresponding values highlight the peak density of synonymous substitutions. Supplementary Fig. 6. Sequence alignment of five model monocots spanning two whole-genome duplications. (a) A phylogenetic tree shows the species relationships. Pineapple (Ananas comosus) serves as outgroup because its speciation from the BOP and PACMAD clades of grasses predates the most recent of the ancestral Poaceae WGD events, rho (r). (b) Genomic alignment between monocots Ananas comosus, Brachypodium distachyon, Poa infirma, P. supina, and P. annua. Percentages in red show the ratio of 1:1 orthogroups relative to A. comosus. The syntenic block highlighted in green shows the colinear evolution of a cluster of genes. Supplementary Fig. 7. Retention of Poa annua genes across the parental chromosomes of P. infirma and P. supina. PiA to PaA and PsB to PaB have elevated gene retention (~97%) across chromosomes and reflects minimal gene loss (fractionation) since polyploidization. PiA to PaB and PsB to PaA have lower gene retention (~61%) and reflects the quantity of genes retained since the two parental lineages diverged from their common ancestor. The red arrow highlights the largest homoeologous exchange in the P. annua genome. Supplementary Fig. 8. Principal component analysis compares the gene expression profiles of 30 Poa annua samples. On the right, a PCA including all samples, with variables being subgenome (A or B), treatment (mowed or unmowed), and biotype (dwarf-type or wild-type). On the left, a PCA of the A subgenome illustrates that samples cluster by mowing treatment but biotypes are nested. Supplementary Fig. 9. Differential gene expression analysis across the A and B subgenomes of Poa annua. (a) Principal component analysis of the differential gene expression profiles of P. annua samples with biotypes (dwarf-type or wild-type) removed from the analysis. (b) A heatmap of the DEGs across subgenome and treatment. Blue genes are upregulated and orange are downregulated. (c) Clusters of genes with similar expression profiles. On the left, a cluster of genes upregulated in the B subgenome. On the right, a cluster upregulated in unmowed plants. In the DEG hierarchal cluster and dendrogram, red=B_mowed, green=B_unmowed, purple=A_unmowed, blue=A_mowed. Supplementary Fig. 10. Treemaps cluster enriched gene ontologies in the subgenome (A vs B) and treatment (mowed vs unmowed) comparisons based on semantic similarity of enriched terminologies. Supplementary Fig. 11. Geographic distribution of 13 re-sequenced Poa annua accessions. Two additional plants (PA-14 dwarf and Pa-14 WT) were also sequenced but not depicted on the map. Supplementary Fig. 12. SNP density across a 1 Mb sliding window demonstrates variability in sequence identity between Poa annua accessions and across chromosomes. Pink is sample ‘Arizona’ and blue is sample ‘Wales’. Coverage plots of both samples are included for reference. The scale bar is 320 Mb in length with each hash representing 10 Mb. Supplementary Fig. 13. The core Poa annua genome. Of the 76,541 gene annotationsin the P. annua reference genome, 68,733 are present in all 15 re-sequenced genotypes. 7,808 genes are dispensable and absent in at least one genotype. Supplementary Fig. 14. Depth of coverage plotted along the Poa annua reference genome suggests large-scale structural modification in P. annua chromosomes. Supplementary Fig. 15. Homoeolog-specific markers and Sanger sequencing verifies the composition of a large-scale chromosomal rearrangement in Poa annua. Alignment of the homoeologous sequences of Pa1A and Pa1B span the breakpoint of a large-scale chromosomal rearrangement in re-sequenced genotype ‘Arizona’ but not ‘Germany’. Sample ‘Ohio’ has both a rearranged and non-rearranged chromosome, verifying the haplotype specificity (heterozygotes) of chromosome rearrangements in some individuals. Black arrows highlight homoeolog distinguishing SNPs on either side of the breakpoint. Supplementary Fig. 16. Sequence alignment of three samples to the Poa annua reference genome illustrates recombination hotspots. Arrows point to alignment breakpoints at the 224 Mb deletion. Black arrows point to breakpoints that occur on the same sequence coordinates for both parental haplotypes. Red arrows point to haplotype-variable breakpoints. Blue boxes at the top of the alignment window show genes with exons (boxes) and introns (lines connecting boxes). Arrows that are perpendicular to genes are gene fusion events. Supplementary Fig. 17. Sequence alignments at chromosome 1A illustrates local variability at crossover ‘hotspots’. Black arrows indicate positions where both pairs of homologous chromosomes break at the same location and red arrows point to haplotype-variable breakpoints. Blue boxes at the top of the alignment window show genes with exons (boxes) and introns (lines connecting boxes). Arrows that are perpendicular to genes are gene fusion events. Supplementary Fig. 18. Sequence alignment of four Poa annua accessions shows structural variation at its two EPSPS homoeologs. ajg15317 and ajg73723 are EPSP synthases on chromosomes Pa5A and Pa5B, respectively. Blue boxes at the top depicts the genes exons (boxes) and introns (lines connecting boxes). The ajg15317 transcript is 3,891 bp in length, while ajg73723 is 9,092 bp. Grey boxes are reads that aligned to the reference genome as proper pairs. Open boxes are reads that mapped equally well to five or more locations in the genome. Red pairs have longer than anticipated insert lengths and depict putative indels at ajg73723’s longest introns. Sample ‘Sweden' is heterozygous for a 2,738 deletion at the second intron, while ‘Wales’ and breeding line ‘Pa-14 dwarf’ are homozygous for the deletion. Only the breeding line sample, ‘Pa-14 dwarf’ contained a 2,954 deletion at the 7th intron. Purple alignments in ajg15317 show reads with mates that map to the other subgenome homoeolog (ajg73723), within the indel at the second intron. Supplementary Fig. 19. Linear K-mer profiles and fitted models of the Poa infirma and P. supina genomes. Black lines show the fit of the model to the distribution of K-mer frequencies (blue). Sequencing errors are identified by low coverage k-mers shown in orange. The P. infirma and P. supina models follow a diploid distribution with low and high heterozygosity, respectively. Supplementary Fig. 20. The chloroplast sequences of Poa infirma and P. supina. (a) Chloroplast maps for P. supina and P. infirma. (b) Sequence alignment of chloroplasts show that P. annua’s maternal parent is P. infirma.(PPTX 45713 KB)