Exploring structural variation and gene family architecture with De Novo assemblies of 15 Medicago genomes
Posted on 2017-03-27 - 05:00
Abstract Background Previous studies exploring sequence variation in the model legume, Medicago truncatula, relied on mapping short reads to a single reference. However, read-mapping approaches are inadequate to examine large, diverse gene families or to probe variation in repeat-rich or highly divergent genome regions. De novo sequencing and assembly of M. truncatula genomes enables near-comprehensive discovery of structural variants (SVs), analysis of rapidly evolving gene families, and ultimately, construction of a pan-genome. Results Genome-wide synteny based on 15 de novo M. truncatula assemblies effectively detected different types of SVs indicating that as much as 22% of the genome is involved in large structural changes, altogether affecting 28% of gene models. A total of 63 million base pairs (Mbp) of novel sequence was discovered, expanding the reference genome space for Medicago by 16%. Pan-genome analysis revealed that 42% (180 Mbp) of genomic sequences is missing in one or more accession, while examination of de novo annotated genes identified 67% (50,700) of all ortholog groups as dispensable â estimates comparable to recent studies in rice, maize and soybean. Rapidly evolving gene families typically associated with biotic interactions and stress response were found to be enriched in the accession-specific gene pool. The nucleotide-binding site leucine-rich repeat (NBS-LRR) family, in particular, harbors the highest level of nucleotide diversity, large effect single nucleotide change, protein diversity, and presence/absence variation. However, the leucine-rich repeat (LRR) and heat shock gene families are disproportionately affected by large effect single nucleotide changes and even higher levels of copy number variation. Conclusions Analysis of multiple M. truncatula genomes illustrates the value of de novo assemblies to discover and describe structural variation, something that is often under-estimated when using read-mapping approaches. Comparisons among the de novo assemblies also indicate that different large gene families differ in the architecture of their structural variation.
CITE THIS COLLECTION
DataCite
3 Biotech
3D Printing in Medicine
3D Research
3D-Printed Materials and Systems
4OR
AAPG Bulletin
AAPS Open
AAPS PharmSciTech
Abhandlungen aus dem Mathematischen Seminar der Universität Hamburg
ABI Technik (German)
Academic Medicine
Academic Pediatrics
Academic Psychiatry
Academic Questions
Academy of Management Discoveries
Academy of Management Journal
Academy of Management Learning and Education
Academy of Management Perspectives
Academy of Management Proceedings
Academy of Management Review
Zhou, Peng; Silverstein, Kevin; Ramaraj, Thiruvarangan; Guhlin, Joseph; Denny, Roxanne; Liu, Junqi; et al. (2017). Exploring structural variation and gene family architecture with De Novo assemblies of 15 Medicago genomes. figshare. Collection. https://doi.org/10.6084/m9.figshare.c.3727633.v1
or
Select your citation style and then place your mouse over the citation text to select it.
SHARE
Usage metrics
Read the peer-reviewed publication
AUTHORS (13)
PZ
Peng Zhou
KS
Kevin Silverstein
TR
Thiruvarangan Ramaraj
JG
Joseph Guhlin
RD
Roxanne Denny
JL
Junqi Liu
AF
Andrew Farmer
KS
Kelly Steele
RS
Robert Stupar
JM
Jason Miller
PT
Peter Tiffin
JM
Joann Mudge
NY
Nevin Young
KEYWORDS
De Novo assembliesgene family architecturenucleotide-binding site leucine-richResults Genome-wide syntenyaccession-specific gene poolread-mapping approachescopy number variationassemblyreference genome spaceLRRgene familiesNBS-LRRSV15 Medicago genomes Abstract Backgroundheat shock gene familiestruncatula genomes