figshare
Browse
1-s2.0-S1872497318300279-main.pdf (1.67 MB)

A phylogenetic framework facilitates Y-STR variant discovery and classification via massively parallel sequencing

Download (1.67 MB)
journal contribution
posted on 2018-04-20, 15:35 authored by Tunde I. Huszar, Mark A. Jobling, Jon H. Wetton
Short-tandem repeats on the male-specific region of the Y chromosome (Y-STRs) are permanently linked as haplotypes, and therefore Y-STR sequence diversity can be considered within the robust framework of a phylogeny of haplogroups defined by single-nucleotide polymorphisms (SNPs). Here we use massively parallel sequencing (MPS) to analyse the 23 Y-STRs in Promega’s prototype PowerSeqÔ Auto/Mito/Y System kit (containing the markers of the PowerPlex® Y23 [PPY23] System) in a set of 100 diverse Y chromosomes whose phylogenetic relationships are known from previous megabase-scale resequencing. Including allele duplications and alleles resulting from likely somatic mutation, we characterised 2311 alleles, demonstrating 99.83% concordance with capillary electrophoresis (CE) data on the same sample set. The set contains 267 distinct sequence-based alleles (an increase of 58% compared to the 169 detectable by CE), including 60 novel Y-STR variants phased with their flanking sequences which have not been reported previously to our knowledge. Variation includes 46 distinct alleles containing non-reference variants of SNPs/indels in both repeat and flanking regions, and 145 distinct alleles containing repeat pattern variants (RPV). For DYS385a,b, DYS481 and DYS390 we observed repeat count variation in short flanking segments previously considered invariable, and suggest new MPS-based structural designations based on these. We considered the observed variation in the context of the Y phylogeny: several specific haplogroup associations were observed for SNPs and indels, reflecting the low mutation rates of such variant types; however, RPVs showed less phylogenetic coherence and more recurrence, reflecting their relatively high mutation rates. In conclusion, our study reveals considerable additional diversity at the Y-STRs of the PPY23 set via MPS analysis, demonstrates high concordance with CE data, facilitates nomenclature standardisation, and places Y-STR sequence variants in their phylogenetic context.

History

Citation

Forensic Science International: Genetics, 2018, 35, pp. 97-106

Author affiliation

/Organisation/COLLEGE OF LIFE SCIENCES/Biological Sciences/Genetics and Genome Biology

Version

  • VoR (Version of Record)

Published in

Forensic Science International: Genetics

Publisher

Elsevier

issn

1872-4973

Acceptance date

2018-03-28

Copyright date

2018

Available date

2018-12-11

Publisher version

https://www.sciencedirect.com/science/article/pii/S1872497318300279#!

Language

en

Usage metrics

    University of Leicester Publications

    Categories

    No categories selected

    Keywords

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC