figshare
Browse

File(s) stored somewhere else

Please note: Linked content is NOT stored on Ag Data Commons and we can't guarantee its availability, quality, security or accept any liability.

Data from: Genome of ‘Charleston Gray’, the principal American watermelon cultivar, and genetic characterization of 1,365 accessions in the U.S. National Plant Germplasm System watermelon collection

dataset
posted on 2024-02-13, 13:56 authored by Shan Wu, Xin Wang, Umesh Reddy, Honghe Sun, Kan Bao, Lei Gao, Linyong Mao, Takshay Patel, Carlos Ortiz, Venkata L. Abburi, Padma Nimmakayala, Sandra Branham, Pat Wechter, Laura Massey, Kai‐Shu Ling, Chandrasekar Kousik, Sue A. Hammar, Yaakov Tadmor, Vitaly Portnoy, Amit Gur, Nurit Katzir, Nihat Guner, Angela Davis, Alvaro G. Hernandez, Chris L. Wright, Cecilia McGregor, Robert Jarret, Xingping Zhang, Yong Xu, Todd C. Wehner, Rebecca Grumet, Amnon Levi, Zhangjun Fei

Years of selection for desirable fruit quality traits in dessert watermelon (Citrullus lanatus) has resulted in a narrow genetic base in modern cultivars. Development of novel genomic and genetic resources offers great potential to expand genetic diversity and improve important traits in watermelon. Here, we report a high‐quality genome sequence of watermelon cultivar ‘Charleston Gray’, a principal American dessert watermelon, to complement the existing reference genome from ‘97103’, an East Asian cultivar. Comparative analyses between genomes of ‘Charleston Gray’ and ‘97103’ revealed genomic variants that may underlie phenotypic differences between the two cultivars. We then genotyped 1365 watermelon plant introduction (PI) lines maintained at the U.S. National Plant Germplasm System using genotyping‐by‐sequencing (GBS). These PI lines were collected throughout the world and belong to three Citrullus species, C. lanatus, C. mucosospermus and C. amarus. Approximately 25 000 high‐quality single nucleotide polymorphisms (SNPs) were derived from the GBS data using the ‘Charleston Gray’ genome as the reference. Population genomic analyses using these SNPs discovered a close relationship between C. lanatus and C. mucosospermus and identified four major groups in these two species correlated to their geographic locations. Citrullus amarus was found to have a distinct genetic makeup compared to C. lanatus and C. mucosospermus. The SNPs also enabled identification of genomic regions associated with important fruit quality and disease resistance traits through genome‐wide association studies. The high‐quality ‘Charleston Gray’ genome and the genotyping data of this large collection of watermelon accessions provide valuable resources for facilitating watermelon research, breeding and improvement.


Resources in this dataset:

  • Resource Title: Supplementary Data - Tables S1 to S13 (xlsx).

    File Name: Web Page, url: https://onlinelibrary.wiley.com/action/downloadSupplement?doi=10.1111/pbi.13136&file=pbi13136-sup-0002-TableS1-13.xlsx

    Direct download xlsx.

    Table S1: Summary of ‘Charleston Gray’ genome sequencing data.

    Table S2: Statistics of ‘Charleston Gray’ pseudomolecules.

    Table S3: Mapping of RNA‐Seq reads and ESTs to the ‘Charleston Gray’ genome.

    Table S4: Assessment of ‘Charleston Gray’ genome assembly completeness by BUSCO.

    Table S5: Summary of repeat sequences in the ‘Charleston Gray’ genome.

    Table S6: Summary of SNPs and small indels between ‘Charleston Gray’ and ‘97103’.

    Table S7: Structural variations between ‘Charleston Gray’ and ‘97103’.

    Table S8: Structural variations between ‘Charleston Gray’ and ‘97103’ overlapping with known QTLs.

    Table S9: List of the 1365 watermelon accessions from the NPGS collection.

    Table S10: Weighted pairwise FST values between C. mucosospermus and C. lanatus accessions from different geographic regions.

    Table S11: Phenotypic data for GWAS.

    Table S12: SNPs significantly associated with agronomic traits identified through GWAS.

    Table S13: Evaluation of the SNP imputation using KNN under the optimal parameters ‘w = 50, k = 7, p = ‐9, r = 0.8’.

Funding

USDA-NIFA: 2015‐51181‐24285

USDA-ARS

History

Data contact name

Levi, Amnon

Data contact email

Amnon.Levi@ars.usda.gov

Publisher

Plant Biotechnology Journal

Intended use

The high‐quality ‘Charleston Gray’ genome and the genotyping data of this large collection of watermelon accessions provide valuable resources for facilitating watermelon research, breeding and improvement.

Theme

  • Not specified

ISO Topic Category

  • biota
  • farming

National Agricultural Library Thesaurus terms

genome; watermelons; cultivars; United States; germplasm; fruit quality; desserts; Citrullus lanatus; genetic background; genomics; genetic variation; phenotype; genotyping; introduced plants; single nucleotide polymorphism; Geographical Locations; disease resistance; breeding; statistics; expressed sequence tags; genome assembly; quantitative trait loci; agronomic traits

Primary article PubAg Handle

Pending citation

  • No

Public Access Level

  • Public

Preferred dataset citation

Wu, Shan; Wang, Xin; Reddy, Umesh; Sun, Honghe; Bao, Kan; Gao, Lei; Mao, Linyong; Patel, Takshay; Ortiz, Carlos; Abburi, Venkata L.; Nimmakayala, Padma; Branham, Sandra; Wechter, Pat; Massey, Laura; Ling, Kai‐Shu; Kousik, Chandrasekar; Hammar, Sue A.; Tadmor, Yaakov; Portnoy, Vitaly; Gur, Amit; Katzir, Nurit; Guner, Nihat; Davis, Angela; Hernandez, Alvaro G.; Wright, Chris L.; McGregor, Cecilia; Jarret, Robert; Zhang, Xingping; Xu, Yong; Wehner, Todd C.; Grumet, Rebecca; Levi, Amnon; Fei, Zhangjun (2020). Data from: Genome of ‘Charleston Gray’, the principal American watermelon cultivar, and genetic characterization of 1,365 accessions in the U.S. National Plant Germplasm System watermelon collection. Plant Biotechnology Journal.