Genetic variants analysis of three dromedary camels using whole genome sequencing data
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
Whole genome wide identification and annotation of genetic variations in camels is in its first steps. The aim of this study was the identification of genome wide variants, functional annotations of them and enrichment analysis of affected genes using whole genome sequencing data of three dromedary camels. The genomes of two Iranian female dromedary camels that mostly used to produce meat and milk were sequenced to 41.9-fold and 38.6-fold coverage. A total of 4,727,238 single-nucleotide polymorphisms (SNPs) and 692,908 indels (insertions and deletions) were found by mapping raw reads to the dromedary reference assembly (GenBank Accession: GCA_000767585.1). In-silico functional annotation of the discovered variants in under study samples revealed that most SNPs (2,305,738; 48.78%) and indels (339,756; 49.03%) were located in intergenic regions. A comparison of the identified SNPs with those of the African camel (BioProject Accession: PRJNA269274) indicated that they had 993,474 SNPs in common. We found 15,168 non-synonymous SNPs in the shared variants of the three camels that could affect gene function and protein structure. Obtained results revealed that there were 7085, 6271 and 4688 non-synonymous SNPs among the 3436, 3058 and 2882 genes in the specific gene sets of Yazd dromedary, Trod dromedary and African dromedary, respectively. The list of genes predicted to be affected by non-synonymous variants in different individuals was subjected to gene ontology (GO) enrichment analysis.