Version 2 2021-08-31, 08:52
Version 1 2021-07-14, 06:48
dataset
posted on 2021-08-31, 08:52, authored by Vinaya Kumar Katneni, Mudagandur S. Shekhar, Ashok Kumar Jangam, Karthic Krishnan, Sudheesh K. Prabhudas, Nimisha Kaikkolante, Dushyant Singh Baghel, Vijayan K. Koyadan, Joykrushna Jena, Trilochan Mohapatra
The dataset contains files pertaining to the assembly and annotation of the
Indian white shrimp (Penaeus indicus) genome, repeat annotation, gene family
analyses, phylogenetic analyses and SNP identification in genes. The assembly
was generated from PacBio subreads (WTDBG2.5), polished for indels with
Illumina paired-end reads (POLCA) and scaffolded with Hi-C chromatin
interaction data (3D-DNA). Repeat elements were annotated with the RMBlast
search engine of the RepeatMasker module implemented in OmicsBox v1.3.11.
Gene family analyses were performed using OrthoMCL v2.0.9. SNPs were called
from a VCF file generated with pooled RNA-seq reads. The dataset contains the
following files: assembly scaffolds, gene sequences and their coordinates in
the genome, protein sequences, gene annotations, a sequence alignment based on
single-copy orthologous genes, and SNP positions in genes.
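
For readers who want to relate a VCF file to gene coordinates like those listed above, the following is a minimal sketch of one way to do it, not the authors' pipeline. It assumes standard tab-separated VCF records (CHROM, POS, ID, REF, ALT, ...) and a hypothetical in-memory mapping from scaffold name to gene intervals. The names `is_snp`, `snps_in_genes` and the 1-based inclusive `(gene_id, start, end)` layout are illustrative assumptions; the actual file layouts in this dataset are not described here and may differ.

```python
from typing import Dict, Iterable, List, Tuple

# Hypothetical gene record: (gene_id, start, end), 1-based inclusive coordinates.
Gene = Tuple[str, int, int]
Hit = Tuple[str, str, int, str, str]  # (gene_id, scaffold, pos, ref, alt)

BASES = {"A", "C", "G", "T"}


def is_snp(ref: str, alt: str) -> bool:
    """Return True when REF and every ALT allele are single nucleotides."""
    return ref in BASES and all(allele in BASES for allele in alt.split(","))


def snps_in_genes(vcf_lines: Iterable[str], genes: Dict[str, List[Gene]]) -> List[Hit]:
    """Collect SNP records whose position falls inside a gene interval."""
    hits: List[Hit] = []
    for line in vcf_lines:
        # Skip blank lines and VCF header lines (both '##' meta and '#CHROM').
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        scaffold, pos, ref, alt = fields[0], int(fields[1]), fields[3], fields[4]
        if not is_snp(ref, alt):
            continue  # indels and other variant types are ignored
        # Linear scan over the genes on this scaffold; adequate for a sketch.
        for gene_id, start, end in genes.get(scaffold, []):
            if start <= pos <= end:
                hits.append((gene_id, scaffold, pos, ref, alt))
    return hits
```

The linear scan over genes keeps the sketch short; for a full genome annotation, an interval index or sorted lookup would be a more practical choice.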