figshare
Browse
1/1
17 files

A superior contiguous whole genome assembly for shrimp (Penaeus indicus)

Version 2 2021-08-31, 08:52
Version 1 2021-07-14, 06:48
dataset
posted on 2021-08-31, 08:52 authored by Vinaya Kumar KatneniVinaya Kumar Katneni, Mudagandur S. Shekhar, Ashok Kumar Jangam, Karthic Krishnan, Sudheesh K. Prabhudas, Nimisha Kaikkolante, Dushyant Singh Baghel, Vijayan K. Koyadan, Joykrushna Jena, Trilochan Mohapatra
The dataset contains files pertaining to the assembly and annotation of Indian White Shrimp, Penaeus indicus genome, repeat annotation, gene family analyses, phylogenetic analyses and SNP identification in genes. The assembly was generated with Pacbio subreads (WTDBG2.5), polished for indels with Illumina paired-end reads (POLCA) and scaffolded with HiC chromatin interaction data (3D-DNA). Annotation of repeat elements was performed with RMBlast search of RepeatMasker module implemented in OmicsBox v1.3.11. The gene family analyses were performed using OrthoMCL v2.0.9. The SNP calling was performed from vcf file generated with pooled-RNAseq reads. The dataset contains the following files; assembly scaffolds, genes sequences and their coordinates in genome, protein sequences, gene annotations, sequence alignment based on single-copy orthologous genes and SNP positions in genes.

Funding

ICAR-CRP on Genomics

History