10.6084/m9.figshare.c.3660374_D1.v1
Aboozar Soorni
Aboozar
Soorni
David Haak
David
Haak
David Zaitlin
David
Zaitlin
Aureliano Bombarely
Aureliano
Bombarely
Additional file 1: Figure S1. of Organelle_PBA, a pipeline for assembling chloroplast and mitochondrial genomes from PacBio DNA sequencing data
Springer Nature
2017
Chloroplast
Mitochondria
PacBio
Organelle Genome Assembly
2017-01-07 05:00:00
Figure
https://springernature.figshare.com/articles/figure/Additional_file_1_Figure_S1_of_Organelle_PBA_a_pipeline_for_assembling_chloroplast_and_mitochondrial_genomes_from_PacBio_DNA_sequencing_data/4529186
Chart flow for the Organelle_PBA software. Reads are mapped to an organelle reference using BlasR (1). The BlasR output is parsed and the sequence IDs are used to retrieve the reads from the input file (2). Organelle identified reads are assembled using Sprai (3). The program checks if the assembly is complete comparing its length with the reference (4). If it is not complete, it performs a scaffolding using SSPACE-Long and the whole PacBio dataset (5). It is complete it moves to the new checking point where it check for circularity (6). If it detects circularity by a self-BlastN, it trims the sequence corresponding to the circular overlap (6). Finally it check for the repeat assembly (7) and if it finds it, it breaks in four parts, identify the complete inverted repeat (IR), duplicate it (IRa and IRb) and re-assemble it will the long and short single copy (LSC, SSC) (9). (TIFF 5352 kb)