Appendix 1-figure 9: Ppyr1.2 Blobtools results
Dataset posted on 13.08.2018 by Timothy Fallon, Sarah Sander Lower
Given the recognized importance of filtering genome assemblies to avoid misinterpretation of the data, we sought to systematically remove assembled non-firefly contaminant sequence from Ppyr1.2. Using the blobtools toolset (v.1.0.1), we taxonomically annotated our scaffolds by performing a blastn (v2.6.0+) nucleotide sequence similarity search against the NCBI nt database, and a diamond (v.0.9.10.111) translated nucleotide sequence similarity search against the UniProt reference proteomes (July 2017). Using this similarity information, we annotated the scaffolds with blobtools using parameters “-x bestsumorder --rank phylum”.
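As a rough illustration of what the “bestsumorder” rule does at a single taxonomic rank such as phylum, the following is a minimal Python sketch and not blobtools' own code. It assumes the commonly described behaviour of the rule: bitscores are summed per taxon within one hits file, the top-scoring taxon is assigned, and a later hits file is consulted only when the earlier one has no hits for that scaffold. The function names, the "no-hit" label, and the example hits are hypothetical, and the order in which the blastn and diamond results were supplied is not stated here.

```python
from collections import defaultdict


def best_sum(hits):
    """Return the taxon with the highest summed bitscore, or None if there are no hits.

    `hits` is a list of (taxon, bitscore) pairs for one scaffold from one hits file.
    """
    totals = defaultdict(float)
    for taxon, bitscore in hits:
        totals[taxon] += bitscore
    if not totals:
        return None
    return max(totals, key=totals.get)


def best_sum_order(hit_sets):
    """Apply best_sum to each hits file in priority order; the first file with hits wins.

    `hit_sets` is a list of hit lists, one per hits file, ordered by priority.
    Returns the hypothetical label "no-hit" when no file has any hit.
    """
    for hits in hit_sets:
        taxon = best_sum(hits)
        if taxon is not None:
            return taxon
    return "no-hit"


# Hypothetical hits for one scaffold: first file, then second file.
first_file = [("Arthropoda", 200.0), ("Proteobacteria", 150.0), ("Proteobacteria", 100.0)]
second_file = [("Arthropoda", 500.0)]
assignment = best_sum_order([first_file, second_file])
```

In this made-up case the scaffold is assigned to Proteobacteria, because the first file alone decides the call (250 summed bitscore versus 200) and the stronger Arthropoda hit in the second file is never consulted. Under this reading of the rule, the order of the hits files affects which scaffolds are flagged as contaminants.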