Virus species found in the sample.
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
The contigs split by virus species defined through closest homolog and sorted by the descending total number of derived reads and contigs. The species list have been manually curated and grouped in cases where yet unclassified strains dilute the species designation. Species producing a single read have also been removed. Furthermore, all alignments with an e-value at or above 1e-5 were ignored (this excluded 184 sequences, for which no species designation is provided).