Minimal dataset for ConFindr testing using pytest

Version 3 2023-06-13, 13:39
Version 2 2023-06-13, 13:14
Version 1 2023-05-16, 17:33
dataset
posted on 2023-06-13, 13:39 authored by Liam Brown

A .tar.gz archive containing 14 .fastq.gz files of paired-end Illumina whole-genome sequence data from different foodborne pathogens, plus a tab-separated values (TSV) file with the metadata for these samples. The sequence data were obtained by selecting samples from the originally published ConFindr dataset (doi: 10.7717/peerj.6995) and downsampling them. The sample metadata were taken from the Supplemental Information of the original publication. The DownsampleFactor column in the metadata file gives the fraction of the original data retained for each sample (e.g. 0.5 is 2-fold downsampling, 0.1 is 10-fold).
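The relationship between DownsampleFactor and fold reduction can be sketched in a few lines of Python. This is only an illustration: the `Sample` column, the sample names, and the function name are hypothetical, and only the DownsampleFactor column name comes from the description above.

```python
import csv
import io


def downsample_folds(tsv_text, factor_column="DownsampleFactor"):
    """Return the fold reduction (1 / factor) for each metadata row."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    return [1.0 / float(row[factor_column]) for row in reader]


# Hypothetical two-row table; the real metadata file has its own columns and values.
example = "Sample\tDownsampleFactor\nsample_a\t0.5\nsample_b\t0.1\n"
folds = downsample_folds(example)
```

With the real metadata file, the text passed in would be the file's contents rather than the inline string used here.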


Changelog


Version 3


  • Changed the test_samples archive from .zip back to .tar.gz, the format used in Version 1.


Version 2


  • Renamed '_1' and '_2' file patterns to '_R1' and '_R2' to reflect default ConFindr parameters.
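
As a rough illustration of why the '_R1' and '_R2' patterns matter, the sketch below groups paired-end files by those tags. It does not reproduce ConFindr's own file-matching logic; the function name and the file names are hypothetical.

```python
from collections import defaultdict


def pair_reads(filenames, forward="_R1", reverse="_R2"):
    """Group file names into (forward, reverse) pairs keyed by sample name."""
    found = defaultdict(dict)
    for name in filenames:
        for tag, direction in ((forward, "forward"), (reverse, "reverse")):
            if tag in name:
                # Everything before the first occurrence of the tag is the sample name.
                sample = name.split(tag)[0]
                found[sample][direction] = name
                break
    # Keep only samples for which both reads of the pair are present.
    return {
        sample: (reads["forward"], reads["reverse"])
        for sample, reads in found.items()
        if len(reads) == 2
    }


# Hypothetical file names; the archive's actual names may differ.
files = [
    "sample_a_R1.fastq.gz",
    "sample_a_R2.fastq.gz",
    "sample_b_R1.fastq.gz",
]
pairs = pair_reads(files)
```

Here `sample_b` is dropped because its reverse read is missing, which is the kind of mismatch a test fixture with consistent naming avoids.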
