Cornet-2022-GBIO-Figshare.tgz (680.41 MB)
Download fileSoftware repository for Cornet and Baurain (2022) Contamination detection in genomic data: more is not enough
This repository provides Singularity definition files for six contamination detection tools. Database and files for Physeter are also provided, as well as the script to simulate the data.
Singularity
- contams.def: Definition file for CheckM, Forty-Two, GUNC, Physeter and Kraken2
- eukcc.def: Definition file for EukCC
Physeter
- life-tqmd-of73.dmnd: DIAMOND blast database for Physeter (from https://doi.org/10.3389/fmicb.2021.755101)
- life-tqmd-of73.gca: List of GCA numbers for Physeter database (needed to enable the k-fold mode)
- contam-labels.idl: idl file use for Physeter and Forty-Two (from https://bitbucket.org/phylogeno/42-ribo-msas/)
- taxdump-20211206: NCBI Taxonomy dump used across the study
Simulations
- Chimeric-genomes.py: Python script used to create the chimeric bacterial genome