data_file_4.sh (1.98 kB)
POCP calculation for two genomes
This script takes two FASTA files of predicted genes, translated to amino acid sequences, and calculates the percentage of conserved proteins (POCP) between the two genomes (Qin et al. 2014; Journal of Bacteriology). A third parameter sets the number of CPUs used for the BLAST searches. The names of all intermediate files end in '_HH' so that they are unlikely to clash with, and overwrite, existing files.
EXAMPLE: ./POCP.sh genes_A.faa genes_B.faa 10
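For orientation, the calculation can be sketched as follows. This is a minimal illustration and not the contents of data_file_4.sh. It assumes the definition POCP = 100 * (C1 + C2) / (T1 + T2), where T1 and T2 are the total numbers of proteins in the two genomes and C1 and C2 are the numbers of proteins in each genome with a conserved match in the other, and it assumes the thresholds given by Qin et al. (2014): E-value below 1e-5, identity above 40%, and an aligned region covering more than 50% of the query. The function names, the tabular column layout, and the use of alignment length as a stand-in for the alignable region are assumptions made for this sketch.

```shell
#!/bin/sh
# Sketch only: function names and column layout are assumptions, not taken from the script.

# Total number of proteins in a FASTA file (one '>' header per protein).
count_proteins() {
    grep -c '^>' "$1"
}

# One direction of the reciprocal search (requires NCBI BLAST+ on the PATH).
# Arguments: query FASTA, subject FASTA, number of CPUs, output file.
# The '_HH' suffix on the database name mirrors the naming described above.
blast_one_way() {
    makeblastdb -in "$2" -dbtype prot -out "${2}_HH" > /dev/null
    blastp -query "$1" -db "${2}_HH" -evalue 1e-5 -num_threads "$3" \
        -outfmt "6 qseqid sseqid pident length qlen evalue" -out "$4"
}

# Number of query proteins with at least one hit passing all three thresholds.
# Expects tab-separated columns: qseqid sseqid pident length qlen evalue.
# Each query is counted once, however many of its hits pass.
count_conserved() {
    awk -F'\t' '
        $3 > 40 && $4 > 0.5 * $5 && ($6 + 0) < 0.00001 && !seen[$1]++ { n++ }
        END { print n + 0 }
    ' "$1"
}

# POCP from the conserved counts (C1, C2) and the protein totals (T1, T2).
pocp() {
    awk -v c1="$1" -v c2="$2" -v t1="$3" -v t2="$4" \
        'BEGIN { printf "%.2f\n", 100 * (c1 + c2) / (t1 + t2) }'
}
```

With these helpers, a full run would call blast_one_way once in each direction, pass each result file to count_conserved, and hand the two counts and the two count_proteins totals to pocp.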