data_file_5.txt (0.6 kB)
Artificial contigs from a complete genome
This code reads a FASTA file of complete contigs (chromosome and plasmid(s)) into R and divides each sequence up into 50 smaller sequences based on a chi-squared distribution (Degrees of Freedom = 1) whose values have been converted to proportions of the length of the sequence. The result is an artificial draft genome where each contig has been labelled with the replicon from which it originated.