The Marine Microbial Eukaryotic Transcriptome Sequencing Project (MMETSP) data set contains cultured samples of pelagic and endosymbiotic marine eukaryotic species representing more than 40 phyla (Keeling et al. 2014).
Each of these files is a de novo transcriptome assembly of one individual sequencing sample, defined by a unique SRR id. Files are named as follows:
Methods for the de novo transcriptome assembly are described in the Eel pond khmer protocols (Brown et al. 2015). Automated scripts are available on github:
https://github.com/ljcohen/MMETSP
C. Titus Brown, Camille Scott, and Leigh Sheneman. 2015. The Eel Pond mRNAseq Protocol. https://khmer-protocols.readthedocs.io/en/ctb/mrnaseq/