TY - DATA T1 - Protist Ribosomal Reference database (PR2) - SSU rRNA gene database - flat files for mothur PY - 2017/01/20 AU - Daniel Vaulot UR - https://figshare.com/articles/dataset/PR2_rRNA_gene_database/3803709 DO - 10.6084/m9.figshare.3803709.v11 L4 - https://ndownloader.figshare.com/files/7381309 L4 - https://ndownloader.figshare.com/files/7381303 L4 - https://ndownloader.figshare.com/files/7381369 L4 - https://ndownloader.figshare.com/files/7405798 KW - rRNA genes KW - marine algae KW - protists KW - phytoplankton KW - microalgae KW - Phycology (incl. Marine Grasses) KW - Marine and Estuarine Ecology (incl. Marine Ichthyology) KW - Biogeography and Phylogeography KW - Molecular Biology KW - Bioinformatics KW - Marine Biology KW - Microbial Ecology KW - Microbiology N2 - The Protist Ribosomal Reference database (PR2) provides a unique access to eukaryotic small sub-unit (SSU) ribosomal RNA and DNA sequences, with curated taxonomy. The database mainly consists of nuclear-encoded protistan sequences. However, metazoans, land plants, macrosporic fungi and eukaryotic organelles (mitochondrion, plastid and others) are also included because they are useful for the analysis of high-troughput sequencing data sets. Introns and putative chimeric sequences have been also carefully checked. Taxonomic assignation of sequences consists of eight unique taxonomic fields.The original web site (http://ssu-rrna.org/pr2) is currently out and we are proposing updated version of PR2 as flat files to use for annotating metabarcodes.Files- pr2_gb203_version_4.x.zip contains two files for use with Qiime or Mothur.   * pr2....fasta contains all sequences in fasta format with the accession in the description line   * pr2....tax contains the taxonomy of each sequence separated from the accession number by a tabulation- pr2_gb203_version_4.x.for_BLAST.zip contains one fasta file with the accession number of the sequence and its taxonomy on the description line.  It is suitable to build a local database for BLAST search- PR2 version notes.docx contains the revision history- PR2 versions.xls is a condensed list of the different versionsCurrent version : 4.5 based on GenBank 203Last update : 17 January 2017Notes- Qiime can only use 7 taxonomical levelsContactLaure GUILLOU and Daniel VAULOTDIPO team, Plankton Group, UMR 7144 CNRS-UPMCStation Biologique,Place G. Tessier29680 Roscoff FRANCEemail: vaulot@sb-roscoff.frContributors- Tristan Biard- Margot Tragin- Bente Edvardsen ReferencesGuillou, L., Bachar, D., Audic, S., Bass, D., Berney, C., Bittner, L., Boutte, C. et al. 2013. The Protist Ribosomal Reference database (PR2): a catalog of unicellular eukaryote Small Sub-Unit rRNA sequences with curated taxonomy. Nucleic Acids Res. 41:D597–604.Edvardsen, B., Egge, E.S. & Vaulot, D. 2016. Diversity and distribution of haptophytes revealed by environmental sequencing and metabarcoding – a review. Perspect. Phycol. in press.Tragin, M., Lopes dos Santos, A., Christen, R. & Vaulot, D. 2016. Diversity and ecology of green microalgae in marine systems: an overview based on 18S rRNA gene sequences. Perspect. Phycol. in press.  ER -