File(s) under embargo

Reason: Manuscript not published yet

22

day(s)

until file(s) become available

Global patterns and rates of habitat transitions across the eukaryotic tree of life

dataset
posted on 19.10.2021, 13:25 by Mahwash JamyMahwash Jamy, Fabien Burki, Daniel VaulotDaniel Vaulot, charlie biwer, Aleix Obiol, Hongmei Jing, Sari Peura, Ramon Massana

This repository contains supplementary data associated with the manuscript.


There are 8 files in this collection:


pr2.transitions.fasta.gz

The in house database based on the PR2 database used for taxonomic annotation of long-read environmental sequences. Fasta file.


soil.clustered.filtered.fasta.gz

Short-read environmental sequences from soils corresponding the V4 fragment of the 18S gene. Sequences have been clustered at 97% similarity, and low abundance sequences filtered out to be conservative (see manuscript for details).


freshwater.clustered.filtered.fasta.gz

Short-read environmental sequences from freshwater corresponding the V4 fragment of the 18S gene. Sequences have been clustered at 97% similarity, and low abundance sequences filtered out to be conservative (see manuscript for details).


marine_euphotic.clustered.filtered.fasta.gz

Short-read environmental sequences from marine euphotic layer (surface + DCM) corresponding the V4 fragment of the 18S gene. Sequences have been clustered at 97% similarity, and low abundance sequences filtered out to be conservative (see manuscript for details).


marine_aphotic.clustered.filtered.fasta.gz

Short-read environmental sequences from marine aphotic waters (mesopelagic + bathypelagic) corresponding the V4 fragment of the 18S gene. Sequences have been clustered at 97% similarity, and low abundance sequences filtered out to be conservative (see manuscript for details).


long_read.18S.otus.fasta.gz

Long read environmental sequence OTUs generated in this study and taxonomically annotated against the PR2_transitions database. Header indicates sequence ID, number of reads, sample, environment, and taxonomy. 18S sequences only.


long_read.28S.otus.fasta.gz

Long read environmental sequence OTUs generated in this study and taxonomically annotated against the PR2_transitions database. Header indicates sequence ID, number of reads, sample, environment, and taxonomy. 28S sequences only.



History

Usage metrics

Licence

Exports