Global patterns and rates of habitat transitions across the eukaryotic tree of life
This repository contains supplementary data associated with the manuscript.
There are 8 files in this collection:
pr2.transitions.fasta.gz
The in house database based on the PR2 database used for taxonomic annotation of long-read environmental sequences. Fasta file.
soil.clustered.filtered.fasta.gz
Short-read environmental sequences from soils corresponding the V4 fragment of the 18S gene. Sequences have been clustered at 97% similarity, and low abundance sequences filtered out to be conservative (see manuscript for details).
freshwater.clustered.filtered.fasta.gz
Short-read environmental sequences from freshwater corresponding the V4 fragment of the 18S gene. Sequences have been clustered at 97% similarity, and low abundance sequences filtered out to be conservative (see manuscript for details).
marine_euphotic.clustered.filtered.fasta.gz
Short-read environmental sequences from marine euphotic layer (surface + DCM) corresponding the V4 fragment of the 18S gene. Sequences have been clustered at 97% similarity, and low abundance sequences filtered out to be conservative (see manuscript for details).
marine_aphotic.clustered.filtered.fasta.gz
Short-read environmental sequences from marine aphotic waters (mesopelagic + bathypelagic) corresponding the V4 fragment of the 18S gene. Sequences have been clustered at 97% similarity, and low abundance sequences filtered out to be conservative (see manuscript for details).
long_read.18S.otus.fasta.gz
Long read environmental sequence OTUs generated in this study and taxonomically annotated against the PR2_transitions database. Header indicates sequence ID, number of reads, sample, environment, and taxonomy. 18S sequences only.
long_read.28S.otus.fasta.gz
Long read environmental sequence OTUs generated in this study and taxonomically annotated against the PR2_transitions database. Header indicates sequence ID, number of reads, sample, environment, and taxonomy. 28S sequences only.