figshare
Browse
1/1
2 files

Plant expansion drives bacteria and collembola communities under winter climate change in frost-affected tundra

Download all (3.2 MB)
Version 2 2020-07-02, 08:17
Version 1 2019-08-27, 11:54
dataset
posted on 2020-07-02, 08:17 authored by Eveline J. Krab, Sylvain MonteuxSylvain Monteux, James WeedonJames Weedon, Ellen Dorrepaal
This dataset supports the article "Plant expansion drives bacteria and collembola communities under winter climate change in frost-affected tundra" by E.J. Krab et al. (2019, Soil Biology and Biochemistry).
The Jupyter notebook processes raw sequence data (ENA accession number PRJEB26326) into files used downstream for figures production.
The R script generates figures found throughout the article.

Extract both archives into a given folder.
- Running the /R/SBB_NSC_script.R from the /R/ folder will reproduce the figures shown in the article.
- Running the /Bioinformatics/SBB_NSC_script.ipynb will produce a /R_new/ folder from which the R script can be run to produce the same figures from scratch. The randomness introduced by the rarefaction step may induce small changes and invert NMDS axes direction in these new bacteria figures.


Folder structure and files description:
- R.zip : Figure-generation script and data
Archive content:
SBB_NSC_script.R - Figure generation script
/Data/global_wunifrac2000_clean.txt - bacteria weighted UniFrac distance matrix
/Data/global2000_alphadiv.txt - bacteria alpha diversity metrics
/Data/global2000_ggtax_L2.txt - bacteria taxa summary
/Data/Mapping_NSC.csv - metadata for bacterial samples
/Data/NSC_count_and_met.csv - collembola abundances and metadata
/Data/NSC_richness.csv - collembola alpha diversity
/Data/snow_depth_20130326.csv - snow depth measurements
/Data/snow_depth_20140403.csv - snow depth measurements
/Data/snow_depth_20140429.csv - snow depth measurements
/Data/wunifrac_deep.txt - bacteria weighted UniFrac distance matrix (deep samples only)
/Data/wunifrac_surf.txt - bacteria weighted UniFrac distance matrix (surface samples only)

- Bioinformatics.zip : Raw sequence data processing
Archive content:
SBB_NSC_script.ipynb - Jupyter notebook with bioinformatics pipeline
EDIT: Because of sequence headers being modified at ENA, the script works properly when downloading raw data with enaGroupGet using the parameter "-f submitted" but not when downloading manually or through ftp
Mapping_NSC.txt - QIIME mapping file
/scripts/cleaner.awk - custom awk script to clean file headers inherited from biom format
/scripts/Control_otu_removal.R - custom executable R script to remove the 3 OTUs abundant in control samples
/scripts/gold.fa - GOLD database (Haas et al. 2011 https://dx.doi.org/10.1101/gr.112730.110)
/scripts/relabel.awk - custom awk script to relabel reads according to barcode label
/scripts/remove_primers.awk - custom awk script to trim primers
/scripts/python_scipts/: Robert C. Edgar's suite of python scripts for UPARSE: http://drive5.com/python/
/scripts/rar/clean_tsv_from_biom_rarefied_tables.sh - custom bash script to remove biom-inherited headers from rarefied tables (batch)
/scripts/rar/convert_table.sh - custom bash script to convert biom tables into tsv (batch)
/scripts/rar/make_consensus_from_rarefactions.R - custom executable R script to average multiple rarefactions into a consensus table

Funding

Vetenskaprsrådet 621-2011-5444

Formas 214-2011-788

Knut och Alice Wallenberg's fundation Academy Fellowship 2012.0152

History