This dataset supports the article "Plant expansion drives bacteria and collembola communities under winter climate change in frost-affected tundra" by E.J. Krab et al. (2019, Soil Biology and Biochemistry).
The Jupyter notebook processes raw sequence data (ENA accession number PRJEB26326) into files used downstream for figures production.
The R script generates figures found throughout the article.
Extract both archives into a given folder.
- Running the /R/SBB_NSC_script.R from the /R/ folder will reproduce the figures shown in the article.
-
Running the /Bioinformatics/SBB_NSC_script.ipynb will produce a /R_new/ folder from
which the R script can be run to produce the same figures from scratch.
The randomness introduced by the rarefaction step may induce small
changes and invert NMDS axes direction in these new bacteria figures.
- Bioinformatics.zip : Raw sequence data processing
Archive content:
SBB_NSC_script.ipynb - Jupyter notebook with bioinformatics pipeline
EDIT: Because of sequence headers being modified at ENA, the script works properly when downloading raw data with enaGroupGet using the parameter "-f submitted" but not when downloading manually or through ftp
Mapping_NSC.txt - QIIME mapping file
/scripts/cleaner.awk - custom awk script to clean file headers inherited from biom format
/scripts/Control_otu_removal.R - custom executable R script to remove the 3 OTUs abundant in control samples
/scripts/gold.fa - GOLD database (Haas et al. 2011 https://dx.doi.org/10.1101/gr.112730.110)
/scripts/relabel.awk - custom awk script to relabel reads according to barcode label
/scripts/remove_primers.awk - custom awk script to trim primers
/scripts/python_scipts/: Robert C. Edgar's suite of python scripts for UPARSE: http://drive5.com/python/
/scripts/rar/clean_tsv_from_biom_rarefied_tables.sh - custom bash script to remove biom-inherited headers from rarefied tables (batch)
/scripts/rar/convert_table.sh - custom bash script to convert biom tables into tsv (batch)
/scripts/rar/make_consensus_from_rarefactions.R - custom executable R script to average multiple rarefactions into a consensus table
Funding
Vetenskaprsrådet 621-2011-5444
Formas 214-2011-788
Knut och Alice Wallenberg's fundation Academy Fellowship 2012.0152