|PRJEB11474||ERP012868||Integrated 'omics', targeted metabolite and single-cell analyses of Arctic snow algae functionality and adaptability|
Snow algae are poly-extremophilic microalgae and important primary colonisers and producers on glaciers and snow fields. Depending on their pigmentation they cause green or red mass blooms during the melt season. This decreases surface albedo and thus further enhances snow and ice melting. Although the phenomenon of snow algal blooms has been known for a long time, large aspects of their physiology and ecology sill remain cryptic. This study provides the first in-depth and multi-omics investigation of two very striking adjacent green and red snow fields on a glacier in Svalbard. We have assessed the algal community composition of green and red snow including their associated microbiota, i.e., bacteria and archaea, their metabolic profiles (targeted and non-targeted metabolites) on the bulk and single-cell level, and assessed the feedbacks between the algae and their physico-chemical environment including liquid water content, pH, albedo and nutrient availability. We demonstrate that green and red snow clearly vary in their physico-chemical environment, their microbial community composition and their metabolic profiles. For the algae this likely reflects both different stages of their life cycles and their adaptation strategies. Green snow represents a wet, carbon and nutrient rich environment and is dominated by the algae Microglena sp. with a metabolic profile that is characterized by key metabolites involved in growth and proliferation. In contrast, the dry and nutrient poor red snow habitat is colonised by various Chloromonas species with a high abundance of storage and reserve metabolites likely to face upcoming severe conditions. Combining a multitude of techniques we demonstrate the power of such complementary approaches in elucidating the function and ecology of extremophiles such as green and red snow algal blooms, which play crucial roles in glacial ecosystems.
You need SRA Toolkit to operate on SRA runs.
Default toolkit configuration enables it to find and retrieve SRA runs by accession. It also downloads (and cache) only the part of data you really need. For example quality scores represent a majority of data volume and you may not need them if you dump fasta only (versus fastq). Or if you are looking at particular gene you may not need reads aligned to other regions or not aligned at all. Same way if you use GATK with enabled SRA support you need only SRA run accessions to fire your process.
fastq-dump will dump reads in a number of "standard" fastq and fasta formats.
vdb-dump is also capable of producing fasta and fastq (beside other formats). It dumps data much faster then fastq-dump but ordering of reads may be different and it does not produce split-read multi-file output.
Prefetch tool will help you cache all data in advance if you plan to run data analysis in environment where getting data from NCBI at run time is unfeasible.
Read more at SRA Knowledge Base on how to download SRA data using command line utilities.
In addition to it you can download the following data:
|Type||Size||Name||Free Egress||Access Type|
The sections below show results of analysis run by software which is still in experimental stage. Please use provided results with a boatload of salt and let us know what you think.
-- SRA team
- Unidentified reads: 100%