|Run||Spots||Bases||Size||GC content||Published||Access Type|
This run has 2 reads per spot:
|L=101, 100%||L=101, 100%|
Legend: reads are marked as technical reads or application reads. "L=4, 100%" means the read length is 4 and 100% of spots contain this read. "L̅=165, σ=92.8, 66%" means the average length is 165, the standard deviation is 92.8, and 66% of spots contain this read.
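The layout above (two reads per spot, each L=101 and present in 100% of spots) fixes how the spot count relates to the base count. A minimal sketch of that arithmetic follows; the function names and the spot count are illustrative assumptions, since the page does not give the run's spot total.

```python
# Illustrative arithmetic only: relates spots to bases for a fixed read layout.
# READ_LENGTHS comes from the layout row above; the spot count is hypothetical.

def bases_per_spot(read_lengths):
    """Sum of read lengths in one spot, assuming every read is always present."""
    return sum(read_lengths)


def total_bases(spots, read_lengths):
    """Total bases in a run whose spots all share the same read layout."""
    return spots * bases_per_spot(read_lengths)


READ_LENGTHS = [101, 101]       # two reads per spot, L=101 each
EXAMPLE_SPOTS = 1_000_000       # hypothetical value, not taken from this run

print(bases_per_spot(READ_LENGTHS))
print(total_bases(EXAMPLE_SPOTS, READ_LENGTHS))
```

This holds only because both reads appear in 100% of spots at a fixed length. For a variable-length read, multiply the average length by the fraction of spots containing it instead.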
|PRJNA306683||SRP067661||Distinct routes of lineage development reshape the human blood hierarchy across ontogeny|
In a classical view of hematopoiesis, the various blood cell lineages arise via a hierarchical scheme starting with multipotent stem cells that become increasingly restricted in their differentiation potential through oligopotent and then unipotent progenitors. We developed a cell-sorting scheme to resolve myeloid (My), erythroid (Er), and megakaryocytic (Mk) fates from single CD34+ cells and then mapped the progenitor hierarchy across human development. Fetal liver contained large numbers of distinct oligopotent progenitors with intermingled My, Er and Mk fates. However, few oligopotent progenitor intermediates were present in the adult bone marrow. Instead, only two progenitor classes predominate, multipotent and unipotent, with Er-Mk lineages emerging from multipotent cells. The developmental shift to an adult "two-tier" hierarchy challenges current dogma and provides a revised framework to understand normal and disease states of human hematopoiesis. Overall design: Using SMARTseq, the expression profiles of the new subsets identified in the linked study (12 samples in duplicate) were analyzed using RNA sequencing. The indicated subsets are all derived from neonatal cord blood.
You need SRA Toolkit to operate on SRA runs.
The default toolkit configuration lets it find and retrieve SRA runs by accession. It also downloads (and caches) only the part of the data you actually need. For example, quality scores account for most of the data volume, and you may not need them if you dump fasta only (rather than fastq). Likewise, if you are looking at a particular gene, you may not need reads aligned to other regions or reads that are not aligned at all. Similarly, if you use GATK with SRA support enabled, you need only the SRA run accessions to start your process.
fastq-dump will dump reads in a number of "standard" fastq and fasta formats.
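As a rough illustration of the fastq and fasta choices, the sketch below builds a fastq-dump command line without running it. The helper name `fastq_dump_command` and the accession `SRR0000000` are placeholders (this page does not list the run accession). The flags used (`--outdir`, `--split-files`, `--fasta`, `--gzip`) are common fastq-dump options, but check `fastq-dump --help` for your toolkit version.

```python
import shlex


def fastq_dump_command(accession, outdir=".", split_files=True,
                       fasta=False, gzip=False):
    """Build (but do not run) an argv list for fastq-dump.

    Hypothetical helper for illustration; it only assembles arguments.
    """
    cmd = ["fastq-dump", "--outdir", outdir]
    if split_files:
        # Write each read of a spot to its own file (e.g. _1 and _2).
        cmd.append("--split-files")
    if fasta:
        # FASTA output drops quality scores; 0 means no line wrapping.
        cmd += ["--fasta", "0"]
    if gzip:
        cmd.append("--gzip")
    cmd.append(accession)
    return cmd


ACCESSION = "SRR0000000"  # placeholder accession, not this run's real ID

# Paired fastq files, one per read.
print(shlex.join(fastq_dump_command(ACCESSION, outdir="reads")))
# FASTA only, which avoids fetching quality scores.
print(shlex.join(fastq_dump_command(ACCESSION, outdir="reads", fasta=True)))
```

Building the argv as a list keeps it safe to pass to `subprocess.run` later without shell quoting issues.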
vdb-dump can also produce fasta and fastq (besides other formats). It dumps data much faster than fastq-dump, but the ordering of reads may differ and it does not produce split-read multi-file output.
The prefetch tool helps you cache all data in advance if you plan to run data analysis in an environment where getting data from NCBI at run time is infeasible.
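The cache-then-analyze pattern might look like the sketch below: run prefetch while network access is available, then dump from the local copy. The function `fetch_then_dump` and the accession `SRR0000000` are hypothetical. The sketch also assumes that fastq-dump resolves the prefetched copy by accession, which depends on toolkit version and configuration. It defaults to a dry run so nothing is executed unless requested.

```python
import shutil
import subprocess


def fetch_then_dump(accession, outdir="reads", dry_run=True):
    """Return the two commands of a prefetch-then-dump workflow.

    With dry_run=False the commands are executed in order; this requires
    SRA Toolkit on PATH. Hypothetical helper for illustration.
    """
    steps = [
        # Step 1: cache the run locally while NCBI is reachable.
        ["prefetch", accession],
        # Step 2: convert the cached run to fastq, one file per read.
        ["fastq-dump", "--split-files", "--outdir", outdir, accession],
    ]
    if not dry_run:
        for cmd in steps:
            if shutil.which(cmd[0]) is None:
                raise FileNotFoundError(
                    f"{cmd[0]} not found on PATH; is SRA Toolkit installed?"
                )
            subprocess.run(cmd, check=True)
    return steps


# Dry run: only print what would be executed.
for step in fetch_then_dump("SRR0000000"):  # placeholder accession
    print(" ".join(step))
```

On a cluster without outbound network access, step 1 would typically run on a login or transfer node and step 2 on the compute node.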
Read more at SRA Knowledge Base on how to download SRA data using command line utilities.