|Run||Spots||Bases||Size||GC content||Published||Access Type|
This run has 2 reads per spot:
|L=4, 100%||L̅=230, σ=43.3, 100%|
Legend: a read is either a technical read or an application read. "L=4, 100%" means the length is 4 and 100% of spots contain this read. "L̅=165, σ=92.8, 66%" means the average length is 165, the standard deviation is 92.8, and 66% of spots contain this read.
|SAMN00000012 (SRS000012)||Catenibacterium mitsuokai (GenBank Accession Number for 16S rDNA gene: AB030221) is a member of the Firmicutes division of the domain Bacteria. In one comprehensive 16S rDNA sequence-based enumeration of the colonic microbiota of three healthy adult humans, it represents, on average, 0.482% of all 16S rDNA sequences and 0.095% of the sequences in its division (Eckburg et al. (2005)).||451640|
|PRJNA19923||SRP000012||Reference genome for the Human Microbiome Project|
Use the SRA Toolkit tools to operate directly on SRA runs. The toolkit can find the requested runs at NCBI and download (and cache) only the part you actually need. For example, quality scores make up the majority of the data volume, and you may not need them if you dump FASTA only (rather than FASTQ). Likewise, if you are looking at a particular gene, you may not need the reads aligned to other regions or the reads that are not aligned at all.
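As a rough illustration of the FASTA-only case, the sketch below builds a `fastq-dump` command line in Python. It assumes the toolkit's `fastq-dump` with its `--fasta` and `--aligned-region` options is installed and on the `PATH`. The accession `SRR0000000`, the region string, and the helper names are hypothetical placeholders, since this page does not show the run accession.

```python
import shutil
import subprocess


def build_fasta_dump_command(run_accession, line_width=0, aligned_region=None):
    """Return a fastq-dump argument list that writes FASTA without qualities.

    line_width is the FASTA wrap width (0 asks for no wrapping).
    aligned_region, if given, is passed to --aligned-region as name[:from-to].
    """
    cmd = ["fastq-dump", "--fasta", str(line_width)]
    if aligned_region is not None:
        cmd += ["--aligned-region", aligned_region]
    cmd.append(run_accession)
    return cmd


def run_if_available(cmd):
    """Run cmd only if its executable is on PATH; otherwise return None."""
    if shutil.which(cmd[0]) is None:
        return None
    return subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Hypothetical accession: replace with the real run accession.
    command = build_fasta_dump_command("SRR0000000")
    print(" ".join(command))
```

Building the argument list separately from running it makes it easy to inspect the command before any data is downloaded.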
Use the SRA Toolkit prefetch utility if you want to cache all data in advance (for example, if your processing cluster is not connected to the internet). Read more at Downloading SRA data using command line utilities.
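A minimal sketch of caching runs ahead of time, assuming the `prefetch` executable is on the `PATH` of a machine with internet access. The accessions and helper names below are hypothetical placeholders, not values taken from this page.

```python
import shutil
import subprocess


def build_prefetch_command(run_accessions):
    """Return a prefetch argument list for one or more run accessions."""
    accessions = list(run_accessions)
    if not accessions:
        raise ValueError("at least one run accession is required")
    return ["prefetch", *accessions]


def prefetch_runs(run_accessions):
    """Download runs into the local cache; return False if prefetch is missing."""
    cmd = build_prefetch_command(run_accessions)
    if shutil.which(cmd[0]) is None:
        return False
    subprocess.run(cmd, check=True)
    return True


if __name__ == "__main__":
    # Hypothetical accessions: replace with the runs you need offline.
    print(" ".join(build_prefetch_command(["SRR0000000", "SRR0000001"])))
```

Running this on a connected node before moving work to an offline cluster lets later toolkit calls find the data locally, provided the cache location is visible to the cluster.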