Illumina sequencing of Human Microbiome Project Metagenomes Production Phase HMPZ-764224817-700024437 paired end RANDOM library(SRR061183)
|Run||Spots||Bases||Size||GC content||Published||Access Type|
|28.4M||5.7Gbp||3.3G||45.3%||2010-07-29||filtered data is public|
This run has 2 reads per spot:
|L=100, 100%||L=100, 100%|
Technical read Application Read L=4, 100% Length is 4, 100% spots contain this read ̅L=165, σ=92.8, 66% Average length is 165, standard deviation is 92.8, 66% spots contain this read
|PRJNA48479||SRP002163||Human Microbiome Project (HMP) Metagenomic WGS Projects, deeper sequencing of the human microbiome samples: Production Phase|
The HMP metagenomes production phase represents the shotgun sequencing of metagenomic DNA extracted from samples taken from multiple body sites across hundreds of human subjects. Coupled with the other data generated during the HMP project, these results will provide insights into the genes and pathways present in the human microbiome. We are learning more and more about the ways that human health is influenced by the complex and dynamic communities of microbes (the human microbiota) present on and within our bodies. Disruptions in these communities may trigger or influence the course of various disease states. Preliminary studies have shown this to be the case for certain diseases. The Human Microbiome Project expands on these studies to better understand, prevent and treat many human diseases.
- Unidentified reads: 49.43%
- Identified reads: 50.57%
|Bacteria||Alistipes putredinis DSM 17216||4.0||224,283|
|Bacteria||Prevotella copri DSM 18205||2.5||143,518|
|Bacteria||Bacteroides coprocola DSM 17136||1.7||95,293|
|Bacteria||Alistipes shahii WAL 8301||1.6||91,009|
|Bacteria||Parabacteroides merdae CL03T12C32||0.3||16,251||3.7|
|Bacteria||[Eubacterium] eligens ATCC 27750||0.3||14,261||5.0|
|Bacteria||Collinsella aerofaciens ATCC 25986||0.2||13,588||6.2|
|Bacteria||[Eubacterium] rectale ATCC 33656||0.2||11,303||3.5|
- How to read results?
Results show distribution of reads mapping to specific taxonomy nodes as a percentage of total reads within the analyzed run. In cases where a read maps to more than one related taxonomy node, the read is reported as originating from the lowest shared taxonomic node. So when a read maps to two species belonging to the same genus, it is assigned at the genus level. Sequence reads from a single organism will map to several taxonomy nodes spanning the organism’s lineage. The number of reads mapping to higher level nodes will typically be greater than those that map to terminal nodes.
STAT results are proportional to the size of sequenced genomes. Given a mixed sample containing several organisms at equal copy number, proportionally more reads originate from the larger genomes. This means that the percentages reported by STAT will reflect genome size and must be considered against the genomic complexity of the sequenced sample.
- How can I query this analysis data?
Please take a look at NCBI SRA Cloud-based Taxonomy Analysis Table