skip to main content

Transcript assembly and abundance estimation from RNA-Seq reveals thousands of new transcripts and switching among isoforms

Identifiers: SRA: SRP002119
BioProject: PRJNA124751
GEO: GSE20846
Study Type: 
Transcriptome Analysis
Abstract: We introduce an approach to transcript discovery coupled with a statistical model for RNA-Seq experiments that produces estimates of transcript abundances. Our algorithms are implemented in an open source software program called Cufflinks. To test Cufflinks, we sequenced and analyzed more than 430 million paired 75bp RNA-Seq reads from a mouse myoblast cell line representing a differentiation timeseries. We detected 13,689 known transcripts and 3,724 previously unannotated ones, 62% of which are supported by independent expression data or by homologous genes in other species. Analysis of transcript expression over the timeseries revealed complete switches in the dominant transcription start site (TSS) or splice-isoform in 330 genes, along with more subtle shifts in a further 1,304 genes. These dynamics suggest substantial regulatory flexibility and complexity in this well-studied model of muscle development. Overall design: Timeseries of C2C12 myoblast RNA-Seq
Center Project: GSE20846
External Link: /pubmed:20436464

Related SRA data

Experiments:
4 ( 4 samples )
Runs:
10 (32.7Gbp; 17.6Gb)
Additional objects:
File type count
srf 10