Empirical insights into the stochasticity of small RNA sequencing. Academic Article uri icon

Overview

abstract

  • The choice of stochasticity distribution for modeling the noise distribution is a fundamental assumption for the analysis of sequencing data and consequently is critical for the accurate assessment of biological heterogeneity and differential expression. The stochasticity of RNA sequencing has been assumed to follow Poisson distributions. We collected microRNA sequencing data and observed that its stochasticity is better approximated by gamma distributions, likely because of the stochastic nature of exponential PCR amplification. We validated our findings with two independent datasets, one for microRNA sequencing and another for RNA sequencing. Motivated by the gamma distributed stochasticity, we provided a simple method for the analysis of RNA sequencing data and showed its superiority to three existing methods for differential expression analysis using three data examples of technical replicate data and biological replicate data.

publication date

  • April 7, 2016

Research

keywords

  • MicroRNAs
  • Sequence Analysis, RNA

Identity

PubMed Central ID

  • PMC4823707

Scopus Document Identifier

  • 84962798297

Digital Object Identifier (DOI)

  • 10.1038/srep24061

PubMed ID

  • 27052356

Additional Document Info

volume

  • 6