Comprehensive transcriptome analysis using synthetic long-read sequencing reveals molecular co-association of distant splicing events. Academic Article uri icon

Overview

abstract

  • Alternative splicing shapes mammalian transcriptomes, with many RNA molecules undergoing multiple distant alternative splicing events. Comprehensive transcriptome analysis, including analysis of exon co-association in the same molecule, requires deep, long-read sequencing. Here we introduce an RNA sequencing method, synthetic long-read RNA sequencing (SLR-RNA-seq), in which small pools (≤1,000 molecules/pool, ≤1 molecule/gene for most genes) of full-length cDNAs are amplified, fragmented and short-read-sequenced. We demonstrate that these RNA sequences reconstructed from the short reads from each of the pools are mostly close to full length and contain few insertion and deletion errors. We report many previously undescribed isoforms (human brain: ∼13,800 affected genes, 14.5% of molecules; mouse brain ∼8,600 genes, 18% of molecules) and up to 165 human distant molecularly associated exon pairs (dMAPs) and distant molecularly and mutually exclusive pairs (dMEPs). Of 16 associated pairs detected in the mouse brain, 9 are conserved in human. Our results indicate conserved mechanisms that can produce distant but phased features on transcript and proteome isoforms.

authors

  • Tilgner, Hagen
  • Jahanbani, Fereshteh
  • Blauwkamp, Tim
  • Moshrefi, Ali
  • Jaeger, Erich
  • Chen, Feng
  • Harel, Itamar
  • Bustamante, Carlos D
  • Rasmussen, Morten
  • Snyder, Michael P

publication date

  • May 18, 2015

Research

keywords

  • Alternative Splicing
  • Gene Expression Profiling
  • Sequence Analysis, RNA

Identity

PubMed Central ID

  • PMC4832928

Scopus Document Identifier

  • 84929697731

Digital Object Identifier (DOI)

  • 10.1038/nbt.3242

PubMed ID

  • 25985263

Additional Document Info

volume

  • 33

issue

  • 7