Robust identification of transcriptional regulatory networks using a Gibbs sampler on outlier sum statistic. Academic Article

Overview

abstract

  • MOTIVATION: Identification of transcriptional regulatory networks (TRNs) is of significant importance in computational biology for cancer research, providing a critical building block to unravel disease pathways. However, existing methods for TRN identification suffer from the inclusion of excessive 'noise' in microarray data and false-positives in binding data, especially when applied to human tumor-derived cell line studies. More robust methods that can counteract the imperfection of data sources are therefore needed for reliable identification of TRNs in this context.
  • RESULTS: In this article, we propose to establish a link between the quality of one target gene to represent its regulator and the uncertainty of its expression to represent other target genes. Specifically, an outlier sum statistic was used to measure the aggregated evidence for regulation events between target genes and their corresponding transcription factors. A Gibbs sampling method was then developed to estimate the marginal distribution of the outlier sum statistic, hence, to uncover underlying regulatory relationships. To evaluate the effectiveness of our proposed method, we compared its performance with that of an existing sampling-based method using both simulation data and yeast cell cycle data. The experimental results show that our method consistently outperforms the competing method in different settings of signal-to-noise ratio and network topology, indicating its robustness for biological applications. Finally, we applied our method to breast cancer cell line data and demonstrated its ability to extract biologically meaningful regulatory modules related to estrogen signaling and action in breast cancer.
  • AVAILABILITY AND IMPLEMENTATION: The Gibbs sampler MATLAB package is freely available at http://www.cbil.ece.vt.edu/software.htm.
  • CONTACT: xuan@vt.edu
  • SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
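
For readers unfamiliar with the term, the sketch below illustrates one common generic form of an outlier sum statistic: values are standardized by the median and the median absolute deviation (MAD), and only the standardized values above the upper quartile plus one interquartile range are summed. This is an assumption-laden illustration, not the authors' implementation. The abstract does not give the exact formulation used for TRN identification, and the Gibbs sampling step is not shown. The function names `quantile` and `outlier_sum` are hypothetical.

```python
import statistics


def quantile(sorted_vals, q):
    """Linearly interpolated quantile of an already sorted list."""
    pos = q * (len(sorted_vals) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    frac = pos - lo
    return sorted_vals[lo] * (1 - frac) + sorted_vals[hi] * frac


def outlier_sum(values):
    """Generic outlier sum of one expression profile (illustrative only).

    Assumption: a simplified single-group form; the paper's own statistic
    for target gene / transcription factor evidence may differ.
    """
    med = statistics.median(values)
    mad = statistics.median([abs(v - med) for v in values])
    if mad == 0:
        # A constant (or nearly constant) profile has no measurable outliers.
        return 0.0
    # Robust standardization: center by the median, scale by the MAD.
    z = [(v - med) / mad for v in values]
    z_sorted = sorted(z)
    q25 = quantile(z_sorted, 0.25)
    q75 = quantile(z_sorted, 0.75)
    threshold = q75 + (q75 - q25)  # upper quartile plus one IQR
    # Sum only the standardized values that exceed the threshold.
    return sum(x for x in z if x > threshold)


if __name__ == "__main__":
    profile = [1.0, 2.0, 3.0, 4.0, 5.0, 100.0]
    print(outlier_sum(profile))
```

Because the median and MAD are insensitive to a few extreme values, a single strongly deviating measurement dominates the sum while a profile without extreme values scores zero, which is the kind of robustness to noisy data that the abstract emphasizes.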

publication date

  • May 17, 2012

Research

keywords

  • Breast Neoplasms
  • Computational Biology
  • Gene Regulatory Networks
  • Software
  • Transcription Factors

Identity

PubMed Central ID

  • PMC3400952

Scopus Document Identifier

  • 84865148118

Digital Object Identifier (DOI)

  • 10.1093/bioinformatics/bts296

PubMed ID

  • 22595208

Additional Document Info

volume

  • 28

issue

  • 15