Empirical Bayes conditional independence graphs for regulatory network recovery. Academic Article uri icon

Overview

abstract

  • MOTIVATION: Computational inference methods that make use of graphical models to extract regulatory networks from gene expression data can have difficulty reconstructing dense regions of a network, a consequence of both computational complexity and unreliable parameter estimation when sample size is small. As a result, identification of hub genes is of special difficulty for these methods. METHODS: We present a new algorithm, Empirical Light Mutual Min (ELMM), for large network reconstruction that has properties well suited for recovery of graphs with high-degree nodes. ELMM reconstructs the undirected graph of a regulatory network using empirical Bayes conditional independence testing with a heuristic relaxation of independence constraints in dense areas of the graph. This relaxation allows only one gene of a pair with a putative relation to be aware of the network connection, an approach that is aimed at easing multiple testing problems associated with recovering densely connected structures. RESULTS: Using in silico data, we show that ELMM has better performance than commonly used network inference algorithms including GeneNet, ARACNE, FOCI, GENIE3 and GLASSO. We also apply ELMM to reconstruct a network among 5492 genes expressed in human lung airway epithelium of healthy non-smokers, healthy smokers and individuals with chronic obstructive pulmonary disease assayed using microarrays. The analysis identifies dense sub-networks that are consistent with known regulatory relationships in the lung airway and also suggests novel hub regulatory relationships among a number of genes that play roles in oxidative stress and secretion. AVAILABILITY AND IMPLEMENTATION: Software for running ELMM is made available at http://mezeylab.cb.bscb.cornell.edu/Software.aspx. CONTACT: ramimahdi@yahoo.com or jgm45@cornell.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

publication date

  • June 8, 2012

Research

keywords

  • Algorithms
  • Bayes Theorem
  • Gene Expression Profiling
  • Gene Regulatory Networks

Identity

PubMed Central ID

  • PMC3400959

Scopus Document Identifier

  • 84865153246

Digital Object Identifier (DOI)

  • 10.1093/bioinformatics/bts312

PubMed ID

  • 22685074

Additional Document Info

volume

  • 28

issue

  • 15