Systematic identification of statistically significant network measures. Academic Article uri icon

Overview

abstract

  • We present a graph embedding space (i.e., a set of measures on graphs) for performing statistical analyses of networks. Key improvements over existing approaches include discovery of "motif hubs" (multiple overlapping significant subgraphs), computational efficiency relative to subgraph census, and flexibility (the method is easily generalizable to weighted and signed graphs). The embedding space is based on scalars, functionals of the adjacency matrix representing the network. Scalars are global, involving all nodes; although they can be related to subgraph enumeration, there is not a one-to-one mapping between scalars and subgraphs. Improvements in network randomization and significance testing--we learn the distribution rather than assuming Gaussianity--are also presented. The resulting algorithm establishes a systematic approach to the identification of the most significant scalars and suggests machine-learning techniques for network classification.

publication date

  • January 10, 2005

Research

keywords

  • Computational Biology
  • Neural Networks, Computer

Identity

Scopus Document Identifier

  • 41349106724

PubMed ID

  • 15697661

Additional Document Info

volume

  • 71

issue

  • 1 Pt 2