Bayesian compositional regression with structured priors for microbiome feature selection. Academic Article

Overview

abstract

  • The microbiome plays a critical role in human health and disease, and there is a strong scientific interest in linking specific features of the microbiome to clinical outcomes. There are key aspects of microbiome data, however, that limit the applicability of standard variable selection methods. In particular, the observed data are compositional, as the counts within each sample have a fixed-sum constraint. In addition, microbiome features, typically quantified as operational taxonomic units, often reflect microorganisms that are similar in function, and may therefore have a similar influence on the response variable. To address the challenges posed by these aspects of the data structure, we propose a variable selection technique with the following novel features: a generalized transformation and z-prior to handle the compositional constraint, and an Ising prior that encourages the joint selection of microbiome features that are closely related in terms of their genetic sequence similarity. We demonstrate that our proposed method outperforms existing penalized approaches for microbiome variable selection in both simulation and the analysis of real data exploring the relationship of the gut microbiome to body mass index.
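    The two structural ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the example counts, the edge list, and the parameter values `a` and `b` are all hypothetical, and the Ising form used here is the generic textbook version, not necessarily the exact parameterization in the paper. The first function shows the fixed-sum constraint that makes the data compositional. The second shows how an Ising-type prior on inclusion indicators gives a higher prior score to selecting linked features together.

```python
def closure(counts):
    """Scale a vector of nonnegative counts to proportions that sum to 1."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("counts must have a positive total")
    return [c / total for c in counts]


def ising_log_prior(gamma, edges, a=-2.0, b=0.5):
    """Unnormalized log-probability of inclusion indicators (generic Ising form).

    gamma : list of 0/1 inclusion indicators, one per feature
    edges : (j, k) index pairs linking similar features (hypothetical graph)
    a     : sparsity parameter; more negative favors fewer selected features
    b     : smoothing parameter; positive rewards selecting linked features jointly
    """
    sparsity = a * sum(gamma)
    smoothness = b * sum(gamma[j] * gamma[k] for j, k in edges)
    return sparsity + smoothness


# Hypothetical counts for four features in one sample.
counts = [120, 30, 0, 50]
props = closure(counts)  # proportions carry only relative information

# Assumed similarity graph: features 0 and 1 are related, as are 2 and 3.
edges = [(0, 1), (2, 3)]

# Selecting two linked features versus two unlinked features.
joint = ising_log_prior([1, 1, 0, 0], edges)
split = ising_log_prior([1, 0, 1, 0], edges)
```

    With a positive `b`, `joint` exceeds `split` even though both configurations select two features, which is the sense in which such a prior encourages joint selection of related features. Because each row of proportions sums to one, the features are linearly dependent, which is why unconstrained regression on them is not identifiable without a transformation or constraint.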

publication date

  • July 31, 2020

Research

keywords

  • Gastrointestinal Microbiome
  • Microbiota

Identity

PubMed Central ID

  • PMC8216648

Scopus Document Identifier

  • 85088788078

Digital Object Identifier (DOI)

  • 10.1111/biom.13335

PubMed ID

  • 32686846

Additional Document Info

volume

  • 77

issue

  • 3