Subphenotyping depression using machine learning and electronic health records.

Overview

abstract

OBJECTIVE: To identify depression subphenotypes from Electronic Health Records (EHRs) using machine learning methods, and analyze their characteristics with respect to patient demographics, comorbidities, and medications. MATERIALS AND METHODS: Using EHRs from the INSIGHT Clinical Research Network (CRN) database, multiple machine learning (ML) algorithms were applied to analyze 11 275 patients with depression to discern depression subphenotypes with distinct characteristics. RESULTS: Using the computational approaches, we derived three depression subphenotypes: Phenotype_A (n = 2791; 31.35%) included patients who were the oldest (mean (SD) age, 72.55 (14.93) years), had the most comorbidities, and took the most medications. The most common comorbidities in this cluster of patients were hyperlipidemia, hypertension, and diabetes. Phenotype_B (mean (SD) age, 68.44 (19.09) years) was the largest cluster (n = 4687; 52.65%), and included patients suffering from moderate loss of body function. Asthma, fibromyalgia, and Chronic Pain and Fatigue (CPF) were common comorbidities in this subphenotype. Phenotype_C (n = 1452; 16.31%) included patients who were younger (mean (SD) age, 63.47 (18.81) years), had the fewest comorbidities, and took fewer medications. Anxiety and tobacco use were common comorbidities in this subphenotype. CONCLUSION: Computationally deriving depression subtypes can provide meaningful insights and improve understanding of depression as a heterogeneous disorder. Further investigation is needed to assess the utility of these derived phenotypes to inform clinical trial design and interpretation in routine patient care.

authors

Xu, Zhenxing
Wang, Fei
Adekkanattu, Prakash
Bose, Budhaditya
Vekaria, Veer
Brandt, Pascal
Jiang, Guoqian
Kiefer, Richard C
Luo, Yuan
Pacheco, Jennifer A
Rasmussen, Luke V
Xu, Jie
Alexopoulos, George S
Pathak, Jyotishman

publication date

August 3, 2020

published in

Learning health systems Journal

Identity

PubMed Central ID

PMC7556423

Scopus Document Identifier

85088824128

Digital Object Identifier (DOI)

10.1002/lrh2.10241

PubMed ID

33083540

Additional Document Info

has global citation frequency

20

volume

4

issue

4

VIVO Weill Cornell Medical College

Subphenotyping depression using machine learning and electronic health records. Academic Article

Overview

abstract

authors

publication date

published in

Identity

PubMed Central ID

Scopus Document Identifier

Digital Object Identifier (DOI)

PubMed ID

Additional Document Info

has global citation frequency

volume

issue