Using weak supervision and deep learning to classify clinical notes for identification of current suicidal ideation.

Overview

abstract

Mental health concerns, such as suicidal thoughts, are frequently documented by providers in clinical notes, as opposed to structured coded data. In this study, we evaluated weakly supervised methods for detecting "current" suicidal ideation from unstructured clinical notes in electronic health record (EHR) systems. Weakly supervised machine learning methods leverage imperfect labels for training, alleviating the burden of creating a large manually annotated dataset. After identifying a cohort of 600 patients at risk for suicidal ideation, we used a rule-based natural language processing approach (NLP) approach to label the training and validation notes (n = 17,978). Using this large corpus of clinical notes, we trained several statistical machine learning models-logistic classifier, support vector machines (SVM), Naive Bayes classifier-and one deep learning model, namely a text classification convolutional neural network (CNN), to be evaluated on a manually-reviewed test set (n = 837). The CNN model outperformed all other methods, achieving an overall accuracy of 94% and a F1-score of 0.82 on documents with "current" suicidal ideation. This algorithm correctly identified an additional 42 encounters and 9 patients indicative of suicidal ideation but missing a structured diagnosis code. When applied to a random subset of 5,000 clinical notes, the algorithm classified 0.46% (n = 23) for "current" suicidal ideation, of which 87% were truly indicative via manual review. Implementation of this approach for large-scale document screening may play an important role in point-of-care clinical information systems for targeted suicide prevention interventions and improve research on the pathways from ideation to attempt.

authors

Cusick, Marika

Adekkanattu, Prakash

Campion, Thomas
Sholle, Evan
Myers, Annie
Banerjee, Samprit
Alexopoulos, George S
Wang, Yanshan
Pathak, Jyotishman

publication date

February 2, 2021

published in

Journal of psychiatric research Journal

Research

keywords

Deep Learning
Suicidal Ideation

Identity

PubMed Central ID

PMC8009838

Scopus Document Identifier

85100637149

Digital Object Identifier (DOI)

10.1016/j.jpsychires.2021.01.052

PubMed ID

33581461

Additional Document Info

has global citation frequency

58

volume

136

VIVO Weill Cornell Medical College

Using weak supervision and deep learning to classify clinical notes for identification of current suicidal ideation. Academic Article

Overview

abstract

authors

publication date

published in

Research

keywords

Identity

PubMed Central ID

Scopus Document Identifier

Digital Object Identifier (DOI)

PubMed ID

Additional Document Info

has global citation frequency

volume