Leveraging Latent Dirichlet Allocation in processing free-text personal goals among patients undergoing bladder cancer surgery. Academic Article uri icon

Overview

abstract

  • PURPOSE: As we begin to leverage Big Data in health care settings and particularly in assessing patient-reported outcomes, there is a need for novel analytics to address unique challenges. One such challenge is in coding transcribed interview data, typically free-text entries of statements made during a face-to-face interview. Latent Dirichlet Allocation (LDA) offers statistical rigor and consistency in automating the interpretation of patients' expressed concerns and coping strategies. METHODS: LDA was applied to interview data collected as part of a prospective, longitudinal study of QOL in Nā€‰=ā€‰211 patients undergoing radical cystectomy and urinary diversion for bladder cancer. LDA analyzed personal goal statements to extract the latent topics and themes, stratified by time, and on things patients wanted to accomplish and prevent. Model comparison metrics determined the number of topics to extract. RESULTS: LDA extracted seven latent topics. Prior to surgery, patients' priorities were primarily in cancer surgery and recovery. Six months after the surgery, they were replaced by goals on regaining a sense of normalcy, to resume work, to enjoy life more fully, and to appreciate friends and family more. LDA model parameters showed changing priorities, e.g., immediate concerns on surgery and resuming employment decreased post-surgery and were replaced by concerns over cancer recurrence and a desire to remain healthy and strong. CONCLUSIONS: Novel Big Data analytics such as LDA offer the possibility of summarizing personal goals without the need for conventional fixed-length measures and resource-intensive qualitative data coding.

publication date

  • February 23, 2019

Research

keywords

  • Algorithms
  • Big Data
  • Cystectomy
  • Quality of Life
  • Urinary Bladder Neoplasms
  • Urinary Diversion

Identity

PubMed Central ID

  • PMC6525036

Scopus Document Identifier

  • 85062023040

Digital Object Identifier (DOI)

  • 10.1007/s11136-019-02132-w

PubMed ID

  • 30798421

Additional Document Info

volume

  • 28

issue

  • 6