Prediction of Breast Cancer Treatment-Induced Fatigue by Machine Learning Using Genome-Wide Association Data. Academic Article uri icon

Overview

abstract

  • BACKGROUND: We aimed at predicting fatigue after breast cancer treatment using machine learning on clinical covariates and germline genome-wide data. METHODS: We accessed germline genome-wide data of 2799 early-stage breast cancer patients from the Cancer Toxicity study (NCT01993498). The primary endpoint was defined as scoring zero at diagnosis and higher than quartile 3 at 1 year after primary treatment completion on European Organization for Research and Treatment of Cancer quality-of-life questionnaires for Overall Fatigue and on the multidimensional questionnaire for Physical, Emotional, and Cognitive fatigue. First, we tested univariate associations of each endpoint with clinical variables and genome-wide variants. Then, using preselected clinical (false discovery rate < 0.05) and genomic (P < .001) variables, a multivariable preconditioned random-forest regression model was built and validated on a hold-out subset to predict fatigue. Gene set enrichment analysis identified key biological correlates (MetaCore). All statistical tests were 2-sided. RESULTS: Statistically significant clinical associations were found only with Emotional and Cognitive Fatigue, including receipt of chemotherapy, anxiety, and pain. Some single nucleotide polymorphisms had some degree of association (P < .001) with the different fatigue endpoints, although there were no genome-wide statistically significant (P < 5.00 × 10-8) associations. Only for Cognitive Fatigue, the predictive ability of the genomic multivariable model was statistically significantly better than random (area under the curve = 0.59, P = .01) and marginally improved with clinical variables (area under the curve = 0.60, P = .005). Single nucleotide polymorphisms found to be associated (P < .001) with Cognitive Fatigue belonged to genes linked to inflammation (false discovery rate adjusted P = .03), cognitive disorders (P = 1.51 × 10-12), and synaptic transmission (P = 6.28 × 10-8). CONCLUSIONS: Genomic analyses in this large cohort of breast cancer survivors suggest a possible genetic role for severe Cognitive Fatigue that warrants further exploration.

authors

  • Lee, Sangkyu
  • Deasy, Joseph
  • Oh, Jung Hun
  • Di Meglio, Antonio
  • Dumas, Agnes
  • Menvielle, Gwenn
  • Charles, Cecile
  • Boyault, Sandrine
  • Rousseau, Marina
  • Besse, Celine
  • Thomas, Emilie
  • Boland, Anne
  • Cottu, Paul
  • Tredan, Olivier
  • Levy, Christelle
  • Martin, Anne-Laure
  • Everhard, Sibille
  • Ganz, Patricia A
  • Partridge, Ann H
  • Michiels, Stefan
  • Deleuze, Jean-François
  • Andre, Fabrice
  • Vaz-Luis, Ines

publication date

  • May 11, 2020

Identity

PubMed Central ID

  • PMC7583150

Scopus Document Identifier

  • 85101394371

Digital Object Identifier (DOI)

  • 10.1093/jncics/pkaa039

PubMed ID

  • 33490863

Additional Document Info

volume

  • 4

issue

  • 5