Polygenic risk and rare variants in endotypes of idiopathic pulmonary fibrosis: a genetic analysis of population-based and case-control cohorts.
Academic Article
Overview
abstract
BACKGROUND: Idiopathic pulmonary fibrosis (IPF) and telomere length are both strongly linked to rare and common genetic variants. Shortened telomere length might itself be causal for IPF. We aimed to evaluate whether rare and common variants compete or cooperate to confer genetic risk of IPF uniformly. METHODS: In this genetic analysis, we used whole-genome sequencing (WGS) data from a discovery case-control cohort sequenced at Columbia University and validated findings using WGS data from Trans-Omics for Precision Medicine (TOPMed) and UK Biobank. In all cohorts, we identified rare damaging variants in disease-associated genes and computed control-normalised non-overlapping polygenic risk scores (PRS) for IPF and telomere length. We assessed the MUC5B rs35705950 single-nucleotide polymorphism (SNP), an IPF common risk variant with a large effect, independently from the polygenic scores. Telomere length in blood leukocytes was measured using a quantitative PCR assay for the discovery cohort and UK Biobank validation cohort. We conducted logistic regression (adjusting for age, sex, and principal components of ancestry) to evaluate the association between IPF risk and the MUC5B SNP, the IPF PRS excluding MUC5B (IPF-PRS-noMUC5B), and the PRS for telomere length in the overall cohort and analysed their effects in patient subgroups for IPF endotypes (carriers and non-carriers of rare variants stratified by telomere length cutoffs). To assess disease prediction, we calculated cross-validated area under the receiver operating receiver operating curve (AUC). We also compared the liability of IPF explained by genetic variables. FINDINGS: The discovery cohort was recruited between April 23, 2003 and June 19, 2019 and included 777 patients with IPF and 2905 controls. We replicated the analyses in the TOPMed (1148 patients with IPF and 5202 controls) and UK Biobank (2739 patients with IPF and 395 331 controls) cohorts. 23-43% of patients with IPF had damaging rare variants or telomeres shorter than the tenth percentile. Analysis of the association of genetic variables with IPF diagnosis yielded odds ratios of 1·63 (95% CI 1·47-1·81) for telomere length PRS and 1·60 (1·44-1·77) for IPF-PRS-noMUC5B in the discovery cohort, with similar effect sizes for the two variables in the replication cohorts (1·47, 1·36-1·59 vs 1·37, 1·25-1·50 in TOPMed; 1·24, 1·19-1·29 vs 1·25, 1·21-1·30 in UK Biobank). The telomere length PRS had the greatest effect on disease risk in patients with IPF not harbouring rare variants and with telomere length shorter than the tenth percentile in the discovery cohort (2·02, 1·76-2·33) and UK Biobank replication cohort (1·70, 1·56-1·85). Accounting for clinical variables and all genetic variables (rare variants, MUC5B SNP, IPF PRS, and telomere length PRS) led to the best disease prediction in the discovery cohort (combined AUC 0·89), TOPMed cohort (0·89), and UK Biobank cohort (0·77). Rare and common variants contributed jointly to the genetic liability of IPF. The telomere length PRS accounted for 13% of the explained genetic liability of IPF in the discovery cohort and 8% and 13% in the TOPMed and UK Biobank cohorts, respectively. INTERPRETATION: Common and rare genetic variation confer context-specific genetic risk in patients with IPF both competitively and cooperatively. In contrast to known IPF common risk variants, the telomere length PRS, which includes more than 180 genetic loci not previously associated with IPF, is associated with increased risk of disease in patients with specific IPF endotypes. Polygenic risk from telomere-associated common variants is a key feature of genetic heterogeneity in IPF. FUNDING: US National Institutes of Health, UK Medical Research Council, and UK National Institute for Health and Care Research.