Mining mutation contexts across the cancer genome to map tumor site of origin. Academic Article uri icon

Overview

abstract

  • The vast preponderance of somatic mutations in a typical cancer are either extremely rare or have never been previously recorded in available databases that track somatic mutations. These constitute a hidden genome that contrasts the relatively small number of mutations that occur frequently, the properties of which have been studied in depth. Here we demonstrate that this hidden genome contains much more accurate information than common mutations for the purpose of identifying the site of origin of primary cancers in settings where this is unknown. We accomplish this using a projection-based statistical method that achieves a highly effective signal condensation, by leveraging DNA sequence and epigenetic contexts using a set of meta-features that embody the mutation contexts of rare variants throughout the genome.

publication date

  • May 24, 2021

Research

keywords

  • Mutation
  • Neoplasms

Identity

PubMed Central ID

  • PMC8144407

Scopus Document Identifier

  • 85106559341

Digital Object Identifier (DOI)

  • 10.1038/s41467-021-23094-z

PubMed ID

  • 34031376

Additional Document Info

volume

  • 12

issue

  • 1