Applying semantic web technologies for phenome-wide scan using an electronic health record linked Biobank.

Overview

abstract

BACKGROUND: The ability to conduct genome-wide association studies (GWAS) has enabled new exploration of how genetic variations contribute to health and disease etiology. However, historically GWAS have been limited by inadequate sample size due to associated costs for genotyping and phenotyping of study subjects. This has prompted several academic medical centers to form "biobanks" where biospecimens linked to personal health information, typically in electronic health records (EHRs), are collected and stored on a large number of subjects. This provides tremendous opportunities to discover novel genotype-phenotype associations and foster hypothesis generation. RESULTS: In this work, we study how emerging Semantic Web technologies can be applied in conjunction with clinical and genotype data stored at the Mayo Clinic Biobank to mine the phenotype data for genetic associations. In particular, we demonstrate the role of using Resource Description Framework (RDF) for representing EHR diagnoses and procedure data, and enable federated querying via standardized Web protocols to identify subjects genotyped for Type 2 Diabetes and Hypothyroidism to discover gene-disease associations. Our study highlights the potential of Web-scale data federation techniques to execute complex queries. CONCLUSIONS: This study demonstrates how Semantic Web technologies can be applied in conjunction with clinical data stored in EHRs to accurately identify subjects with specific diseases and phenotypes, and identify genotype-phenotype associations.

authors

Pathak, Jyotishman
Kiefer, Richard C
Bielinski, Suzette J
Chute, Christopher G

publication date

December 17, 2012

published in

Journal of Biomedical Semantics

Identity

PubMed Central ID

PMC3554594

Scopus Document Identifier

84889676637

Digital Object Identifier (DOI)

10.1186/2041-1480-3-10

PubMed ID

23244446

Additional Document Info

has global citation frequency

23

volume

3

issue

1
