The impact of commercial health datasets on medical research and health-care algorithms. Review

Overview

abstract

  • As the health-care industry emerges into a new era of digital health driven by cloud data storage, distributed computing, and machine learning, health-care data have become a premium commodity with value for private and public entities. Current frameworks of health data collection and distribution, whether from industry, academia, or government institutions, are imperfect and do not allow researchers to leverage the full potential of downstream analytical efforts. In this Health Policy paper, we review the current landscape of commercial health data vendors, with special emphasis on the sources of their data, challenges associated with data reproducibility and generalisability, and ethical considerations for data vending. We argue for sustainable approaches to curating open-source health data to enable global populations to be included in the biomedical research community. However, to fully implement these approaches, key stakeholders should come together to make health-care datasets increasingly accessible, inclusive, and representative, while balancing the privacy and rights of individuals whose data are being collected.

publication date

  • May 1, 2023

Research

keywords

  • Algorithms
  • Biomedical Research
  • Datasets as Topic

Identity

Scopus Document Identifier

  • 85153291834

Digital Object Identifier (DOI)

  • 10.1016/S2589-7500(23)00025-0

PubMed ID

  • 37100543

Additional Document Info

volume

  • 5

issue

  • 5