The accuracy of race & ethnicity data in US based healthcare databases: A systematic review. Academic Article uri icon

Overview

abstract

  • BACKGROUND: The availability and accuracy of data on a patient's race/ethnicity varies across databases. Discrepancies in data quality can negatively impact attempts to study health disparities. METHODS: We conducted a systematic review to organize information on the accuracy of race/ethnicity data stratified by database type and by specific race/ethnicity categories. RESULTS: The review included 43 studies. Disease registries showed consistently high levels of data completeness and accuracy. EHRs frequently showed incomplete and/or inaccurate data on the race/ethnicity of patients. Databases had high levels of accurate data for White and Black patients but relatively high levels of misclassification and incomplete data for Hispanic/Latinx patients. Asians, Pacific Islanders, and AI/ANs are the most misclassified. Systems-based interventions to increase self-reported data showed improvement in data quality. CONCLUSION: Data on race/ethnicity that is collected with the purpose of research and quality improvement appears most reliable. Data accuracy can vary by race/ethnicity status and better collection standards are needed.

publication date

  • May 18, 2023

Research

keywords

  • Data Management
  • Ethnicity
  • Hispanic or Latino
  • Racial Groups

Identity

Scopus Document Identifier

  • 85159871842

Digital Object Identifier (DOI)

  • 10.1016/j.amjsurg.2023.05.011

PubMed ID

  • 37230870