The accuracy of race & ethnicity data in US based healthcare databases: A systematic review.
Academic Article
Overview
abstract
BACKGROUND: The availability and accuracy of data on a patient's race/ethnicity varies across databases. Discrepancies in data quality can negatively impact attempts to study health disparities. METHODS: We conducted a systematic review to organize information on the accuracy of race/ethnicity data stratified by database type and by specific race/ethnicity categories. RESULTS: The review included 43 studies. Disease registries showed consistently high levels of data completeness and accuracy. EHRs frequently showed incomplete and/or inaccurate data on the race/ethnicity of patients. Databases had high levels of accurate data for White and Black patients but relatively high levels of misclassification and incomplete data for Hispanic/Latinx patients. Asians, Pacific Islanders, and AI/ANs are the most misclassified. Systems-based interventions to increase self-reported data showed improvement in data quality. CONCLUSION: Data on race/ethnicity that is collected with the purpose of research and quality improvement appears most reliable. Data accuracy can vary by race/ethnicity status and better collection standards are needed.