Outliers in diagnosis ratios: A clue toward possibly absent data. Academic Article uri icon

Overview

abstract

  • The evaluation of completeness of real-world data is a particularly challenging component of data quality assessment because the degree of truly versus erroneously absent data is unknown. Among inpatient data sets, while absolute counts of admissions having specific categories of diagnoses in the principal or any position may vary depending on hospital size, we hypothesized that the ratio of these parameters will be preserved across sites, with outliers suggesting the potential for erroneously absent data. For several categories of clinical conditions assigned to inpatient admissions, we analyzed the ratio of their recording as the principal diagnosis versus any diagnosis across several hospitals and compared the ratios against a national benchmark. Our analysis showed ratios that matched clinical expectations, with reasonable preservation of ratios across sites. However, some conditions exhibited more variability in the ratios and some sites had many outliers possibly reflecting data quality issues that warrant further attention.

publication date

  • January 11, 2024

Research

keywords

  • Hospitalization
  • Hospitals

Identity

PubMed Central ID

  • PMC10785923

PubMed ID

  • 38222346

Additional Document Info

volume

  • 2023