EvidenceOutcomes: A Dataset of Clinical Trial Publications with Clinically Meaningful Outcomes. Academic Article uri icon

Overview

abstract

  • The fundamental process of evidence extraction in evidence-based medicine relies on identifying PICO elements, with Outcomes being the most complex and often overlooked. To address this, we introduce EvidenceOutcomes, a large annotated corpus of clinically meaningful outcomes. A robust annotation guideline was developed in collaboration with clinicians and NLP experts, and three annotators annotated the Results and Conclusions of 500 PubMed abstracts and 140 EBM-NLP abstracts, achieving an inter-rater agreement of 0.76. A fine-tuned PubMedBERT model achieved F1 scores of 0.69 (entity level) and 0.76 (token level). EvidenceOutcomes offers a benchmark for advancing machine learning algorithms in extracting clinically meaningful outcomes.

publication date

  • August 7, 2025

Research

keywords

  • Clinical Trials as Topic
  • Data Mining
  • Evidence-Based Medicine
  • Natural Language Processing
  • Outcome Assessment, Health Care

Identity

Digital Object Identifier (DOI)

  • 10.3233/SHTI250935

PubMed ID

  • 40775953

Additional Document Info

volume

  • 329