Analysis of online information searching for cardiovascular diseases on a consumer health information portal. Academic Article uri icon



  • Since the early 2000's, Internet usage for health information searching has increased significantly. Studying search queries can help us to understand users "information need" and how do they formulate search queries ("expression of information need"). Although cardiovascular diseases (CVD) affect a large percentage of the population, few studies have investigated how and what users search for CVD. We address this knowledge gap in the community by analyzing a large corpus of 10 million CVD related search queries from Using UMLS MetaMap and UMLS semantic types/concepts, we developed a rule-based approach to categorize the queries into 14 health categories. We analyzed structural properties, types (keyword-based/Wh-questions/Yes-No questions) and linguistic structure of the queries. Our results show that the most searched health categories are 'Diseases/Conditions', 'Vital-Sings', 'Symptoms' and 'Living-with'. CVD queries are longer and are predominantly keyword-based. This study extends our knowledge about online health information searching and provides useful insights for Web search engines and health websites.

publication date

  • November 14, 2014



  • Cardiovascular Diseases
  • Consumer Health Information
  • Information Storage and Retrieval
  • Internet


PubMed Central ID

  • PMC4419905

Scopus Document Identifier

  • 84964312765

PubMed ID

  • 25954380

Additional Document Info


  • 2014