Content Collection for the Labelling of Health-Related Web Content

  • Authors:
  • K. Stamatakis;V. Metsis;V. Karkaletsis;M. Ruzicka;V. Svátek;E. Amigó;M. Pöllä;C. Spyropoulos

  • Affiliations:
  • National Centre for Scientific Research "Demokritos",;National Centre for Scientific Research "Demokritos",;National Centre for Scientific Research "Demokritos",;University of Economics, Prague,;University of Economics, Prague,;Universidad Nacional de Educacion a Distancia,;Teknillinen Korkeakoulu --- Helsinki University of Technology,;National Centre for Scientific Research "Demokritos",

  • Venue:
  • AIME '07 Proceedings of the 11th conference on Artificial Intelligence in Medicine
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

As the number of health-related web sites in various languages increases, so does the need for control mechanisms that give the users adequate guarantee on whether the web resources they are visiting meet a minimum level of quality standards. Based upon state-of-the-art technology in the areas of semantic web, content analysis and quality labelling, the MedIEQ project, integrates existing technologies and tests them in a novel application: the automation of the labelling process in health-related web content. MedIEQ provides tools that crawl the web to locate unlabelled health web resources, to label them according to pre-defined labelling criteria, as well as to monitor them. This paper focuses on content collection and discusses our experiments in the English language.