Content Collection for the Labelling of Health-Related Web Content

Authors:
K. Stamatakis;V. Metsis;V. Karkaletsis;M. Ruzicka;V. Svátek;E. Amigó;M. Pöllä;C. Spyropoulos
Affiliations:
National Centre for Scientific Research "Demokritos",;National Centre for Scientific Research "Demokritos",;National Centre for Scientific Research "Demokritos",;University of Economics, Prague,;University of Economics, Prague,;Universidad Nacional de Educacion a Distancia,;Teknillinen Korkeakoulu --- Helsinki University of Technology,;National Centre for Scientific Research "Demokritos",
Venue:
AIME '07 Proceedings of the 11th conference on Artificial Intelligence in Medicine
Year:
2007

Citing 4
Cited 0

Focused crawling: a new approach to topic-specific Web resource discovery

WWW '99 Proceedings of the eighth international conference on World Wide Web
Adaptive Retrieval Agents: Internalizing Local Contextand Scaling up to the Web

Machine Learning - Special issue on information retrieval
Intelligent crawling on the World Wide Web with arbitrary predicates

Proceedings of the 10th international conference on World Wide Web
Ontology-focused crawling of Web documents

Proceedings of the 2003 ACM symposium on Applied computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

As the number of health-related web sites in various languages increases, so does the need for control mechanisms that give the users adequate guarantee on whether the web resources they are visiting meet a minimum level of quality standards. Based upon state-of-the-art technology in the areas of semantic web, content analysis and quality labelling, the MedIEQ project, integrates existing technologies and tests them in a novel application: the automation of the labelling process in health-related web content. MedIEQ provides tools that crawl the web to locate unlabelled health web resources, to label them according to pre-defined labelling criteria, as well as to monitor them. This paper focuses on content collection and discusses our experiments in the English language.