Discovery of environmental nodes in the web

  • Authors:
  • Anastasia Moumtzidou;Stefanos Vrochidis;Sara Tonelli;Ioannis Kompatsiaris;Emanuele Pianta

  • Affiliations:
  • Informatics and Telematics Institute, Thessaloniki, Greece;Informatics and Telematics Institute, Thessaloniki, Greece;FBK, Trento, Italy;Informatics and Telematics Institute, Thessaloniki, Greece;FBK, Trento, Italy

  • Venue:
  • IRFC'12 Proceedings of the 5th conference on Multidisciplinary Information Retrieval
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Analysis and processing of environmental information is considered of utmost importance for humanity. This article addresses the problem of discovery of web resources that provide environmental measurements. Towards the solution of this domain-specific search problem, we combine state-of-the-art search techniques together with advanced textual processing and supervised machine learning. Specifically, we generate domain-specific queries using empirical information and machine learning driven query expansion in order to enhance the initial queries with domain-specific terms. Multiple variations of these queries are submitted to a general-purpose web search engine in order to achieve a high recall performance and we employ a post processing module based on supervised machine learning to improve the precision of the final results. In this work, we focus on the discovery of weather forecast websites and we evaluate our technique by discovering weather nodes for south Finland.