A site oriented method for segmenting web pages

  • Authors:
  • David Fernandes; Edleno Silva de Moura; Altigran Soares da Silva; Berthier Ribeiro-Neto; Edisson Braga

  • Affiliations:
  • Fed. Univ. of Amazonas, Manaus, AM, Brazil; Fed. Univ. of Amazonas, Manaus, AM, Brazil; Fed. Univ. of Amazonas, Manaus, AM, Brazil; Fed. Univ. of Minas Gerais, Belo Horizonte, MG, Brazil; Fed. Univ. of Amazonas, Manaus, AM, Brazil

  • Venue:
  • Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
  • Year:
  • 2011

Abstract

Information about how to segment a Web page can be used by applications such as segment-aware Web search, classification, and link analysis. In this work, we propose a fully automatic method for page segmentation and evaluate its application through experiments with four separate Web sites. While the method may be used in other applications, our main focus in this article is its use as input to segment-aware Web search systems. Our results indicate that the proposed method produces better segmentation results than the best segmentation method we found in the literature. Further, when applied as input to a segment-aware Web search method, it produces results close to those obtained with manual page segmentation.