An indexing model of HTML documents

  • Authors:
  • Andrea Molinari; Gabriella Pasi; R. A. Marques Pereira

  • Affiliations:
  • University of Trento - Via Inama 5, 38100 Trento, Italy; National Council of Research (ITIM-CNR), Via Ampère, 56, Milano, Italy; University of Trento, Via Inama 5, 38100 Trento, Italy

  • Venue:
  • Proceedings of the 2003 ACM Symposium on Applied Computing
  • Year:
  • 2003

Abstract

The diffusion of the World Wide Web and the consequent increase in the production and exchange of textual information demand the development of effective information retrieval systems. The HyperText Markup Language (HTML) is widely used to define the "typographical" appearance of documents on the Internet and on intranets. This paper proposes an indexing model for HTML documents in which the weight of an index term is computed by weighting its occurrences differently according to the tags in which they appear.
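
The idea of weighting term occurrences by their enclosing tags can be illustrated with a minimal Python sketch. It is not the model proposed in the paper: the abstract does not give the weighting scheme, so the tag weights in `TAG_WEIGHTS`, the rule of taking the largest weight among the enclosing tags, and the normalization by the highest score are all assumptions made for illustration. The names `TagWeightedIndexer` and `index_terms` are hypothetical.

```python
import re
from collections import Counter
from html.parser import HTMLParser

# Hypothetical tag importance weights (assumed here, not taken from the paper).
TAG_WEIGHTS = {"title": 5.0, "h1": 4.0, "h2": 3.0, "b": 2.0, "strong": 2.0}
DEFAULT_WEIGHT = 1.0

# Elements that never have a closing tag, so they are not tracked as open.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class TagWeightedIndexer(HTMLParser):
    """Accumulates a score per term, weighting each occurrence by its tags."""

    def __init__(self):
        super().__init__()
        self.open_tags = []      # stack of currently open elements
        self.scores = Counter()  # term -> accumulated weighted count

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.open_tags.append(tag)

    def handle_endtag(self, tag):
        # Close the most recent matching element, tolerating sloppy markup.
        for i in range(len(self.open_tags) - 1, -1, -1):
            if self.open_tags[i] == tag:
                del self.open_tags[i:]
                break

    def handle_data(self, data):
        # Skip text that is not document content.
        if "script" in self.open_tags or "style" in self.open_tags:
            return
        # Assumption: an occurrence takes the largest weight of its enclosing tags.
        weight = max(
            (TAG_WEIGHTS.get(t, DEFAULT_WEIGHT) for t in self.open_tags),
            default=DEFAULT_WEIGHT,
        )
        for term in re.findall(r"[a-z0-9]+", data.lower()):
            self.scores[term] += weight


def index_terms(html):
    """Return term weights in (0, 1], normalized by the highest score."""
    parser = TagWeightedIndexer()
    parser.feed(html)
    parser.close()
    top = max(parser.scores.values(), default=0.0)
    if top == 0.0:
        return {}
    return {term: score / top for term, score in parser.scores.items()}


if __name__ == "__main__":
    page = ("<html><head><title>Fuzzy indexing</title></head>"
            "<body><h1>Indexing</h1>"
            "<p>fuzzy models and <b>indexing</b> models</p></body></html>")
    for term, weight in sorted(index_terms(page).items(), key=lambda kv: -kv[1]):
        print(f"{term}: {weight:.3f}")
```

In this sketch a term that appears in the title or a heading ends up with a higher weight than a term that appears equally often in plain paragraph text, which is the behavior the abstract describes. A real system would need a principled way to choose the tag weights and to combine them with document-collection statistics.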