Genre and domain processing in an information retrieval perspective

  • Authors:
  • Céline Poudat; Guillaume Cleuziou

  • Affiliations:
  • Centre Orléanais de Recherche en Anthropologie et Linguistique, Orléans, France; Laboratoire d'Informatique Fondamentale d'Orléans, Orléans, France

  • Venue:
  • ICWE'03 Proceedings of the 2003 international conference on Web engineering
  • Year:
  • 2003

Abstract

The massive amount of textual data on the Web raises numerous classification problems. Although the notion of domain is widely acknowledged in the IR field, the applied concept of genre could compensate for its weaknesses by taking into account the linguistic properties and document structure of texts. Two clustering methods are proposed here to illustrate how the two notions complement each other in characterizing a closed corpus of scientific articles. The results are intended for use in a Web-based application.