A fuzzy declarative approach for classifying unlabeled short texts using thesauri

  • Authors:
  • Francisco P. Romero;Pascual Julian-Iranzo;Andres Soto;Mateus Ferreira-Satler;Juan Gallardo-Casero

  • Affiliations:
  • Dept. of Information Systems and Technologies, University of Castilla La Mancha. Ciudad Real, Spain;Dept. of Information Systems and Technologies, University of Castilla La Mancha. Ciudad Real, Spain;Univ. Autonoma del Carmen, Ciudad del Carmen, Campeche, Mexico;Dept. of Information Systems and Technologies, University of Castilla La Mancha. Ciudad Real, Spain;Dept. of Information Systems and Technologies, University of Castilla La Mancha. Ciudad Real, Spain

  • Venue:
  • WILF'11 Proceedings of the 9th international conference on Fuzzy logic and applications
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

The classic approach to text categorisation is based on a learning process that requires a large number of labelled training texts to achieve an accurate performance. The most notable problem is that labelled texts are difficult to generate because categorising shorts texts as snippets or messages must be done by human developers, although unlabelled short texts could be easily collected. In this paper, we present an approach to categorising unlabelled short texts which only require, as user input, the category names defined by means of an ontology of terms modelled by a set of proximity equations. The proposed classification process is based on the ability of a fuzzy extension of the standard Prolog language named Bousi~Prolog for flexible matching and knowledge representation. This declarative approach provides a text classifier which is fast and easy to build, as well as a classification process that is easy for the user to understand. The results of the experiment showed that the proposed method achieved a reasonably good performance.