Unsupervised multi-label text classification using a world knowledge ontology

  • Authors:
  • Xiaohui Tao;Yuefeng Li;Raymond Y. K. Lau;Hua Wang

  • Affiliations:
  • Centre for Systems Biology, University of Southern Queensland, Australia;Science and Engineering Faculty, Queensland University of Technology, Australia;Department of Information Systems, City University of Hong Kong, Hong Kong;Centre for Systems Biology, University of Southern Queensland, Australia

  • Venue:
  • PAKDD'12 Proceedings of the 16th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

The development of text classification techniques has been largely promoted in the past decade due to the increasing availability and widespread use of digital documents. Usually, the performance of text classification relies on the quality of categories and the accuracy of classifiers learned from samples. When training samples are unavailable or categories are unqualified, text classification performance would be degraded. In this paper, we propose an unsupervised multi-label text classification method to classify documents using a large set of categories stored in a world ontology. The approach has been promisingly evaluated by compared with typical text classification methods, using a real-world document collection and based on the ground truth encoded by human experts.