Improving document clustering by utilizing meta-data

  • Authors:
  • Kam-Fai Wong;Nam-Kiu Chan;Kam-Lai Wong

  • Affiliations:
  • The Chinese University of Hong Kong;The Chinese University of Hong Kong;The Chinese University of Hong Kong

  • Venue:
  • AsianIR '03 Proceedings of the sixth international workshop on Information retrieval with Asian languages - Volume 11
  • Year:
  • 2003

Abstract

In this paper, we examine how to improve the precision and recall of document clustering by utilizing meta-data. We use the meta-data carried in NewsML tags to assist clustering and show through experiments on sample news data that this approach is effective. Experimental results show that clustering with NewsML improves average recall and precision by about 10% over clustering without it. Our algorithm facilitates effective e-business for the news media and publishing industry.
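
The abstract does not say how the NewsML meta-data enters the clustering, so the following is only an illustrative sketch of one common way to do it, not the authors' algorithm. It assumes each document is a dict with a `text` string and a `meta` set of tag values, and blends a bag-of-words cosine similarity with a Jaccard overlap of the tags. The names `combined_similarity` and `alpha` and the sample tag strings are hypothetical.

```python
import math
from collections import Counter


def cosine(a: str, b: str) -> float:
    """Cosine similarity between whitespace-tokenised term counts."""
    ca, cb = Counter(a.lower().split()), Counter(b.lower().split())
    dot = sum(ca[t] * cb[t] for t in ca.keys() & cb.keys())
    norm_a = math.sqrt(sum(v * v for v in ca.values()))
    norm_b = math.sqrt(sum(v * v for v in cb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def jaccard(x: set, y: set) -> float:
    """Share of meta-data values that two documents have in common."""
    union = x | y
    return len(x & y) / len(union) if union else 0.0


def combined_similarity(doc_a: dict, doc_b: dict, alpha: float = 0.5) -> float:
    """Weighted blend of text and meta-data similarity.

    alpha is a hypothetical mixing weight: 1.0 ignores the meta-data,
    0.0 ignores the text.
    """
    text_sim = cosine(doc_a["text"], doc_b["text"])
    meta_sim = jaccard(doc_a["meta"], doc_b["meta"])
    return alpha * text_sim + (1 - alpha) * meta_sim


# Illustrative documents; the tag values are made up for this sketch.
a = {"text": "central bank raises interest rates", "meta": {"subject:economy", "region:asia"}}
b = {"text": "typhoon disrupts harbour shipping", "meta": {"subject:economy", "region:asia"}}
score = combined_similarity(a, b, alpha=0.5)
```

A score of this kind could be passed to any similarity-based clustering routine. In this sketch, two stories with no words in common still count as related when their tags match, which is one plausible way meta-data could raise recall.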