Toward generic title generation for clustered documents

  • Authors:
  • Yuen-Hsien Tseng;Chi-Jen Lin;Hsiu-Han Chen;Yu-I Lin

  • Affiliations:
  • National Taiwan Normal University, Taipei, Taiwan, R.O.C.;WebGenie Information LTD., Taipei, Taiwan, R.O.C.;WebGenie Information LTD., Taipei, Taiwan, R.O.C.;Taipei Municipal Univ. of Education, Taipei, Taiwan, R.O.C.

  • Venue:
  • AIRS'06 Proceedings of the Third Asia conference on Information Retrieval Technology
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

A cluster labeling algorithm for creating generic titles based on external resources such as WordNet is proposed. Our method first extracts category-specific terms as cluster descriptors. These descriptors are then mapped to generic terms based on a hypernym search algorithm. The proposed method has been evaluated on a patent document collection and a subset of the Reuters-21578 collection. Experimental results revealed that our method performs as anticipated. Real-case applications of these generic terms show promising in assisting humans in interpreting the clustered topics. Our method is general enough such that it can be easily extended to use other hierarchical resources for adaptable label generation.