Annotating text segments using a web-based categorization approach

  • Authors:
  • Hsin-Chen Chiao;Hsiao-Tieh Pu;Lee-Feng Chien

  • Affiliations:
  • Institute of Information Science, Academia Sinica, Taipei, Taiwan;Graduate Institute of Library & Information Studies, National Taiwan Normal University, Taipei, Taiwan;Institute of Information Science, Academia Sinica, Taipei, Taiwan

  • Venue:
  • ICADL'05 Proceedings of the 8th international conference on Asian Digital Libraries: implementing strategies and sharing experiences
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Conventional automatic text annotation tools mostly extract named entities from texts and annotate them with information about persons, locations, and dates, etc. Such kind of entity type information, however, is insufficient for machines to understand the context or facts contained in the texts. This paper presents a general text categorization approach to categorize text segments into broader subject categories, such as categorizing a text string into a category of paper title in Mathematics or a category of conference name in Computer Science. Experimental results confirm its wide applicability to various digital library applications.