Extraction of Text Phrases Using Hierarchical Grammar

  • Authors:
  • Jan Bakus;Mohamed S. Kamel;Tom Carey

  • Affiliations:
  • -;-;-

  • Venue:
  • AI '02 Proceedings of the 15th Conference of the Canadian Society for Computational Studies of Intelligence on Advances in Artificial Intelligence
  • Year:
  • 2002

Quantified Score

Hi-index 0.01

Visualization

Abstract

This paper presents an algorithm for extraction of phrases from text documents. The algorithm builds phrases by iteratively merging bigrams according to an association measure.Tw o association measures are presented: mutual information and t-test. The extracted phrases are tested in a document classification task using a tf/idf model and a k-nearest neighbor classifier.