Dual filtering strategy for chinese term extraction

  • Authors:
  • Xiaoming Chen;Xuening Li;Yi Hu;Ruzhan Lu

  • Affiliations:
  • ,Dept. of computer science and engineering, Shanghai Jiao Tong Univ., Shanghai;Dept. of computer science and engineering, Shanghai Jiao Tong Univ., Shanghai;Dept. of computer science and engineering, Shanghai Jiao Tong Univ., Shanghai;Dept. of computer science and engineering, Shanghai Jiao Tong Univ., Shanghai

  • Venue:
  • FSKD'05 Proceedings of the Second international conference on Fuzzy Systems and Knowledge Discovery - Volume Part II
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Automatic term extraction (ATR) is an important problem in natural language processing. But most of extraction methods focus on the extraction of multiword units. Inevitably, many common words (or phrases) as terms are extracted at the same time. In this paper, we propose a hybrid method for automatic extraction of term from domain-specific un-annotated Chinese documents by means of linguistics knowledge and statistical techniques, taking dual filtering strategy and introducing a weight formula to filter term candidates. The results of the research indicate that our system is more efficient and precise than previous methods.