Dual filtering strategy for chinese term extraction

Authors:
Xiaoming Chen;Xuening Li;Yi Hu;Ruzhan Lu
Affiliations:
,Dept. of computer science and engineering, Shanghai Jiao Tong Univ., Shanghai;Dept. of computer science and engineering, Shanghai Jiao Tong Univ., Shanghai;Dept. of computer science and engineering, Shanghai Jiao Tong Univ., Shanghai;Dept. of computer science and engineering, Shanghai Jiao Tong Univ., Shanghai
Venue:
FSKD'05 Proceedings of the Second international conference on Fuzzy Systems and Knowledge Discovery - Volume Part II
Year:
2005

Citing 3
Cited 0

A Statistical Corpus-Based Term Extractor

AI '01 Proceedings of the 14th Biennial Conference of the Canadian Society on Computational Studies of Intelligence: Advances in Artificial Intelligence
Retrieving collocations from text: Xtract

Computational Linguistics - Special issue on using large corpora: I
Word association norms, mutual information, and lexicography

ACL '89 Proceedings of the 27th annual meeting on Association for Computational Linguistics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Automatic term extraction (ATR) is an important problem in natural language processing. But most of extraction methods focus on the extraction of multiword units. Inevitably, many common words (or phrases) as terms are extracted at the same time. In this paper, we propose a hybrid method for automatic extraction of term from domain-specific un-annotated Chinese documents by means of linguistics knowledge and statistical techniques, taking dual filtering strategy and introducing a weight formula to filter term candidates. The results of the research indicate that our system is more efficient and precise than previous methods.