A general and multi-lingual phrase chunking model based on masking method

  • Authors:
  • Yu-Chieh Wu;Chia-Hui Chang;Yue-Shi Lee

  • Affiliations:
  • Department of Computer Science and Information Engineering, National Central University, Jhongli City, Taoyuan County, Taiwan,R.O.C.;Department of Computer Science and Information Engineering, National Central University, Jhongli City, Taoyuan County, Taiwan,R.O.C.;Department of Computer Science and Information Engineering, Ming Chuan University, Taoyuan, Taiwan, R.O.C.

  • Venue:
  • CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Several phrase chunkers have been proposed over the past few years. Some state-of-the-art chunkers achieved better performance via integrating external resources, e.g., parsers and additional training data, or combining multiple learners. However, in many languages and domains, such external materials are not easily available and the combination of multiple learners will increase the cost of training and testing. In this paper, we propose a mask method to improve the chunking accuracy. The experimental results show that our chunker achieves better performance in comparison with other deep parsers and chunkers. For CoNLL-2000 data set, our system achieves 94.12 in F rate. For the base-chunking task, our system reaches 92.95 in F rate. When porting to Chinese, the performance of the base-chunking task is 92.36 in F rate. Also, our chunker is quite efficient. The complete chunking time of a 50K words document is about 50 seconds.