Efficient Part-of-Speech Tagging with a Min-Max Modular Neural-Network Model

  • Authors:
  • Bao-Liang Lu;Qing Ma;Michinori Ichikawa;Hitoshi Isahara

  • Affiliations:
  • Department of Computer Science, Shanghai Jiao Tong University, 1954 Hua Shan Road, Shanghai 200030, People's Republic of China. blu@cs.sjtu.edu.cn;Department of Applied Mathematics and Informatics, Faculty of Science and Technology, Ryukoku University, Seta, Otsu 520-2194, Japan. qma@math.ryukoku.ac.jp;Lab. for Brain-Operative Device, RIKEN Brain Science Institute, 2-1 Hirosawa, Wako-shi 351-0198, Japan. ichikawa@brainway.riken.go.jp;Communications Research Laboratory, 2-2-2 Hikaridai, Seika-cho, Soraku-gun, Kyoto 619-0289, Japan. isahara@crl.go.jp

  • Venue:
  • Applied Intelligence
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a part-of-speech tagging method based on a min-max modular neural-network model. The method has three main steps. First, a large-scale tagging problem is decomposed into a number of relatively smaller and simpler subproblems according to the class relations among a given training corpus. Secondly, all of the subproblems are learned by smaller network modules in parallel. Finally, following two simple module combination laws, all of the trained network modules are integrated into a modular parallel tagging system that produces solutions to the original tagging problem. The proposed method has several advantages over existing tagging systems based on multilayer perceptrons. (1) Training times can be drastically reduced and desired learning accuracy can be easily achieved; (2) the method can scale up to larger tagging problems; (3) the tagging system has quick response and facilitates hardware implementation. In order to demonstrate the effectiveness of the proposed method, we perform simulations on two different language corpora: a Thai corpus and a Chinese corpus, which have 29,028 and 45,595 ambiguous words, respectively. We also compare our method with several existing tagging models including hidden Markov models, multilayer perceptrons and neuro-taggers. The results show that both the learning accuracy and generalization performance of the proposed tagging model are better than statistical models and multilayer perceptrons, and they are comparable to the most successful tagging models.