Text chunking based on a generalization of winnow
The Journal of Machine Learning Research
Introduction to the CoNLL-2000 shared task: chunking
ConLL '00 Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7
Use of support vector learning for chunk identification
ConLL '00 Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7
Hi-index | 0.00 |
The traditional English text chunking approach identifies phrases by using only one model and phrases with the same types of features. It has been shown that the limitations of using only one model are that: the use of the same types of features is not suitable for all phrases, and data sparseness may also result. In this paper, the Distributed Multi-Agent based architecture approach is proposed and applied in the identification of English phrases. This strategy put phrases into agents according to their sensitive features and identifies different phrases in parallel, where the main features are: one, easy and quick communication between phrases; two, avoidance of data sparseness. By applying and testing the approach on the public training and test corpus, the F score for arbitrary phrases identification using Distributed Multi-Agent strategy achieves 95.70% compared to the previous best F score of 94.17%.