C4.5: programs for machine learning
C4.5: programs for machine learning
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Learning and Inference for Clause Identification
ECML '02 Proceedings of the 13th European Conference on Machine Learning
Finding clauses in unrestricted text by finitary and stochastic methods
ANLC '88 Proceedings of the second conference on Applied natural language processing
Text chunking by combining hand-crafted rules and memory-based learning
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Use of support vector learning for chunk identification
ConLL '00 Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7
Japanese dependency structure analysis based on support vector machines
EMNLP '00 Proceedings of the 2000 Joint SIGDAT conference on Empirical methods in natural language processing and very large corpora: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 13
Boosting trees for clause splitting
ConLL '01 Proceedings of the 2001 workshop on Computational Natural Language Learning - Volume 7
Use of support vector machines in extended named entity recognition
COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
Introduction to the CoNLL-2001 shared task: clause identification
ConLL '01 Proceedings of the 2001 workshop on Computational Natural Language Learning - Volume 7
ConLL '01 Proceedings of the 2001 workshop on Computational Natural Language Learning - Volume 7
Dependency Analysis of Clauses Using Parse Tree Kernels
CICLing '07 Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Text Processing
Hi-index | 0.00 |
This paper proposes a method for Korean clause boundary recognition. Clause boundary identification can be regarded as a three-class classification task, and it can be converted into a two-phase binary classification task. Then it is natural to apply SVMs to clause boundary recognition, since SVMs are basically binary classifiers. Specifically we first recognize the ending points of clauses, and then identify the starting points by considering the typological characteristics of Korean. In addition, since there is not a standard Korean corpus containing clause boundary information, we prepare a Korean clause identification dataset. In the evaluation, support vector machines yield the improvement of performance over memory-based learning or decision trees.