On learning the past tenses of English verbs
Parallel distributed processing: explorations in the microstructure of cognition, vol. 2
Some advances in transformation-based part of speech tagging
AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Regular models of phonological rule systems
Computational Linguistics - Special issue on computational phonology
IGTree: Using Trees for Compression and Classification in Lazy LearningAlgorithms
Artificial Intelligence Review - Special issue on lazy learning
An Efficient, Probabilistically Sound Algorithm for Segmentation andWord Discovery
Machine Learning - Special issue on natural language learning
Learning dictionaries for information extraction by multi-level bootstrapping
AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Genetic Algorithms in Search, Optimization and Machine Learning
Genetic Algorithms in Search, Optimization and Machine Learning
Machine Learning
Inductive Logic Programming: Techniques and Applications
Inductive Logic Programming: Techniques and Applications
Machine Learning
Active Learning for Natural Language Parsing and Information Extraction
ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Part-of-Speech Tagging Using Progol
ILP '97 Proceedings of the 7th International Workshop on Inductive Logic Programming
Learning Multilingual Morphology with CLOG
ILP '98 Proceedings of the 8th International Workshop on Inductive Logic Programming
A Hybrid Approach t Word Segmentation
ILP '98 Proceedings of the 8th International Workshop on Inductive Logic Programming
Automatic rule induction for unknown-word guessing
Computational Linguistics
Paradigmatic cascades: a linguistically sound model of pronunciation by analogy
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
A Mathematical Theory of Communication
A Mathematical Theory of Communication
Induction of first-order decision lists: results on learning the past tense of English verbs
Journal of Artificial Intelligence Research
Hi-index | 0.00 |
This article presents a combination of unsupervised and supervised learning techniques for the generation of word segmentation rules from a raw list of words. First, a language bias for word segmentation is introduced and a simple genetic algorithm is used in the search for a segmentation that corresponds to the best bias value. In the second phase, the words segmented by the genetic algorithm are used as an input for the first order decision list learner CLOG. The result is a set of first order rules which can be used for segmentation of unseen words. When applied on either the training data or unseen data, these rules produce segmentations which are linguistically meaningful, and to a large degree conforming to the annotation provided.