Definite clause grammars for language analysis
Readings in natural language processing
Compression, significance and accuracy
ML92 Proceedings of the ninth international workshop on Machine learning
ILP '96 Selected Papers from the 6th International Workshop on Inductive Logic Programming
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
An ILP Refinement Operator for Biological Grammar Learning
Inductive Logic Programming
Pertinent background knowledge for learning protein grammars
ECML'06 Proceedings of the 17th European conference on Machine Learning
AMIE: association rule mining under incomplete evidence in ontological knowledge bases
Proceedings of the 22nd international conference on World Wide Web
Hi-index | 0.00 |
We identify a shortcoming of a standard positive-only clause evaluation function within the context of learning biological grammars. To overcome this shortcoming we propose L-modification, a modification to this evaluation function such that the lengths of individual examples are considered. We use a set of bio-sequences known as neuropeptide precursor middles (NPP-middles). Using L-modification to learn from these NPP-middles results in induced grammars that have a better performance than that achieved when using the standard positive-only clause evaluation function. We also show that L-modification improves the performance of induced grammars when learning on short, medium or long NPPs-middles. A potential disadvantage of L-modification is discussed. Finally, we show that, as the limit on the search space size increases, the greater is the increase in predictive performance arising from L-modification.