Building a large annotated corpus of English: the penn treebank
Computational Linguistics - Special issue on using large corpora: II
Noun phrase recognition by system combination
NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
A memory-based approach to learning shallow natural language patterns
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Manually annotated Hungarian corpus
EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 2
Learning tree patterns for syntactic parsing
Acta Cybernetica
Hi-index | 0.00 |
This paper offers a method for the noun phrase recognition of Hungarian natural language texts based on machine learning methods. The approach learns noun phrase tree patterns described by regular expressions from an annotated corpus. The tree patterns are completed with probability values using error statistics. The noun phrase recognition parser tries to find the best-fitting trees for a sentence using backtracking technique. The results are used in an information extraction toolchain.