Word association norms, mutual information, and lexicography
Computational Linguistics
The effects of noisy data on text retrieval
Journal of the American Society for Information Science
The nature of statistical learning theory
The nature of statistical learning theory
Statistical Language Learning
The NYU system for MUC-6 or where's the syntax?
MUC6 '95 Proceedings of the 6th conference on Message understanding
Acquiring word-meaning mappings for natural language interfaces
Journal of Artificial Intelligence Research
Hi-index | 0.00 |
This paper is an outline of a statistical learning algorithm for information extraction systems. It is based on a lexically intensive analysis of a small number of texts that belong to one domain and provides a robust lemmatisation of the word forms and the collection of the most important syntagmatic dependencies in weighted regular expressions. The lexical and syntactical knowledge is collected in a very compact knowledge base that enables the analysis of correct and partly incorrect texts or messages, which due to transmission errors, spelling or grammatical mistakes otherwise would have been rejected by conventional systems.