Discovery procedures for sublanguage selectional patterns: initial experiments
Computational Linguistics
Automatic text processing: the transformation, analysis, and retrieval of information by computer
Automatic text processing: the transformation, analysis, and retrieval of information by computer
Word association norms, mutual information, and lexicography
Computational Linguistics
Term clustering of syntactic phrases
SIGIR '90 Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval
Natural Language Information Processing: A Computer Grammar of English and Its Applications
Natural Language Information Processing: A Computer Grammar of English and Its Applications
Extracting semantic hierarchies from a large on-line dictionary
ACL '85 Proceedings of the 23rd annual meeting on Association for Computational Linguistics
Noun classification from predicate-argument structures
ACL '90 Proceedings of the 28th annual meeting on Association for Computational Linguistics
Natural language information retrieval in digital libraries
Proceedings of the first ACM international conference on Digital libraries
Information retrieval using robust natural language processing
ACL '92 Proceedings of the 30th annual meeting on Association for Computational Linguistics
Information retrieval using robust natural language processing
HLT '91 Proceedings of the workshop on Speech and Natural Language
We describe an advanced text processing system for information retrieval from natural language document collections. We use both syntactic processing and statistical term clustering to obtain a document representation that is expected to be more accurate than those obtained with traditional keyword methods. A reliable top-down parser has been developed that allows fast processing of large amounts of text and precise identification of the desired types of phrases for statistical analysis. Two statistical measures are computed: the informational contribution of words in phrases, and the similarity between words.
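The abstract does not give formulas for the two measures, so the following is only a minimal sketch of the general idea, not the system's actual method. It assumes the parser yields (head, modifier) pairs, uses pointwise mutual information as a stand-in for the informational contribution of a word pairing, and uses Jaccard overlap of shared modifiers as a stand-in for word similarity. The sample pairs and the function names `pmi`, `contexts`, and `jaccard` are hypothetical.

```python
import math
from collections import Counter

# Hypothetical (head, modifier) pairs, standing in for parser output.
pairs = [
    ("retrieval", "information"),
    ("retrieval", "document"),
    ("retrieval", "text"),
    ("search", "information"),
    ("search", "document"),
    ("grammar", "english"),
]

pair_counts = Counter(pairs)
head_counts = Counter(head for head, _ in pairs)
mod_counts = Counter(mod for _, mod in pairs)
total = len(pairs)


def pmi(head, modifier):
    """Pointwise mutual information of a pair (stand-in measure, an assumption)."""
    joint = pair_counts[(head, modifier)] / total
    if joint == 0:
        return float("-inf")  # pair never observed
    p_head = head_counts[head] / total
    p_mod = mod_counts[modifier] / total
    return math.log2(joint / (p_head * p_mod))


def contexts(word):
    """Set of modifiers observed with `word` as the head."""
    return {mod for head, mod in pair_counts if head == word}


def jaccard(word_a, word_b):
    """Similarity of two heads as overlap of their modifier sets (stand-in measure)."""
    a, b = contexts(word_a), contexts(word_b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


if __name__ == "__main__":
    print(jaccard("retrieval", "search"))
    print(pmi("grammar", "english"))
```

In this toy data, "retrieval" and "search" share two of three modifiers and so score as similar, while "grammar" shares none with either. A clustering step could then group words whose similarity exceeds a chosen threshold.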