Automatic acquisition of hyponyms from large text corpora
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Finding parts in very large corpora
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Learning semantic constraints for the automatic discovery of part-whole relations
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
This paper focuses on the automatic acquisition of semantic relationships from a Chinese corpus, motivated by the goal of improving the performance of our QA system, NL-WAS. Linguistic patterns designed for Chinese sentences are applied to a collection of texts to extract synonymy, hyponymy, and meronymy relationships. The patterns are divided into unambiguous and ambiguous ones, and different strategies are adopted to refine the candidates extracted with these two kinds of patterns. Compared with previous work, we apply not only strict unambiguous patterns but also loose ambiguous patterns to extract relationships, and we propose an efficient approach to refining the outputs of these patterns in order to achieve both high recall and high precision. The experimental results show that the proposed method removes most noisy term pairs and improves the accuracy and efficiency of NL-WAS. Our method is also complementary to statistically based approaches to finding semantic relationships between terms.
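As a rough illustration of the pattern-based idea described in the abstract, the sketch below extracts hyponymy candidates with one strict and one loose pattern. Everything concrete in it is an assumption made for illustration and is not taken from the paper: the two patterns, the requirement that input sentences are already word-segmented with spaces, the function name `extract_hyponyms`, and the frequency threshold used to refine loose matches. The paper's actual patterns and refinement strategies are not given in the abstract.

```python
import re
from collections import Counter

# Hypothetical patterns over space-segmented Chinese text (illustrative only).
# Strict: "X 是 一种 Y" ("X is a kind of Y"), treated here as unambiguous.
STRICT = re.compile(r"(\S+) 是 一种 (\S+)")
# Loose: "X 是 Y" ("X is Y"), which often does not express hyponymy.
LOOSE = re.compile(r"(\S+) 是 (\S+)")


def extract_hyponyms(sentences, min_count=2):
    """Return a set of (hyponym, hypernym) candidate pairs.

    Strict matches are accepted directly. Loose matches are kept only if
    the same pair is seen at least `min_count` times, which is an assumed
    stand-in for the refinement step, not the paper's method.
    """
    accepted = set()
    loose_counts = Counter()
    for sentence in sentences:
        strict_pairs = STRICT.findall(sentence)
        if strict_pairs:
            # Trust the strict pattern and skip the loose one for this sentence.
            accepted.update(strict_pairs)
            continue
        for pair in LOOSE.findall(sentence):
            loose_counts[pair] += 1
    # Refinement: drop loose candidates that are too rare to trust.
    for pair, count in loose_counts.items():
        if count >= min_count:
            accepted.add(pair)
    return accepted


if __name__ == "__main__":
    corpus = [
        "苹果 是 一种 水果",  # strict match
        "狗 是 动物",          # loose match, seen twice
        "狗 是 动物",
        "他 是 老师",          # loose match, seen once, filtered out
    ]
    print(sorted(extract_hyponyms(corpus)))
```

The design choice here is that loose patterns raise recall while the filter restores precision. A frequency threshold is only one simple way to do that filtering.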