This paper proposes a knowledge extraction process for identifying Chinese organization names. The process uses the structural and statistical properties of organization names, together with partial linguistic knowledge, to extract new organization names from domain texts. It was tested on a large amount of text retrieved from the WWW. With strict threshold values, new organization names can be identified with very high precision. The process can therefore be run automatically to improve its own performance over time.
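The idea of combining a structural cue with a statistical threshold can be illustrated with a minimal sketch. This is not the method of the paper, whose details are not given here. It assumes, purely for illustration, that the structural cue is a known organization suffix (such as 公司, "company"), that a candidate name is a fixed number of characters before that suffix, and that the statistical property is a raw frequency count. The function name `extract_org_candidates` and the parameters `prefix_len` and `min_count` are hypothetical.

```python
from collections import Counter


def extract_org_candidates(texts, suffixes, prefix_len=3, min_count=3):
    """Return candidate organization names seen at least min_count times.

    Assumptions (illustrative only, not taken from the paper):
    - a candidate is prefix_len characters followed by a known suffix;
    - the statistical filter is a simple frequency threshold.
    """
    counts = Counter()
    for text in texts:
        for suffix in suffixes:
            start = text.find(suffix)
            while start != -1:
                # Structural cue: keep only suffixes with enough text before them.
                if start >= prefix_len:
                    candidate = text[start - prefix_len:start + len(suffix)]
                    counts[candidate] += 1
                start = text.find(suffix, start + 1)
    # Statistical cue: a strict threshold trades recall for precision.
    return {name: n for name, n in counts.items() if n >= min_count}


texts = ["台積電公司宣布", "據台積電公司表示", "台積電公司", "某某公司"]
found = extract_org_candidates(texts, ["公司"], prefix_len=3, min_count=3)
```

Raising `min_count` mirrors the trade-off the abstract describes: fewer candidates pass, but those that do are more likely to be real organization names.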