Knowledge extraction for identification of Chinese organization names

Authors:
Keh-Jiann Chen;Chao-jan Chen
Affiliations:
Institute of Information Science, Academia Sinica, Taipei;Institute of Information Science, Academia Sinica, Taipei
Venue:
CLPW '00 Proceedings of the second workshop on Chinese language processing: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 12
Year:
2000

Citing 5
Cited 1

Internal and external evidence in the identification and semantic categorization of proper names

Corpus processing for lexical acquisition
Retrieving collocations from text: Xtract

Computational Linguistics - Special issue on using large corpora: I
Automatic semantic classification for Chinese unknown compound nouns

COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
Word identification for Mandarin Chinese sentences

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 1
Identification and classification of proper nouns in Chinese texts

COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1

Semantic classification of Chinese unknown words

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 2

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, a knowledge extraction process was proposed to extract the knowledge for identifying Chinese organization names. The knowledge extraction process utilizes the structure property, statistical property as well as partial linguistic knowledge of the organization names to extract new organizations from domain texts. The knowledge extraction processes were experimented on large amount of texts retrieved from WWW. With high standard of threshold values, new organization names can be identified with very high precision. Therefore the knowledge extraction processes can be carried out automatically to self improve the performance in the future.