Exploring Word Similarity to Improve Chinese Personal Name Disambiguation

Authors:
Xia Yang;Peng Jin;Wei Xiang
Affiliations:
-;-;-
Venue:
WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 03
Year:
2011

Citing 3
Cited 0

Automatic retrieval and clustering of similar words

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Entity-based cross-document coreferencing using the Vector Space Model

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
CU-COMSEM: exploring rich features for unsupervised web personal name disambiguation

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents an approach to the Chinese Personal Name Disambiguation (PND). The key to clustering is the similarity measure of context, which depends on the features selection and representation and calculation method. First HIT Tongyici Cilin (Extended) is introduced to Chinese PND to enhance the clustering effect. Exploration about more word similarity is also performed to alleviate the data sparseness. In this system, a HAC (Hierarchical Agglomerative Clustering) algorithm is adopted to cluster the mentions referring to a same person with features extracted from documents. The results show that the word similarity information is very helpful to improve the system's performance.