WI-IATW '07 Proceedings of the 2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Workshops
Alleviating the Problem of Wrong Coreferences in Web Person Search
CICLing '09 Proceedings of the 10th International Conference on Computational Linguistics and Intelligent Text Processing
Hi-index | 0.01 |
This paper considers five features, personal titles, community chains, terms, temporal expressions, and hostnames for personal name disambiguation. In 9 test data sets covering 3 ambiguous personal names, we address the issues of awareness degree of an entity, the source of materials and web pages in different areas. Two approaches, single-clusterer and cascaded multiple-clusterer, are proposed. In the experiments, the proposed features are quite useful; the multiple-clusterer approach is better than the single-clusterer approach; and expanding community chains using the web has positive effects on personal name disambiguation.