Name Disambiguation in Person Information Mining

  • Authors:
  • Yu-Chuan Wei;Ming-Shun Lin;Hsin-Hsi Chen

  • Affiliations:
  • National Taiwan University, Taiwan;National Taiwan University, Taiwan;National Taiwan University, Taiwan

  • Venue:
  • WI '06 Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
  • Year:
  • 2006

Quantified Score

Hi-index 0.01

Visualization

Abstract

This paper considers five features, personal titles, community chains, terms, temporal expressions, and hostnames for personal name disambiguation. In 9 test data sets covering 3 ambiguous personal names, we address the issues of awareness degree of an entity, the source of materials and web pages in different areas. Two approaches, single-clusterer and cascaded multiple-clusterer, are proposed. In the experiments, the proposed features are quite useful; the multiple-clusterer approach is better than the single-clusterer approach; and expanding community chains using the web has positive effects on personal name disambiguation.