NAYOSE: a system for reference disambiguation of proper nouns appearing on web pages

  • Authors:
  • Shingo Ono;Minoru Yoshida;Hiroshi Nakagawa

  • Affiliations:
  • Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, Japan;Information Technology Center, The University of Tokyo;Information Technology Center, The University of Tokyo

  • Venue:
  • AIRS'06 Proceedings of the Third Asia conference on Information Retrieval Technology
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

We are developing a reference disambiguation system called NAYOSE System. In order to cope with the case the same person name or place name appears over two or more Web pages, we propose a system classifying each page into a cluster which corresponds to the same entity in the real world. For this purpose, we propose two new methods involving algorithms to classify these pages. In our evaluation, the combination of local text matching and named entities matching outperformed the previous baseline algorithm used in simple document classification method by 0.22 in the overall F-measure.