Using web page layout for extraction of sender names

  • Authors:
  • Rintaro Miyazaki;Ryo Momose;Hideyuki Shibuki;Tatsunori Mori

  • Affiliations:
  • Yokohama National University, Hodogaya-ku, Yokohama, Japan;Yokohama National University, Hodogaya-ku, Yokohama, Japan;Yokohama National University, Hodogaya-ku, Yokohama, Japan;Yokohama National University, Hodogaya-ku, Yokohama, Japan

  • Venue:
  • Proceedings of the 3rd International Universal Communication Symposium
  • Year:
  • 2009

Abstract

Recently, the credibility of information available on the Web has come to be regarded as an important issue. The sender name is one of the important indicators of that credibility. In this paper, we propose a new method for extracting sender names. The proposed method applies named entity recognition and, as preprocessing, uses the Web page layout to reduce the DOM nodes to be examined. Experimental results show that the proposed method can effectively extract sender names when the preprocessing is successful.
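
The two-stage idea in the abstract (prune DOM nodes by layout, then look for names in what remains) can be illustrated with a minimal sketch. This is not the authors' implementation: the layout hints, the `LayoutFilter` class, the `extract_sender_candidates` function, and the regular expression standing in for a real named entity recognizer are all assumptions made for illustration, and the sketch assumes well-nested HTML.

```python
import re
from html.parser import HTMLParser

# Assumption: sender names tend to sit in header/footer-like regions.
LAYOUT_HINTS = ("header", "footer", "copyright", "address")
LAYOUT_TAGS = ("header", "footer", "address")
VOID_TAGS = {"br", "img", "hr", "meta", "link", "input"}

# Hypothetical stand-in for a named entity recognizer: capitalized words
# followed by an organization-like suffix.
ORG_PATTERN = re.compile(
    r"\b(?:[A-Z][\w&.-]*\s)+(?:University|Inc\.|Corporation|Association)"
)


class LayoutFilter(HTMLParser):
    """Keeps text only from subtrees whose tag, id, or class matches a hint."""

    def __init__(self):
        super().__init__()
        self.stack = []      # one flag per open element: inside a kept region?
        self.kept_text = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return  # void elements have no subtree to track
        attr_text = " ".join(
            value or "" for name, value in attrs if name in ("id", "class")
        ).lower()
        hinted = tag in LAYOUT_TAGS or any(h in attr_text for h in LAYOUT_HINTS)
        inside = bool(self.stack and self.stack[-1])
        self.stack.append(hinted or inside)

    def handle_endtag(self, tag):
        if tag not in VOID_TAGS and self.stack:
            self.stack.pop()

    def handle_data(self, data):
        if self.stack and self.stack[-1] and data.strip():
            self.kept_text.append(data.strip())


def extract_sender_candidates(html):
    """Returns organization-like names found in the layout-filtered text."""
    parser = LayoutFilter()
    parser.feed(html)
    parser.close()
    candidates = []
    for text in parser.kept_text:
        for match in ORG_PATTERN.findall(text):
            if match not in candidates:
                candidates.append(match)
    return candidates


sample = (
    '<html><body><div id="main"><p>Acme Corporation announced a product.</p></div>'
    '<div class="footer"><p>Copyright 2009 Yokohama National University</p></div>'
    "</body></html>"
)
print(extract_sender_candidates(sample))
```

In this sample, the organization mentioned in the main content is discarded by the layout filter, so only the name in the footer region is returned. This also mirrors the caveat in the abstract: if the pruning step keeps the wrong regions, the later name matching has nothing useful to work with.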