Extraction of anchor-related text and its evaluation by user studies

  • Authors:
  • Bui Quang Hung;Masanori Otsubo;Yoshinori Hijikata;Shogo Nishida

  • Affiliations:
  • Department of Systems Innovation, Graduate School of Engineering Science, Osaka University, Toyonaka, Osaka, Japan;Department of Systems Innovation, Graduate School of Engineering Science, Osaka University, Toyonaka, Osaka, Japan;Department of Systems Innovation, Graduate School of Engineering Science, Osaka University, Toyonaka, Osaka, Japan;Department of Systems Innovation, Graduate School of Engineering Science, Osaka University, Toyonaka, Osaka, Japan

  • Venue:
  • Proceedings of the 2007 conference on Human interface: Part I
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Semantic Text Portion (STP) is a text portion in the original page which is semantically related to the anchor pointing to the target page. STPs may include the facts and the people's opinions about the target pages. STPs can be used for various upper-level applications such as automatic summarization and document categorization. In this paper, we concentrate on extracting STPs. We conduct a survey of STP to see the positions of STPs in original pages and find out HTML tags which can divide STPs from the other text portions in original pages. We then develop a method for extracting STPs based on the result of the survey. The experimental results show that our method achieves high performance.