Webpage understanding: beyond page-level search

  • Authors: Zaiqing Nie; Ji-Rong Wen; Wei-Ying Ma

  • Affiliations: Microsoft Research Asia, Beijing, P. R. China (all three authors)

  • Venue: ACM SIGMOD Record
  • Year: 2009

Abstract

In this paper, we introduce the webpage understanding problem, which consists of three subtasks: webpage segmentation, webpage structure labeling, and webpage text segmentation and labeling. The problem is motivated by the search applications we have been working on, including Microsoft Academic Search, Windows Live Product Search, and Renlifang Entity Relationship Search. We believe that integrated webpage understanding will be an important direction for future research in Web mining.