Learning to extract and summarize hot item features from multiple auction web sites

  • Authors:
  • Tak-Lam Wong;Wai Lam

  • Affiliations:
  • City University of Hong Kong, Department of Computer Science, 83 Tat Chee Avenue, Kowloon, Hong Kong;The Chinese University of Hong Kong, Department of Systems Engineering and Engineering Management, 83 Tat Chee Avenue, Shatin, Hong Kong

  • Venue:
  • Knowledge and Information Systems
  • Year:
  • 2008

Quantified Score

Hi-index 0.01

Visualization

Abstract

It is difficult to digest the poorly organized and vast amount of information contained in auction Web sites which are fast changing and highly dynamic. We develop a unified framework which can automatically extract product features and summarize hot item features from multiple auction sites. To deal with the irregularity in the layout format of Web pages and harness the uncertainty involved, we formulate the tasks of product feature extraction and hot item feature summarization as a single graph labeling problem using conditional random fields. One characteristic of this graphical model is that it can model the inter-dependence between neighbouring tokens in a Web page, tokens in different Web pages, as well as various information such as hot item features across different auction sites. We have conducted extensive experiments on several real-world auction Web sites to demonstrate the effectiveness of our framework.