An algorithm of online goods information extraction with two-stage working pattern

  • Authors:
  • Wang Xun;Ling Yun;Yu-lian Fei

  • Affiliations:
  • College of Computer and Information Engineering, Zhejiang Gongshang University, Hangzhou, china;College of Computer and Information Engineering, Zhejiang Gongshang University, Hangzhou, china;College of Computer and Information Engineering, Zhejiang Gongshang University, Hangzhou, china

  • Venue:
  • FSKD'05 Proceedings of the Second international conference on Fuzzy Systems and Knowledge Discovery - Volume Part I
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

The key technology in comparison-shopping is the online goods information extraction. Based on DOM, the information extraction with two-stage working pattern and the conception of page information unit have been proposed after a large number of sample pages testing. PIU is extracted and categorized by the classifying algorithm, and information is extracted from PIU. It is implemented that the key information of online goods is extracted based on the above-mentioned information extraction algorithm. It shows that the algorithm is steady and has higher Recall and Precision rate with the sample page testing.