Concept extraction for online shopping

  • Authors:
  • Yongzheng Zhang;Rajyashree Mukherjee;Benny Soetarman

  • Affiliations:
  • eBay Research Labs, San Jose, CA;eBay Inc., San Jose, CA;eBay Inc., San Jose, CA

  • Venue:
  • Proceedings of the 14th Annual International Conference on Electronic Commerce
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Online shopping has been more and more popular nowadays. Online shopping starts with research and shopping research starts with search. In order to provide a more streamlined user experience in shopping related research, it is critical for e-commerce sites to accurately identify what a Web page is talking about. Concept extraction is a nice solution for this purpose. In this paper, we investigate two concept extraction methods: Automatic Concept Extractor (ACE) and Automatic Keyphrase Extraction (KEA). ACE is an unsupervised method that looks at both text and HTML tags. We upgrade ACE into Improved Concept Extractor (ICE) with significant improvements. KEA is a supervised learning system. It first builds a Naive Bayes model from training documents where concepts are manually assigned. The trained model is then used to automatically find concepts in new documents. In order to evaluate the two systems, we create a gold standard by manually assigning concepts to each page in the collection. We tune different parameters of ICE and KEA to generate concepts. And we use precision, recall and F1 to evaluate the concepts. The experimental results demonstrate that ICE significantly outperforms KEA in concept extraction for online shopping.