Using Text Mining to Infer Semantic Attributes for Retail Data Mining

  • Authors:
  • Rayid Ghani;Andrew E. Fano

  • Affiliations:
  • -;-

  • Venue:
  • ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

Current Data Mining techniques usually do not have amechanism to automatically infer semantic features inherentin the data being "mined". The semantics are eitherinjected in the initial stages (by feature construction) or byinterpreting the results produced by the algorithms. Bothof these techniques have proved effective but require a lotof human effort. In many domains, semantic informationis implicitly available and can be extracted automaticallyto improve data mining systems. In this paper, we present acase study of a system that is trained to extract semantic featuresfor apparel products and populate a knowledge basewith these products and features. We show that semanticfeatures of these items can be successfully extracted by applyingtext learning techniques to the descriptions obtainedfrom websites of retailers. We also describe several applicationsof such a knowledge base of product semantics that wehave built including recommender systems and competitiveintelligence tools and provide evidence that our approachcan successfully build a knowledge base with accurate factswhich can then be used to create profiles of individual customers,groups of customers, or entire retail stores.