Web scale competitor discovery using mutual information

  • Authors:
  • Rui Li;Shenghua Bao;Jin Wang;Yuanjie Liu;Yong Yu

  • Affiliations:
  • Department of Computer Science and Engineering, Shanghai JiaoTong University, Shanghai, P.R. China;Department of Computer Science and Engineering, Shanghai JiaoTong University, Shanghai, P.R. China;Department of Computer Science and Engineering, Shanghai JiaoTong University, Shanghai, P.R. China;Department of Computer Science and Engineering, Shanghai JiaoTong University, Shanghai, P.R. China;Department of Computer Science and Engineering, Shanghai JiaoTong University, Shanghai, P.R. China

  • Venue:
  • ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

The web with its rapid expansion has become an excellent resource for gathering information and people’s opinion. A company owner wants to know who is the competitor, and a customer also wants to know which company provides similar product or service to what he/she is in want of. This paper proposes an approach based on mutual information, which focuses on mining competitors of the entity(such as company, product, person ) from the web. The proposed techniques first extract a set of candidates of the input entity, and then rank them according to the comparability, and finally find and organize the reviews related to both original entity and its competitors. A novel system called ”CoDis” based upon these techniques is implemented, which is able to automate the tedious process in a domain-independent and web-scale dynamical manner. In the experiment we use 32 different entities distributed in varied domains as inputs and the CoDis discovers 143 competitors. The experimental results show that the proposed techniques are highly effective.