Web scale competitor discovery using mutual information

Authors:
Rui Li;Shenghua Bao;Jin Wang;Yuanjie Liu;Yong Yu
Affiliations:
Department of Computer Science and Engineering, Shanghai JiaoTong University, Shanghai, P.R. China;Department of Computer Science and Engineering, Shanghai JiaoTong University, Shanghai, P.R. China;Department of Computer Science and Engineering, Shanghai JiaoTong University, Shanghai, P.R. China;Department of Computer Science and Engineering, Shanghai JiaoTong University, Shanghai, P.R. China;Department of Computer Science and Engineering, Shanghai JiaoTong University, Shanghai, P.R. China
Venue:
ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications
Year:
2006

Abstract

The web, with its rapid expansion, has become an excellent resource for gathering information and people's opinions. A company owner wants to know who its competitors are, and a customer wants to know which companies provide products or services similar to the one he or she is looking for. This paper proposes an approach based on mutual information that mines the competitors of an entity (such as a company, product, or person) from the web. The proposed techniques first extract a set of candidate competitors for the input entity, then rank them according to their comparability, and finally find and organize the reviews related to both the original entity and its competitors. A novel system called "CoDis" is implemented on top of these techniques; it automates this tedious process in a domain-independent, web-scale, and dynamic manner. In the experiment, 32 entities from varied domains are used as inputs, and CoDis discovers 143 competitors. The experimental results show that the proposed techniques are highly effective.
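The abstract does not give the scoring formula, so the following is only a minimal sketch of how a mutual-information ranking step might look, assuming the common pointwise mutual information (PMI) estimate from page counts. The function names `pmi` and `rank_candidates`, the hit-count inputs, and the `total_pages` normaliser are all hypothetical and are not taken from CoDis.

```python
import math


def pmi(joint_hits, entity_hits, candidate_hits, total_pages):
    """Pointwise mutual information (in bits) estimated from page counts.

    Assumption: probabilities are approximated as hits / total_pages.
    Returns -inf when any count is zero, so such candidates rank last.
    """
    if joint_hits == 0 or entity_hits == 0 or candidate_hits == 0:
        return float("-inf")
    p_joint = joint_hits / total_pages
    p_entity = entity_hits / total_pages
    p_candidate = candidate_hits / total_pages
    return math.log2(p_joint / (p_entity * p_candidate))


def rank_candidates(entity_hits, candidates, total_pages):
    """Sort candidates by PMI with the input entity, highest first.

    `candidates` maps a candidate name to a pair
    (candidate_hits, joint_hits_with_entity).
    """
    scored = [
        (name, pmi(joint, entity_hits, hits, total_pages))
        for name, (hits, joint) in candidates.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    # Made-up counts, purely for illustration.
    counts = {"A": (100, 40), "B": (200, 20), "C": (50, 0)}
    for name, score in rank_candidates(100, counts, 1000):
        print(name, score)
```

In this sketch a candidate that co-occurs with the entity more often than chance predicts receives a positive score, while one that never co-occurs is pushed to the bottom. A real system would also need the candidate extraction and review organisation steps, which are not sketched here.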