A sequential algorithm for training text classifiers
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Using latent semantic analysis to find different names for the same entity in free text
Proceedings of the 4th international workshop on Web information and data management
Email alias detection using social network analysis
Proceedings of the 3rd international workshop on Link discovery
Mining for personal name aliases on the web
Proceedings of the 17th international conference on World Wide Web
Toward detection of aliases without string similarity
Information Sciences: an International Journal
Discovering emerging entities with ambiguous names
Proceedings of the 23rd international conference on World wide web
Hi-index | 0.00 |
Entity aliases commonly exist and accurately detecting these aliases plays a vital role in various applications. In this paper, we use an active-learning-based method to detect aliases without string similarity. To minimize the cost on pairwise comparison, a subset-based method restricts the alias selection within a small-scale entity set. Within each generated entity set, an active learning based logistic regression classifier is employed to predict whether a candidate is the alias of a given entity. The experimental results on three datasets clearly demonstrate that our proposed approach can effectively detect this kind of entity aliases.