Communications of the ACM
Modern Information Retrieval
Algorithms for estimating relative importance in networks
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Proceedings of the 13th international conference on World Wide Web
Two supervised learning approaches for name disambiguation in author citations
Proceedings of the 4th ACM/IEEE-CS joint conference on Digital libraries
Iterative record linkage for cleaning and integration
Proceedings of the 9th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery
Can pseudonymity really guarantee privacy?
SSYM'00 Proceedings of the 9th conference on USENIX Security Symposium - Volume 9
Finding lists of people on the web
ACM SIGCAS Computers and Society
Towards compatible primitive structures
Journal of Experimental & Theoretical Artificial Intelligence - Special issue: conceptual graphs workshop
Adaptive graphical approach to entity resolution
Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
Stylometric Identification in Electronic Markets: Scalability and Robustness
Journal of Management Information Systems
idMesh: graph-based disambiguation of linked data
Proceedings of the 18th international conference on World wide web
Analysis of tag within online social networks
Proceedings of the ACM 2009 international conference on Supporting group work
I seek you: searching and matching individuals in social networks
Proceedings of the eleventh international workshop on Web information and data management
Analysis of dynamic social network: e-mail messages exchange network
Proceedings of the 11th International Conference on Information Integration and Web-based Applications & Services
ACM Transactions on Information Systems (TOIS)
Groups without tears: mining social topologies from email
Proceedings of the 16th international conference on Intelligent user interfaces
Robustness of dynamic social networks
Journal of Mobile Multimedia
Towards alias detection without string similarity: an active learning based approach
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Combining Entity Matching Techniques for Detecting Extremist Behavior on Discussion Boards
ASONAM '12 Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012)
Toward detection of aliases without string similarity
Information Sciences: an International Journal
Hi-index | 0.01 |
This research addresses the problem of correctly relating aliases that belong to the same entity. Previous approaches focused on natural language processing and structured data, whereas in this research we analyze the local association, or "social" network in which aliases reside. The network is constructed from email data mined from the Internet. Links in the network represent web pages on which two email addresses are collocated. The problem is defined as given social network S, constructed from email address collocations, and an email address E, identify any aliases for E that also appear in S. The alias detection methods are evaluated on a data set of over 14,000 University X email addresses for which ground truth relations are known. The results are reported as partial lists of k choices for possible aliases, ranked by predicted relational strength within the network. Given a source email address, a portion of all email addresses, 2%, are correctly linked to another alias that corresponds to the same entity by best rank, which is significantly better than random (0.007%) and a geodesic distance (1%) baseline prediction. Correct linkages increase to 15% and 30% within top-10 (0.07% of all emails) and top-100 rank lists (0.7% of all emails), respectively.