Semi-supervised OWA aggregation for link-based similarity evaluation and alias detection

  • Authors:
  • Tossapon Boongoen;Qiang Shen

  • Affiliations:
  • Department of Computer Science, Aberystwyth University, UK;Department of Computer Science, Aberystwyth University, UK

  • Venue:
  • FUZZ-IEEE'09 Proceedings of the 18th international conference on Fuzzy Systems
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Within the past decades, many fuzzy aggregation techniques, ordered weighted averaging (OWA) in particular, ave proven effective for a wide range of information processing tasks, such as decision making, image analysis, database and machine learning. Despite reported successes, their potentials have yet to be explored for the emerging problem of link analysis, which aims to discover similarity and relations amongst objects through their associations. Recently, several link-based similarity methods have been put forward to identifying similar objects in the Internet and publication domains. However, these techniques only take into account the cardinality property of a link structure that is highly sensitive to noise and causes a great number of false positives. In light of such challenge, this paper presents a novel OWA aggregation model that is capable of efficiently deriving a similarity measure through the integration of multiple link properties. The underlying approach is based on the methodology of stress function by which the aggregation behavior can be easily interpreted and modeled. In addition, a semi-supervised method is introduced to assist a user in designing a stress function, i.e. the weighting scheme of link properties, appropriate for a particular link network. The application of the OWA aggregation approach to alias detection is demonstrated and evaluated, against state-of-art link-based techniques, over datasets specifically related toterrorism, publication and email domains.