On-demand extraction of domain concepts and relationships from social tagging websites

  • Authors:
  • Vijayan Sugumaran;Sandeep Purao;Veda C. Storey;Jordi Conesa

  • Affiliations:
  • School of Business Administration, Oakland University, Rochester, MI and Department of Service Systems Management and Engineering, Sogang University, Seoul, South Korea;School of Information, University of Washington, Seattle, WA and College of Information Sciences & Technology, The Pennsylvania State University, University Park State College, PA;Department of Computer Information Systems, J. Mack Robinson College of Business, Georgia State University, Atlanta, GA;Estudis d'Informatica i Multimedia, Universitat Oberta de Catalunya, Barcelona, Spain

  • Venue:
  • NLDB'10 Proceedings of the Natural language processing and information systems, and 15th international conference on Applications of natural language to information systems
  • Year:
  • 2010

Abstract

Much content on the World Wide Web is becoming tagged with simple words or phrases in natural language as web citizens create tags that organize information, primarily to facilitate their personal retrieval and use. These tags represent pieces of knowledge, often incomplete, about concepts in a domain. Aggregated across a large number of contributors, these tags offer the potential to identify, in a bottom-up manner, key constructs in a domain. This research develops a set of heuristics that aggregate and analyze tags contributed by individual users on the web to extract and generate domain-level constructs. The heuristics infer the existence of constructs and distinguish entities, attributes, and relationships.
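
The abstract does not spell out the heuristics themselves, so the following is only a minimal illustrative sketch of the general idea of bottom-up tag aggregation, not the authors' method. It assumes tagging data arrives as (user, resource, tag) triples, treats a tag used by several distinct users as a candidate concept, and treats two candidate concepts that co-occur on the same resource as a candidate relationship. The sample data, the function names, and the thresholds `min_users` and `min_resources` are all hypothetical, and the sketch makes no attempt to distinguish entities from attributes.

```python
from collections import Counter, defaultdict
from itertools import combinations

# Hypothetical sample data: (user, resource, tag) triples.
TAGGINGS = [
    ("u1", "r1", "hotel"),
    ("u1", "r1", "booking"),
    ("u2", "r1", "hotel"),
    ("u2", "r1", "Booking"),
    ("u3", "r2", "hotel"),
    ("u3", "r2", "price"),
    ("u1", "r2", "price"),
    ("u4", "r3", "misc"),
]


def normalize(tag):
    """Lowercase and trim a tag so that spelling variants are merged."""
    return tag.strip().lower()


def candidate_concepts(taggings, min_users=2):
    """Return {tag: distinct-user count} for tags used by at least min_users users.

    Assumption (not from the paper): agreement across independent
    contributors is taken as evidence that a tag names a domain concept.
    """
    users_per_tag = defaultdict(set)
    for user, _resource, tag in taggings:
        users_per_tag[normalize(tag)].add(user)
    return {
        tag: len(users)
        for tag, users in users_per_tag.items()
        if len(users) >= min_users
    }


def candidate_relationships(taggings, concepts, min_resources=1):
    """Return {(tag_a, tag_b): resource count} for co-occurring candidate concepts.

    Assumption (not from the paper): two concepts attached to the same
    resource are treated as possibly related.
    """
    tags_per_resource = defaultdict(set)
    for _user, resource, tag in taggings:
        tag = normalize(tag)
        if tag in concepts:
            tags_per_resource[resource].add(tag)

    pair_counts = Counter()
    for tags in tags_per_resource.values():
        # Sort so each unordered pair has a single canonical key.
        for pair in combinations(sorted(tags), 2):
            pair_counts[pair] += 1
    return {pair: n for pair, n in pair_counts.items() if n >= min_resources}


if __name__ == "__main__":
    concepts = candidate_concepts(TAGGINGS)
    print(concepts)
    print(candidate_relationships(TAGGINGS, concepts))
```

Counting distinct users rather than raw tag occurrences is a deliberate choice in this sketch: it keeps one prolific tagger from promoting a personal label to a domain-level concept.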