Asymmetric information distances for automated taxonomy construction

  • Authors:
  • Wei Lee Woon;Stuart Madnick

  • Affiliations:
  • Masdar Institute of Science and Technology, MASDAR, P.O. Box 54224, Abu Dhabi, UAE;M.I.T., Sloan School of Management, E53-321, 02139, Cambridge, MA, USA

  • Venue:
  • Knowledge and Information Systems
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

A novel method for automatically constructing taxonomies for specific research domains is presented. The proposed methodology uses term co-occurrence frequencies as an indicator of the semantic closeness between terms. To support the automated creation of taxonomies or subject classifications we present a simple modification to the basic distance measure, and describe a set of procedures by which these measures may be converted into estimates of the desired taxonomy. To demonstrate the viability of this approach, a pilot study on renewable energy technologies is conducted, where the proposed method is used to construct a hierarchy of terms related to alternative energy. These techniques have many potential applications, but one activity in which we are particularly interested is the mapping and subsequent prediction of future developments in the technology and research.