Corpus-based semantic class mining: distributional vs. pattern-based approaches

  • Authors:
  • Shuming Shi;Huibin Zhang;Xiaojie Yuan;Ji-Rong Wen

  • Affiliations:
  • Microsoft Research Asia;Nankai University;Nankai University;Microsoft Research Asia

  • Venue:
  • COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Main approaches to corpus-based semantic class mining include distributional similarity (DS) and pattern-based (PB). In this paper, we perform an empirical comparison of them, based on a publicly available dataset containing 500 million web pages, using various categories of queries. We further propose a frequency-based rule to select appropriate approaches for different types of terms.