Maximum entropy modeling in sparse semantic tagging

  • Authors:
  • Jia Cui;David Guthrie

  • Affiliations:
  • Johns Hopkins University, Baltimore, MD;University of Sheffield, Sheffield, UK

  • Venue:
  • HLT-SRWS '04 Proceedings of the Student Research Workshop at HLT-NAACL 2004
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this work, we are concerned with a coarse grained semantic analysis over sparse data, which labels all nouns with a set of semantic categories. To get the benefit of unlabeled data, we propose a bootstrapping framework with Maximum Entropy modeling (MaxEnt) as the statistical learning component. During the iterative tagging process, unlabeled data is used not only for better statistical estimation, but also as a medium to integrate non-statistical knowledge into the model training. Two main issues are discussed in this paper. First, Association Rule principles are suggested to guide MaxEnt feature selections. Second, to guarantee the convergence of the boot-strapping process, three adjusting strategies are proposed to soft tag unlabeled data.