Self organizing maps in NLP: exploration of coreference feature space

  • Authors:
  • Andre Burkovski;Wiltrud Kessler;Gunther Heidemann;Hamidreza Kobdani;Hinrich Schütze

  • Affiliations:
  • University of Stuttgart, Stuttgart, Germany;University of Stuttgart, Stuttgart, Germany;University of Stuttgart, Stuttgart, Germany;Institute for Natural Language Processing, University of Stuttgart, Stuttgart, Germany;Institute for Natural Language Processing, University of Stuttgart, Stuttgart, Germany

  • Venue:
  • WSOM'11 Proceedings of the 8th international conference on Advances in self-organizing maps
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

In Natural Language Processing, large annotated data sets are needed to train language models using supervised machine learning methods. To obtain such labeled data sets, time consuming manual annotation is required. To facilitate this process, we propose a SOM-based approach: The SOM sorts the data through unsupervised training, mapping the space of linguistic features to a 2D-grid. The grid visualization is used for efficient interactive labeling of the data clusters. In addition, the interactive SOM visualization allows computational linguists to explore the topology of the feature space and design new features.