Piggyback: using search engines for robust cross-domain named entity recognition

  • Authors:
  • Stefan Rüd;Massimiliano Ciaramita;Jens Müller;Hinrich Schütze

  • Affiliations:
  • University of Stuttgart, Germany;Google Research, Zürich, Switzerland;University of Stuttgart, Germany;University of Stuttgart, Germany

  • Venue:
  • HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

We use search engine results to address a particularly difficult cross-domain language processing task, the adaptation of named entity recognition (NER) from news text to web queries. The key novelty of the method is that we submit a token with context to a search engine and use similar contexts in the search results as additional information for correctly classifying the token. We achieve strong gains in NER performance on news, in-domain and out-of-domain, and on web queries.