Pulling their weight: exploiting syntactic forms for the automatic identification of idiomatic expressions in context

  • Authors:
  • Paul Cook;Afsaneh Fazly;Suzanne Stevenson

  • Affiliations:
  • University of Toronto, Toronto, Canada;University of Toronto, Toronto, Canada;University of Toronto, Toronto, Canada

  • Venue:
  • MWE '07 Proceedings of the Workshop on a Broader Perspective on Multiword Expressions
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Much work on idioms has focused on type identification, i.e., determining whether a sequence of words can form an idiomatic expression. Since an idiom type often has a literal interpretation as well, token classification of potential idioms in context is critical for NLP. We explore the use of informative prior knowledge about the overall syntactic behaviour of a potentially-idiomatic expression (type-based knowledge) to determine whether an instance of the expression is used idiomatically or literally (token-based knowledge). We develop unsupervised methods for the task, and show that their performance is comparable to that of state-of-the-art supervised techniques.