Corpus-Based semantic filtering in discovering derivational relations

  • Authors:
  • Maciej Piasecki;Radosław Ramocki;Paweł Minda

  • Affiliations:
  • G4.19 Research Group, Institute of Informatics, Wrocław University of Technology, Poland;G4.19 Research Group, Institute of Informatics, Wrocław University of Technology, Poland;G4.19 Research Group, Institute of Informatics, Wrocław University of Technology, Poland

  • Venue:
  • AIMSA'12 Proceedings of the 15th international conference on Artificial Intelligence: methodology, systems, and applications
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Derivational relations are an important part of the lexical semantics system in many languages, especially those of rich inflection. They represent wide variety of semantic oppositions. Analysis of morphological word forms in terms of prefixes and suffixes provides limited information about their semantics. We propose a method of semantic classification of the potential derivational pairs. The method is based on supervised learning, but requires only a list of word pairs assigned to the derivational relations. The classification was based on a combination of features describing distribution of a derivative and derivational base in a large corpus together with their morphological and morpho-syntactic properties. The method does not use patterns based on close co-occurrence of a derivative and its base. Two classification schemes were evaluated: a multiclass and a cascade of binary classifiers, both expressed good performance in experiments on the selected nominal derivational relations.