Automatic discovery of word semantic relations using paraphrase alignment and distributional lexical semantics analysis

  • Authors:
  • Gaël Dias; Rumen Moraliyski; João Cordeiro; Antoine Doucet; Helena Ahonen-Myka

  • Affiliations:
  • Centre for HLT and Bioinformatics, Department of Computer Science, University of Beira Interior, 6201-001 Covilhã, Portugal (emails: ddg@di.ubi.pt, rumen@penhas.di.ubi.pt, jpaulo@di.ubi.pt); Campus Côte de Nacre, Boulevard du Maréchal Juin, University of Caen, BP 5186, 14032 Caen Cedex, France (email: doucet@info.unicaen.fr); Department of Computer Science, University of Helsinki, P.O. Box 68 (Gustaf Hällströmin katu 2b), FI-00014, Helsinki, Finland (email: helena.ahonen-myka@cs.helsinki.fi)

  • Venue:
  • Natural Language Engineering
  • Year:
  • 2010

Abstract

Thesauri, which list the most salient semantic relations between words, have mostly been compiled manually. Therefore, the inclusion of an entry depends on the subjective decision of the lexicographer. As a consequence, those resources are usually incomplete. In this paper, we propose an unsupervised methodology to automatically discover pairs of semantically related words by highlighting their local environment and evaluating their semantic similarity in local and global semantic spaces. This proposal differs from all other research presented so far in that it tries to take the best of two different methodologies, i.e. semantic space models and information extraction models. In particular, it can be applied to extract close semantic relations, it limits the search space to a few highly probable options, and it is unsupervised.