Exploring graph structure for detection of reliability zones within synonym resources: experiment with the gene ontology

  • Authors:
  • Thierry Hamon; Natalia Grabar

  • Affiliations:
  • LIPN -- UMR, Université Paris -- CNRS, Villetaneuse, France; Université Paris Descartes, Paris, France

  • Venue:
  • BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing
  • Year:
  • 2009

Abstract

Computing the semantic similarity between terms relies on the existence and use of semantic resources. However, these resources, often composed of equivalent units, or synonyms, must first be analyzed and weighted in order to define within them the reliability zones where semantic cohesiveness is stronger. We propose an original method for acquiring elementary synonyms, based on the exploitation of structured terminologies and on the analysis of the syntactic structure of complex (multi-unit) terms and their compositionality. The acquired synonyms are then profiled using endogenous lexical and linguistic indicators (other types of relations, lexical inclusions, productivity), which are automatically inferred within the same terminologies. Additionally, synonymy relations are observed within a graph, whose structure is then analyzed. In particular, we explore the usefulness of graph-theoretic notions such as connected component, clique, density, bridge, articulation vertex, and vertex centrality.
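
The graph notions named in the abstract can be illustrated on a toy example. The following is a minimal sketch, not the authors' implementation: the synonym pairs are invented for illustration, all function names are hypothetical, and bridges and articulation vertices are found by brute-force removal, which is only practical for small graphs.

```python
from itertools import combinations

# Invented synonym pairs, for illustration only (not data from the paper).
SYNONYM_PAIRS = [
    ("synthesis", "biosynthesis"),
    ("biosynthesis", "formation"),
    ("synthesis", "formation"),
    ("formation", "anabolism"),
    ("breakdown", "degradation"),
]

def build_graph(vertices, edges):
    """Undirected adjacency map: vertex -> set of neighbours."""
    adj = {v: set() for v in vertices}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    return adj

def connected_components(adj):
    """List of vertex sets, found by iterative depth-first search."""
    seen, components = set(), []
    for start in adj:
        if start in seen:
            continue
        stack, comp = [start], set()
        while stack:
            v = stack.pop()
            if v in comp:
                continue
            comp.add(v)
            stack.extend(adj[v] - comp)
        seen |= comp
        components.append(comp)
    return components

def density(adj, comp):
    """Share of possible edges inside comp that are present (1.0 = clique)."""
    n = len(comp)
    if n < 2:
        return 0.0
    present = sum(1 for u, v in combinations(comp, 2) if v in adj[u])
    return present / (n * (n - 1) / 2)

def count_components(vertices, edges):
    return len(connected_components(build_graph(vertices, edges)))

def articulation_vertices(vertices, edges):
    """Vertices whose removal increases the number of components."""
    base = count_components(vertices, edges)
    found = []
    for x in vertices:
        rest = [v for v in vertices if v != x]
        kept = [(u, v) for u, v in edges if x not in (u, v)]
        if count_components(rest, kept) > base:
            found.append(x)
    return found

def bridges(vertices, edges):
    """Edges whose removal increases the number of components."""
    base = count_components(vertices, edges)
    return [e for e in edges
            if count_components(vertices, [f for f in edges if f != e]) > base]

def degree_centrality(adj):
    """Degree of each vertex divided by the maximum possible degree."""
    n = len(adj)
    if n < 2:
        return {v: 0.0 for v in adj}
    return {v: len(nb) / (n - 1) for v, nb in adj.items()}

VERTICES = sorted({v for pair in SYNONYM_PAIRS for v in pair})
GRAPH = build_graph(VERTICES, SYNONYM_PAIRS)
```

In this toy graph, the triangle formed by "synthesis", "biosynthesis" and "formation" is a clique (density 1.0), which would correspond to a zone of strong cohesiveness, whereas "anabolism" is attached only through a bridge at the articulation vertex "formation".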