A symbolic approach to automatic multiword term structuring

  • Authors:
  • Eric SanJuan;James Dowdall;Fidelia Ibekwe-SanJuan;Fabio Rinaldi

  • Affiliations:
  • LITA, Université Paul Verlaine & URI-INIST/CNRS, F-54514 France;NLP Group, Dept. of Informatics, University of Sussex, BN1 9RH, UK;University of LyonIII, 69007, France;Institute of Computational Linguistics, University of Zurich, CH-8050, Switzerland

  • Venue:
  • Computer Speech and Language
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a three-level structuring of multiword terms basing on lexical inclusion, WordNet similarity and a clustering approach. Term clustering by automatic data analysis methods offers an interesting way of organizing a domain's knowledge structure, useful for several information-oriented tasks like science and technology watch, textmining, computer-assisted ontology population, Question Answering (Q-A). This paper explores how this three-level term structuring brings to light the knowledge structures from a corpus of genomics and compares the mapping of the domain topics against a hand-built ontology (the GENIA ontology). Ways of integrating the results into a Q-A system are discussed.