Combining evidence for automatic extraction of terms

  • Authors:
  • Boris Dobrov;Natalia Loukachevitch

  • Affiliations:
  • Research Computing Center of Lomonosov Moscow State University, Moscow, Russia;Research Computing Center of Lomonosov Moscow State University, Moscow, Russia

  • Venue:
  • PReMI'11 Proceedings of the 4th international conference on Pattern recognition and machine intelligence
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

The paper describes the method of extraction of two-word domain terms combining their features. The features are computed from three sources: the occurrence statistics in a domain-specific text collection, the statistics of global search engines, and a domain-specific thesaurus. The evaluation of the approach is based on the terminology of manually created thesauri. We show that the use of multiple features considerably improves the automatic extraction of domain-specific terms. We compare the quality of the proposed method in two different domains.