Towards automatic domain classification of technical terms: estimating domain specificity of a term using the web

  • Authors:
  • Takehito Utsuro;Mitsuhiro Kida;Masatsugu Tonoike;Satoshi Sato

  • Affiliations:
  • Graduate School of Systems and Information Engineering, University of Tsukuba, Tsukuba, Japan;Nintendo Co.,Ltd., Kyoto-shi, Japan;Graduate School of Informatics, Kyoto University, Kyoto, Japan;Graduate School of Engineering, Nagoya University, Nagoya, Japan

  • Venue:
  • AIRS'06 Proceedings of the Third Asia conference on Information Retrieval Technology
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper proposes a method of domain specificity estimation of technical terms using the Web. In the proposed method, it is assumed that, for a certain technical domain, a list of known technical terms of the domain is given. Technical documents of the domain are collected through the Web search engine, which are then used for generating a vector space model for the domain. The domain specificity of a target term is estimated according to the distribution of the domain of the sample pages of the target term. Experimental evaluation results show that the proposed method achieved mostly 90% precision/recall.