Statistical Classification of Scientific Publications

  • Authors:
  • Vaidas Balys;Rimantas Rudzkis

  • Affiliations:
  • -;Vilnius University Institute of Mathematics and Informatics, Akademijos 4, LT-08663 Vilnius, Lithuania, E-mail: rudzkis@ktl.mii.lt

  • Venue:
  • Informatica
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

The problem of automatic classification of scientific texts is considered. Methods based on statistical analysis of probabilistic distributions of scientific terms in texts are discussed. The procedures for selecting the most informative terms and the method of making use of auxiliary information related to the terms positions are presented. The results of experimental evaluation of proposed algorithms and procedures over real-world data are reported.