Improving term extraction by utilizing user annotations

  • Authors:
  • Jozef Harinek;Marián Šimko

  • Affiliations:
  • Slovak University of Technology in Bratislava, Bratislava, Slovakia;Slovak University of Technology in Bratislava, Bratislava, Slovakia

  • Venue:
  • Proceedings of the 2013 ACM symposium on Document engineering
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Automated acquisition of relevant domain terms from educational documents available in social educational systems can benefit from processing a growing number of user-created annotations assigned to the content. Annotations provide us potentially useful information about documents and can improve the results of base Automatic Term Recognition (ATR) algorithms. We propose a method for relevant domain terms extraction based on user-created annotations processing. We consider three basic annotation types: tags, comments and highlights. The final term weight is computed by combining relevant domain terms weights obtained from the individual annotation types and those obtained from the text. The method was evaluated using data from Principles of Software Engineering course in adaptive educational system ALEF and showed that enhancements based on annotation processing yield significant improvement of results.