A model for evaluating the quality of user-created documents

  • Authors:
  • Linh Hoang;Jung-Tae Lee;Young-In Song;Hae-Chang Rim

  • Affiliations:
  • Dept. of Computer and Radio Communications Engineering, Korea University, Seoul, Korea;Dept. of Computer and Radio Communications Engineering, Korea University, Seoul, Korea;Dept. of Computer and Radio Communications Engineering, Korea University, Seoul, Korea;Dept. of Computer and Radio Communications Engineering, Korea University, Seoul, Korea

  • Venue:
  • AIRS'08 Proceedings of the 4th Asia information retrieval conference on Information retrieval technology
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we propose a model for evaluating the quality of general user-created documents. The model is based on supervised classification approach, in which output scores are considered as quality of given document. In order to utilize both textual and nontextual attributes of documents, we incorporated a number of objectively measurable, real-valued features selected upon predefined criteria for quality. Experiments on two datasets of real world documents show that textual features are stable indicators for evaluating documents' quality. Some features are inferred to be effective for general kinds of documents.