Building quality into a digital library

  • Authors:
  • Hussein Suleman;Edward A. Fox;Marc Abrams

  • Affiliations:
  • Department of Computer Science, Virginia Tech, Blacksburg, VA;Department of Computer Science, Virginia Tech, Blacksburg, VA;Department of Computer Science, Virginia Tech, Blacksburg, VA

  • Venue:
  • DL '00 Proceedings of the fifth ACM conference on Digital libraries
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

The Web Characterization Repository contains a collection of internet log files used by researchers to analyze and improve on the architecture of the Web. This repository improves on prior collections by thoroughly testing the log files for format to assure a degree of data quality. Instituting quality control into the digital library addressed many complex issues including technical support for quality assessment, the definition of a workflow to achieve quality control, the assignment of tasks to different people and the definition and automation of quality assessment for log files. By reaching realistic compromises on these issues it was possible to build quality control as an integral part of the digital library.