Managing very large document collections using semantics

  • Authors:
  • GuoRen Wang;HongJun Lu;Ge Yu;YuBin Bao

  • Affiliations:
  • Department of Computer Science, Northeastern University, Shenyang 110004, P.R. China;Department of Computer Science, Hong Kong University of Science and Technology, P.R. China;Department of Computer Science, Northeastern University, Shenyang 110004, P.R. China;Department of Computer Science, Northeastern University, Shenyang 110004, P.R. China

  • Venue:
  • Journal of Computer Science and Technology
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, a system is presented where documents are no longer identified by their file names. Instead, a document is represented by its semantics in terms of descriptor and content vector. The descriptor of a document consists of a set of attributes, such as date of creation, its type, its size, annotations, etc. The content vector of a document consists of a set of terms extracted from the document. In this paper, a semantic document management system XBASE is designed and implemented based on the semantics and the functions of three main modules, X-Loader, X-Explorer and X-Query.