A new model of document structure analysis

  • Authors:
  • Zhiqi Wang;Yongcheng Wang;Kai Gao

  • Affiliations:
  • Department of Computer Science and Technology, Shanghai Jiao Tong University, Shanghai, P.R. China;Department of Computer Science and Technology, Shanghai Jiao Tong University, Shanghai, P.R. China;Department of Computer Science and Technology, Shanghai Jiao Tong University, Shanghai, P.R. China

  • Venue:
  • FSKD'05 Proceedings of the Second international conference on Fuzzy Systems and Knowledge Discovery - Volume Part II
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

The purpose of document structure analysis is to get the document structure of the source text. Document structure is defined as 3 layers in the paper. A new model of document structure analysis — DLM is proposed. The model is composed of three layers: physical structure layer, logical structure layer and semantic structure layer, which are corresponding to the definition of the document structure. The input, output and operation of each layer are illustrated in details in the paper. The model has the feature of flexible, systematic and extendible. DLM is implemented on the Automatic Summarization System. It shows that the model is feasible and good result can be achieved.