Bibliographic Attributes Extraction with Layer-upon-Layer Tagging

  • Authors:
  • W. Wei;I. King;J. H-M. Lee

  • Affiliations:
  • Royal Institute of Technology, Sweden;Chinese University of Hong Kong;Chinese University of Hong Kong

  • Venue:
  • ICDAR '07 Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 02
  • Year:
  • 2007

Quantified Score

Hi-index 0.01

Visualization

Abstract

Bibliographic attributes extraction is an important re- search topic for digital libraries. In this paper we pro- pose a rule-based method for bibliographic attributes ex- traction with Layer-upon-Layer Tagging (LLT). The method analyzes bibliographic attributes' appearances and punc- tuations to perform format and semantic taggings on two defined parsing layers. The method also resolves to specif- ically constructed lexicons to achieve high accuracy of se- mantic tagging. In the experimental evaluation on 1,000 reference strings, the accuracy of author tagging reaches to 96.8% and the accuracy of whole reference tagging is 82.9%. The experimental results demonstrate that the pro- posed LLT method can tag bibliographic attributes in refer- ence strings with high degree of accuracy.