Bibliographic attribute extraction is an important research topic for digital libraries. In this paper we propose a rule-based method for bibliographic attribute extraction with Layer-upon-Layer Tagging (LLT). The method analyzes the appearance of bibliographic attributes and their punctuation to perform format tagging and semantic tagging on two defined parsing layers. The method also relies on specifically constructed lexicons to achieve high accuracy in semantic tagging. In an experimental evaluation on 1,000 reference strings, the accuracy of author tagging reaches 96.8% and the accuracy of whole-reference tagging is 82.9%. The experimental results demonstrate that the proposed LLT method can tag bibliographic attributes in reference strings with a high degree of accuracy.
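To make the two-layer idea concrete, the following is a minimal Python sketch of format tagging followed by lexicon-assisted semantic tagging. It is not the LLT implementation: the abstract does not give the actual rules, tag sets, or lexicons, so the function names (`format_tag`, `tokenize`, `semantic_tag`), the tag names, the tiny `VENUE_WORDS` lexicon, and the sample reference string are all assumptions made for illustration.

```python
import re

# Hypothetical mini-lexicon; the paper's real lexicons are not given in the abstract.
VENUE_WORDS = {"proceedings", "journal", "conference", "transactions"}


def tokenize(reference):
    """Split a reference string into initials ("J."), words/numbers, and punctuation."""
    return re.findall(r"[A-Z]\.|\w+|[^\w\s]", reference)


def format_tag(token):
    """Layer 1 (format tagging): label a token by its surface form only."""
    if re.fullmatch(r"(19|20)\d{2}", token):
        return "YEAR"
    if re.fullmatch(r"\d+", token):
        return "NUM"
    if re.fullmatch(r"[A-Z]\.", token):
        return "INITIAL"
    if re.fullmatch(r"[A-Z][a-z]+", token):
        return "CAPWORD"
    if re.fullmatch(r"[^\w\s]", token):
        return "PUNCT"
    return "WORD"


def semantic_tag(reference):
    """Layer 2 (semantic tagging): group tokens into fields and label each field."""
    tagged = [(tok, format_tag(tok)) for tok in tokenize(reference)]

    # A standalone period closes a field; periods inside initials stay attached.
    fields, current = [], []
    for tok, ftag in tagged:
        if tok == ".":
            if current:
                fields.append(current)
            current = []
        else:
            current.append((tok, ftag))
    if current:
        fields.append(current)

    result = []
    for field in fields:
        years = [t for t, f in field if f == "YEAR"]
        rest = [(t, f) for t, f in field if f not in ("YEAR", "PUNCT")]
        if rest:
            words = [t for t, _ in rest]
            if any(f == "INITIAL" for _, f in rest):
                label = "AUTHOR"   # format cue: initials suggest a name list
            elif any(w.lower() in VENUE_WORDS for w in words):
                label = "VENUE"    # lexicon cue
            else:
                label = "TITLE"    # fallback when no cue fires
            result.append((label, " ".join(words)))
        result.extend(("YEAR", y) for y in years)
    return result


if __name__ == "__main__":
    # Made-up reference string, used only to exercise the sketch.
    sample = "J. Smith and A. Lee. Parsing citations with rules. Journal of Examples, 2009."
    for label, text in semantic_tag(sample):
        print(label, "->", text)
```

In this sketch the first layer looks only at surface form and punctuation, and the second layer combines those format tags with a lexicon lookup. That mirrors the division of labor the abstract describes, but the specific rules are far simpler than what a real system would need.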