Scientific literature metadata extraction based on HMM

  • Authors:
  • Binge Cui

  • Affiliations:
  • College of Information Science and Engineering, Shandong University of Science and Technology, Qingdao, Shandong, P.R. China

  • Venue:
  • CDVE'09 Proceedings of the 6th international conference on Cooperative design, visualization, and engineering
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Metadata serves as an important role in the archiving, management and sharing of the scientific literatures. It consists of title, authors, affiliation, address, email, abstract, keywords, etc. However, the metadata is usually easy-to-read for human and difficult-to-recognize for computers. In this paper, we propose to improve Viterbi algorithm based on text blocks instead of words, increase the precision and recall based on unique characteristics of metadata items.