Extraction of Vectorized Graphical Information from Scientific Chart Images

  • Authors:
  • W. Huang;R. Liu;C.-L. Tan

  • Affiliations:
  • National University of Singapore;National University of Singapore;National University of Singapore

  • Venue:
  • ICDAR '07 Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 01
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Graphical components information extraction is a crucial step in the chart recognition and understanding process. However, existing methods of information extraction from chart images either are type-dependent or rely on certain assumptions. In this paper, we present a general method to extract vectorized graphical information from scientific chart images. Our algorithm firstly constructs a data structure called directional single-connected chains (DSCC). It then employs ellipse-specific fitting and orthogonal diagonalization to calculate the curvatures of the chains and classify the chains into either straight lines or arcs. Finally we combine all straight lines and all arcs accordingly and use linear regression to compute their attributes. The DSCC has a good property in that it is less susceptible to noise. The experiment results show that our algorithm is efficient, robust and accurate.