Generating Ground Truthed Dataset of Chart Images: Automatic or Semi-automatic?

  • Authors:
  • Weihua Huang;Chew Lim Tan;Jiuzhou Zhao

  • Affiliations:
  • School of Computing, National University of Singapore, Singapore 117543;School of Computing, National University of Singapore, Singapore 117543;School of Computing, National University of Singapore, Singapore 117543

  • Venue:
  • Graphics Recognition. Recent Advances and New Opportunities
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Ground truthing tools mainly fall into two categories: automatic and semi-automatic. In this paper, we first discuss the pros and cons of the two approaches. We then report our own work on designing and implementing systems for generating a chart image dataset and multi-level ground truth data. Both semi-automatic and automatic approaches were adopted, resulting in two independent systems. The dataset as well as the ground truth data are publicly available so that other researchers can access them for evaluating and comparing performances of different systems.