Math Spotting: Retrieving Math in Technical Documents Using Handwritten Query Images

  • Authors:
  • Richard Zanibbi;Li Yu

  • Affiliations:
  • -;-

  • Venue:
  • ICDAR '11 Proceedings of the 2011 International Conference on Document Analysis and Recognition
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

A method for locating mathematical expressions in document images without the use of optical character recognition is presented. An index of document regions is produced from recursive X-Y trees produced for each page in the corpus. Queries are provided as images of handwritten expressions, for which an X-Y tree is computed. During retrieval, the query is looked up in the document region index using features of its X-Y tree, producing a set of candidate regions. Candidate regions are ranked by the similarity of vertical pixel projections in their upper and lower halves with those of the query image, as computed using Dynamic Time Warping of the image columns. In an experiment, ten participants each wrote twenty queries from a 200-page corpus. On average, the top-10 retrieval candidates included a candidate covering 43.3% of the test query image (σ = 14.0), with the correct page being returned between 30.0% and 85.0% of the time across participants (μ = 63.2%, s = 14.9%). When testing using the original query images, 90.0% of the queries were retrieved correctly.