Recognizing Mathematical Expressions Using Tree Transformation

Authors:
Richard Zanibbi;Dorothea Blostein;James R. Cordy
Affiliations:
-;-;-
Venue:
IEEE Transactions on Pattern Analysis and Machine Intelligence
Year:
2002

Citing 17
Cited 34

TXL: a rapid prototyping system for programming language dialects

Computer Languages
Optical character recognition for typeset mathematics

ISSAC '94 Proceedings of the international symposium on Symbolic and algebraic computation
Transformation-based error-driven learning and natural language processing: a case study in part-of-speech tagging

Computational Linguistics
A Parsing Methodology for the Implementation of Visual Systems

IEEE Transactions on Software Engineering
Ambiguity and constraint in mathematical expression recognition

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Empirical Performance Evaluation of Graphics Recognition Systems

IEEE Transactions on Pattern Analysis and Machine Intelligence
The Mathematica book (4th edition)

The Mathematica book (4th edition)
Statistical Language Learning

Statistical Language Learning
Aiding manipulation of handwritten mathematical expressions through style-preserving morphs

GRIN'01 No description on Graphics interface 2001
A Technique of Mathematical Expression Structure Analysis for the Handwriting Input System

ICDAR '99 Proceedings of the Fifth International Conference on Document Analysis and Recognition
Mathematics recognition using graph rewriting

ICDAR '95 Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 1) - Volume 1
A System for Real-time Recognition of Handwritten Mathematical Formulas

ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
Baseline Structure Analysis of Handwritten Mathematics Notation

ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
Mathematical Formula Recognition Using Virtual Link Network

ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
Performance Evaluation of a Robust Method for Mathematical Expression Recognition

ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
Principles of Compiler Design (Addison-Wesley series in computer science and information processing)

Principles of Compiler Design (Addison-Wesley series in computer science and information processing)
TEX and METAFONT: New directions in typesetting

TEX and METAFONT: New directions in typesetting

Recognition of On-line Handwritten Mathematical Formulas in the E-Chalk System

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Online Recognition of Chinese Characters: The State-of-the-Art

IEEE Transactions on Pattern Analysis and Machine Intelligence
MathPad2: a system for the creation and exploration of mathematical sketches

ACM SIGGRAPH 2004 Papers
Using Fuzzy Logic to Analyze Superscript and Subscript Relations in Handwritten Mathematical Expressions

ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Recognition of On-Line Handwritten Mathematical Expressions in the E-Chalk System - An Extension

ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Source transformation, analysis and generation in TXL

Proceedings of the 2006 ACM SIGPLAN symposium on Partial evaluation and semantics-based program manipulation
Animating Chinese paintings through stroke-based decomposition

ACM Transactions on Graphics (TOG)
The TXL source transformation language

Science of Computer Programming - The fourth workshop on language descriptions, tools, and applications (LDTA'04)
Structural analysis of handwritten mathematical expressions through fuzzy parsing

ACST'06 Proceedings of the 2nd IASTED international conference on Advances in computer science and technology
A co-operative methodology for automatic solutions to problems in indefinite integral calculus

AIA'06 Proceedings of the 24th IASTED international conference on Artificial intelligence and applications
MathPad2: a system for the creation and exploration of mathematical sketches

ACM SIGGRAPH 2006 Courses
Document zone content classification and its performance evaluation

Pattern Recognition
MathPad2: a system for the creation and exploration of mathematical sketches

ACM SIGGRAPH 2007 courses
Bibliography

ACM SIGGRAPH 2007 courses
Adapting handwriting recognition for applications in algebra learning

Proceedings of the international workshop on Educational multimedia and multimedia education
Calligraphic Interfaces: An initial evaluation of MathPad2: A tool for creating dynamic mathematical illustrations

Computers and Graphics
Progressive structural analysis for dynamic recognition of on-line handwritten mathematical expressions

Pattern Recognition Letters
A video-based text and equation editor for LaTeX

Engineering Applications of Artificial Intelligence
Math-Literate Computers

Calculemus '09/MKM '09 Proceedings of the 16th Symposium, 8th International Conference. Held as Part of CICM '09 on Intelligent Computer Mathematics
Efficient search strategy in structural analysis for handwritten mathematical expression recognition

Pattern Recognition
Towards a web-based progressive handwriting recognition environment for mathematical problem solving

Expert Systems with Applications: An International Journal
Cerno: Light-weight tool support for semantic annotation of textual documents

Data & Knowledge Engineering
TXL - A Language for Programming Language Tools and Applications

Electronic Notes in Theoretical Computer Science (ENTCS)
Classifying and resolving ambiguities in sketch-based interaction

International Journal of Virtual Technology and Multimedia
A model-based recognition engine for sketched diagrams

Journal of Visual Languages and Computing
An improved method of formula structural analysis

RSKT'08 Proceedings of the 3rd international conference on Rough sets and knowledge technology
GOAL: towards understanding of graphic objects from architectural to line drawings

GREC'09 Proceedings of the 8th international conference on Graphics recognition: achievements, challenges, and evolution
Is the iPad useful for sketch input?: a comparison with the tablet PC

Proceedings of the Eighth Eurographics Symposium on Sketch-Based Interfaces and Modeling
Structural analysis of mathematical formulae with verification based on formula description grammar

DAS'06 Proceedings of the 7th international conference on Document Analysis Systems
A rule-based approach to form mathematical symbols in printed mathematical expressions

MIWAI'11 Proceedings of the 5th international conference on Multi-Disciplinary Trends in Artificial Intelligence
HBF49 feature set: A first unified baseline for online symbol recognition

Pattern Recognition
A paradigm for handwriting-based intelligent tutors

International Journal of Human-Computer Studies
Understanding Digital Documents Using Gestalt Properties of Isothetic Components

International Journal of Digital Library Systems
A global learning approach for an online handwritten mathematical expression recognition system

Pattern Recognition Letters

Quantified Score

Hi-index	0.15

Visualization

Abstract

We describe a robust and efficient system for recognizing typeset and handwritten mathematical notation. From a list of symbols with bounding boxes the system analyzes an expression in three successive passes. The Layout Pass constructs a Baseline Structure Tree (BST) describing the two-dimensional arrangement of input symbols. Reading order and operator dominance are used to allow efficient recognition of symbol layout even when symbols deviate greatly from their ideal positions. Next, the Lexical Pass produces a Lexed BST from the initial BST by grouping tokens comprised of multiple input symbols; these include decimal numbers, function names, and symbols comprised of nonoverlapping primitives such as 驴=驴. The Lexical Pass also labels vertical structures such as fractions and accents. The Lexed BST is translated into $\LaTeX$. Additional processing, necessary for producing output for symbolic algebra systems, is carried out in the Expression Analysis Pass. The Lexed BST is translated into an Operator Tree, which describes the order and scope of operations in the input expression. The tree manipulations used in each pass are represented compactly using tree transformations. The compiler-like architecture of the system allows robust handling of unexpected input, increases the scalability of the system, and provides the groundwork for handling dialects of mathematical notation.