An Approach for Processing Mathematical Expressions in Printed Document
DAS '98 Selected Papers from the Third IAPR Workshop on Document Analysis Systems: Theory and Practice
Mathematical Formulas Extraction
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Recognition of On-line Handwritten Mathematical Formulas in the E-Chalk System
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Automated Segmentation of Math-Zones from Document Images
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
MathPad2: a system for the creation and exploration of mathematical sketches
ACM SIGGRAPH 2004 Papers
Semantic Analysis of Matrix Structures
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
MathPad2: a system for the creation and exploration of mathematical sketches
ACM SIGGRAPH 2006 Courses
MathPad2: a system for the creation and exploration of mathematical sketches
ACM SIGGRAPH 2007 courses
ACM SIGGRAPH 2007 courses
A rule-based approach to form mathematical symbols in printed mathematical expressions
MIWAI'11 Proceedings of the 5th international conference on Multi-Disciplinary Trends in Artificial Intelligence
A Unified Algorithm for Identification of Various Tabular Structures from Document Images
International Journal of Digital Library Systems
Hi-index | 0.00 |
We present a system to segment and recognize texts and mathematical expressions in a document. The system can be divided into six stages: page segmentation and labeling, character segmentation, feature extraction, character recognition, expression formation, and error correction and expression extraction. In expression formation, we build a symbol relation tree for each text line to represent the relationships among the symbols in the text line. Some heuristic rules based on the primitive tokens are used to correct the recognition errors in a text line. We extract all mathematical expressions according to some basic expression forms. Our database consists of 190 symbols in the current stage. The average recognition rate is about 96.16%.