Mathematical Formulas Extraction
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
INFTY: an integrated OCR system for mathematical documents
Proceedings of the 2003 ACM symposium on Document engineering
A bottom-up OCR system for mathematical formulas recognition
ICIC'06 Proceedings of the 2006 international conference on Intelligent Computing - Volume Part I
Hi-index | 0.00 |
We present a method for automatic extraction of mathematical formulas from images of documents without character recognition. Formula extraction is first done by location of its most significant symbols, then extension to adjoining symbols using contextual rules until delimitation of the whole formula space. Mathematical symbols labeling is realized from models created at the learning stage using fuzzy logic. From the experiments, we found that the average rate of primary labeling of mathematical symbols is about 95.3%.The obtained results have demonstrated the applicability of our system since 90% of mathematical formulas are well extracted from documents printed with high quality.