Gaussian mixture modeling and learning of neighboring characters for multilingual text extraction in images

Authors:
Xiabi Liu;Hui Fu;Yunde Jia
Affiliations:
School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China;School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China and School of Information Technology, Beijing Forestry University, Beijing 100083, China;School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
Venue:
Pattern Recognition
Year:
2008

Abstract

This paper proposes an approach to extracting multilingual text from images, based on the statistical modeling and learning of neighboring characters. The case of three neighboring characters is represented by a Gaussian mixture model and discriminated from other cases by a corresponding 'pseudo-probability' defined under the Bayesian framework. Based on this model, text extraction is performed by labeling each connected component in the binary image as character or non-character according to its neighbors. A method based on mathematical morphology is introduced to detect and connect the separated parts of each character, and a method based on Voronoi partitioning is used to establish the neighborhoods of connected components. We further present a discriminative training algorithm based on the maximum-minimum similarity (MMS) criterion to estimate the parameters of the proposed text extraction approach. Experimental results on Chinese and English text extraction demonstrate the effectiveness of our approach trained with the MMS algorithm, which achieved a precision of 93.56% and a recall of 98.55% on the test data set. The experiments also show that MMS significantly improves overall performance compared with two influential training criteria, maximum likelihood (ML) and minimum classification error (MCE).
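
As a rough illustration of the modeling step described in the abstract, the following Python sketch scores a feature vector for a triple of neighboring connected components under a Gaussian mixture model and thresholds a bounded score derived from it. It is a minimal sketch under stated assumptions, not the authors' implementation: the abstract does not define the features, the covariance structure, or the exact form of the pseudo-probability. The diagonal covariance, the squashing function `1 - exp(-scale * density)`, the `scale` and `threshold` parameters, and all function names are hypothetical.

```python
import math


def gmm_log_density(x, weights, means, variances):
    """Log-density of feature vector x under a diagonal-covariance GMM.

    Assumption: diagonal covariances are used here only to keep the sketch
    short; the abstract does not specify the covariance structure.
    """
    log_terms = []
    for w, mu, var in zip(weights, means, variances):
        # log of the Gaussian normalizing constant for this component
        log_norm = -0.5 * sum(math.log(2.0 * math.pi * v) for v in var)
        # log of the exponential part (scaled squared distance to the mean)
        log_expo = -0.5 * sum((xi - mi) ** 2 / v
                              for xi, mi, v in zip(x, mu, var))
        log_terms.append(math.log(w) + log_norm + log_expo)
    # log-sum-exp over components for numerical stability
    peak = max(log_terms)
    return peak + math.log(sum(math.exp(t - peak) for t in log_terms))


def squash_to_unit_interval(density, scale=1.0):
    """Hypothetical monotone map from a non-negative density to [0, 1).

    This stands in for the 'pseudo-probability' only as an illustration;
    the abstract does not give its definition.
    """
    return 1.0 - math.exp(-scale * density)


def is_character_triple(x, weights, means, variances,
                        scale=1.0, threshold=0.5):
    """Label a triple of neighboring components as text if its score is high."""
    density = math.exp(gmm_log_density(x, weights, means, variances))
    return squash_to_unit_interval(density, scale) >= threshold


if __name__ == "__main__":
    # Toy one-dimensional model: a single standard-normal component.
    weights, means, variances = [1.0], [[0.0]], [[1.0]]
    print(is_character_triple([0.0], weights, means, variances, scale=5.0))
    print(is_character_triple([5.0], weights, means, variances, scale=5.0))
```

In a full pipeline, a score of this kind would be computed for each connected component together with its neighbors, and the mixture parameters, `scale`, and `threshold` would be learned from labeled data rather than set by hand.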