Figure-ground segmentation using factor graphs

Authors:
Huiying Shen;James Coughlan;Volodymyr Ivanchenko
Affiliations:
Smith-Kettlewell Eye Research Institute, Rehabilitation Engineering Research Center, 2318 Fillmore St., San Francisco, CA 94115, USA;Smith-Kettlewell Eye Research Institute, Rehabilitation Engineering Research Center, 2318 Fillmore St., San Francisco, CA 94115, USA;Smith-Kettlewell Eye Research Institute, Rehabilitation Engineering Research Center, 2318 Fillmore St., San Francisco, CA 94115, USA
Venue:
Image and Vision Computing
Year:
2009

Citing 19
Cited 1

Elements of information theory

Elements of information theory
Finding text in images

DL '97 Proceedings of the second ACM international conference on Digital libraries
Normalized Cuts and Image Segmentation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Stereo Matching Using Belief Propagation

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part II
Finding Deformable Shapes Using Loopy Belief Propagation

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part III
Understanding belief propagation and its generalizations

Exploring artificial intelligence in the new millennium
Text Identification in Noisy Document Images Using Markov Random Field

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Learning to Detect Scene Text Using a Higher-Order MRF with Belief Propagation

CVPRW '04 Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'04) Volume 6 - Volume 06
Beyond Pairwise Clustering

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
Pattern Recognition and Machine Learning (Information Science and Statistics)

Pattern Recognition and Machine Learning (Information Science and Statistics)
Finding Text in Natural Scenes by Figure-Ground Segmentation

ICPR '06 Proceedings of the 18th International Conference on Pattern Recognition - Volume 04
Deformable templates for face recognition

Journal of Cognitive Neuroscience
Grouping using factor graphs: an approach for finding text with a camera phone

GbRPR'07 Proceedings of the 6th IAPR-TC-15 international conference on Graph-based representations in pattern recognition
Detecting and reading text in natural scenes

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Multiscale conditional random fields for image labeling

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Man-made structure detection in natural images using a causal multiscale random field

CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition
Loopy belief propagation for approximate inference: an empirical study

UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Factor graphs and the sum-product algorithm

IEEE Transactions on Information Theory
Automatic text detection and tracking in digital video

IEEE Transactions on Image Processing

Text detection in images using sparse representation with discriminative dictionaries

Image and Vision Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Foreground-background segmentation has recently been applied [S.X. Yu, J. Shi, Object-specific figure-ground segregation, Computer Vision and Pattern Recognition (CVPR), 2003; S. Kumar, M. Hebert, Man-made structure detection in natural images using a causal multiscale random field, Computer Vision and Pattern Recognition (CVPR), 2003] to the detection and segmentation of specific objects or structures of interest from the background as an efficient alternative to techniques such as deformable templates [A.L. Yuille, Deformable templates for face recognition. Journal of Cognitive Neuroscience 3 (1) (1991)]. We introduce a graphical model (i.e. Markov random field)-based formulation of structure-specific figure-ground segmentation based on simple geometric features extracted from an image, such as local configurations of linear features, that are characteristic of the desired figure structure. Our formulation is novel in that it is based on factor graphs, which are graphical models that encode interactions among arbitrary numbers of random variables. The ability of factor graphs to express interactions higher than pairwise order (the highest order encountered in most graphical models used in computer vision) is useful for modeling a variety of pattern recognition problems. In particular, we show how this property makes factor graphs a natural framework for performing grouping and segmentation, and demonstrate that the factor graph framework emerges naturally from a simple maximum entropy model of figure-ground segmentation. We cast our approach in a learning framework, in which the contributions of multiple grouping cues are learned from training data, and apply our framework to the problem of finding printed text in natural scenes. Experimental results are described, including a performance analysis that demonstrates the feasibility of the approach.