Elements of information theory
Elements of information theory
DL '97 Proceedings of the second ACM international conference on Digital libraries
Normalized Cuts and Image Segmentation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Stereo Matching Using Belief Propagation
ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part II
Finding Deformable Shapes Using Loopy Belief Propagation
ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part III
Understanding belief propagation and its generalizations
Exploring artificial intelligence in the new millennium
Text Identification in Noisy Document Images Using Markov Random Field
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Learning to Detect Scene Text Using a Higher-Order MRF with Belief Propagation
CVPRW '04 Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'04) Volume 6 - Volume 06
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
Pattern Recognition and Machine Learning (Information Science and Statistics)
Pattern Recognition and Machine Learning (Information Science and Statistics)
Finding Text in Natural Scenes by Figure-Ground Segmentation
ICPR '06 Proceedings of the 18th International Conference on Pattern Recognition - Volume 04
Deformable templates for face recognition
Journal of Cognitive Neuroscience
Grouping using factor graphs: an approach for finding text with a camera phone
GbRPR'07 Proceedings of the 6th IAPR-TC-15 international conference on Graph-based representations in pattern recognition
Detecting and reading text in natural scenes
CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Multiscale conditional random fields for image labeling
CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Man-made structure detection in natural images using a causal multiscale random field
CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition
Loopy belief propagation for approximate inference: an empirical study
UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Factor graphs and the sum-product algorithm
IEEE Transactions on Information Theory
Automatic text detection and tracking in digital video
IEEE Transactions on Image Processing
Text detection in images using sparse representation with discriminative dictionaries
Image and Vision Computing
Hi-index | 0.00 |
Foreground-background segmentation has recently been applied [S.X. Yu, J. Shi, Object-specific figure-ground segregation, Computer Vision and Pattern Recognition (CVPR), 2003; S. Kumar, M. Hebert, Man-made structure detection in natural images using a causal multiscale random field, Computer Vision and Pattern Recognition (CVPR), 2003] to the detection and segmentation of specific objects or structures of interest from the background as an efficient alternative to techniques such as deformable templates [A.L. Yuille, Deformable templates for face recognition. Journal of Cognitive Neuroscience 3 (1) (1991)]. We introduce a graphical model (i.e. Markov random field)-based formulation of structure-specific figure-ground segmentation based on simple geometric features extracted from an image, such as local configurations of linear features, that are characteristic of the desired figure structure. Our formulation is novel in that it is based on factor graphs, which are graphical models that encode interactions among arbitrary numbers of random variables. The ability of factor graphs to express interactions higher than pairwise order (the highest order encountered in most graphical models used in computer vision) is useful for modeling a variety of pattern recognition problems. In particular, we show how this property makes factor graphs a natural framework for performing grouping and segmentation, and demonstrate that the factor graph framework emerges naturally from a simple maximum entropy model of figure-ground segmentation. We cast our approach in a learning framework, in which the contributions of multiple grouping cues are learned from training data, and apply our framework to the problem of finding printed text in natural scenes. Experimental results are described, including a performance analysis that demonstrates the feasibility of the approach.