Text Segmentation from Complex Background Using Sparse Representations

Authors:
W. Pan;T. Bui;C. Suen
Affiliations:
Concordia University;Concordia University;Concordia University
Venue:
ICDAR '07 Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 01
Year:
2007

Citing 0
Cited 2

Text extraction from graphical document images using sparse representation

DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
Fuzzy graph modeling for text segmentation from land map images

Proceedings of the Eighth Indian Conference on Computer Vision, Graphics and Image Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

A novel text segmentation method from complex back- ground is presented in this paper. The idea is inspired by the recent development in searching for the sparse sig- nal representation among a family of over-complete atoms, which is called a dictionary. We assume that the image un- der investigation is composed of two components: the fore- ground text and the complex background. We further as- sume that the latter can be modeled as a piece-wise smooth function. Then we choose two dictionaries, where the first one gives sparse representation to one component and non- sparse representation to another while the second one does the opposite. By looking for the sparse representations in each dictionary, we can decompose the image into the two composing components. After that, text segmentation can be easily achieved by applying simple thresholding to the text component. Preliminary experiments show some promising results.