Text segmentation in mixed-mode images using classification trees and transform tree-structured vector quantization

Authors:
K. O. Perlmutter;N. Chaddha;J. B. Buckheit;R. M. Gray;R. A. Olshen
Affiliations:
Inf. Syst. Lab., Stanford Univ., CA, USA;-;-;-;-
Venue:
ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 04
Year:
1996

Citing 0
Cited 2

Object-Based Classification of Mixed-Mode Images

PCM '01 Proceedings of the Second IEEE Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Identifying Story and Preview Images in News Web Pages

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2

Quantified Score

Hi-index	0.00

Visualization

Abstract

Multimedia applications such as educational videos and color facsimile contain images that are rich in both textual and continuous tone data. Because these two types of data have different properties, segmentation of the images into text and continuous tone data can improve compression by allowing different compression parameters or even algorithms to be employed on the different types. We propose and compare algorithms that use classification trees (CLTR) or tree-structured vector quantization (TSVQ) for block-based classification in mixed-mode images. We also examine different types of features that can be used in these classifiers. The results show that using linear transform features with either the CLTR or TSVQ can be effective for accurate text classification. In addition, the results indicate that combining these classifiers with another TSVQ that is designed simultaneously to minimize both compression and classification error can provide better classification than does either system alone.