Videotext OCR Using Hidden Markov Models

Authors:
Baback Elmieh
Affiliations:
-
Venue:
ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
Year:
2001

Citing 0
Cited 2

Caption Text Recognition in Video Frames by MAP Matching

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Performance Improvements to the BBN Byblos OCR System

ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition

Quantified Score

Hi-index	0.00

Visualization

Abstract

Abstract: In this paper we present a method for performing Optical Character Recognition (OCR) of text in video images. Recognition of videotext is a challenging problem due to various factors such as the presence of rich, dynamic backgrounds, low resolution, color, etc. Our strategy is to process the video images to produce high-resolution binarized text images that resemble printed text. We describe a novel clustering and relaxation procedure that combines stroke and color information to separate the text from the background. The binarized text image is then recognized with our Byblos OCR engine [5][6] using hidden Markov models trained on similar data. We present experimental results on a video-data corpus collected from broadcast news programs. Currently the system delivers a character error rate of 8.3% on independent multi-font test data from this corpus.