A novel mutual nearest neighbor based symmetry for text frame classification in video

Authors:
Palaiahnakote Shivakumara;Anjan Dutta;Trung Quy Phan;Chew Lim Tan;Umapada Pal
Affiliations:
School of Computing, National University of Singapore, Singapore;Computer Vision Center, Universitat Autònoma de Barcelona, Barcelona, Spain;School of Computing, National University of Singapore, Singapore;School of Computing, National University of Singapore, Singapore;Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, India
Venue:
Pattern Recognition
Year:
2011

Citing 27
Cited 0

TextFinder: An Automatic System to Detect and Recognize Text In Images

IEEE Transactions on Pattern Analysis and Machine Intelligence
Automatic Caption Localization in Compressed Video

IEEE Transactions on Pattern Analysis and Machine Intelligence
Neural network-based text location in color images

Pattern Recognition Letters
Event detection in baseball video using superimposed caption recognition

Proceedings of the tenth ACM international conference on Multimedia
Progress in Camera-Based Document Image Analysis

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
ICDAR 2003 Robust Reading Competitions

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
Texture-Based Approach for Text Detection in Images Using Support Vector Machines and Continuously Adaptive Mean Shift Algorithm

IEEE Transactions on Pattern Analysis and Machine Intelligence
Text Detection in Images Based on Unsupervised Classification of Edge-based Features

ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Live sports event detection based on broadcast video and web-casting text

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Text detection, localization, and tracking in compressed video

Image Communication
Structuring low-quality videotaped lectures for cross-reference browsing by video text analysis

Pattern Recognition
Extraction of Text Objects in Video Documents: Recent Progress

DAS '08 Proceedings of the 2008 The Eighth IAPR International Workshop on Document Analysis Systems
A Robust System to Detect and Localize Texts in Natural Scene Images

DAS '08 Proceedings of the 2008 The Eighth IAPR International Workshop on Document Analysis Systems
An Efficient Edge Based Technique for Text Detection in Video Frames

DAS '08 Proceedings of the 2008 The Eighth IAPR International Workshop on Document Analysis Systems
A Laplacian Method for Video Text Detection

ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
A Gradient Difference Based Technique for Video Text Detection

ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
A Robust Wavelet Transform Based Technique for Video Text Detection

ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
Video text detection based on filters and edge features

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Fast and robust text detection in images and video frames

Image and Vision Computing
Accurate video text detection through classification of low and high contrast images

Pattern Recognition
Text detection in images using sparse representation with discriminative dictionaries

Image and Vision Computing
Detecting and reading text in natural scenes

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Automatic text detection and tracking in digital video

IEEE Transactions on Image Processing
Automatic detection and recognition of signs from natural scenes

IEEE Transactions on Image Processing
Localizing and segmenting text in images and videos

IEEE Transactions on Circuits and Systems for Video Technology
An automatic performance evaluation protocol for video text detection algorithms

IEEE Transactions on Circuits and Systems for Video Technology
A comprehensive method for multilingual video text detection, localization, and extraction

IEEE Transactions on Circuits and Systems for Video Technology

Quantified Score

Hi-index	0.01

Visualization

Abstract

In the field of multimedia retrieval in video, text frame classification is essential for text detection, event detection, event boundary detection, etc. We propose a new text frame classification method that introduces a combination of wavelet and median moment with k-means clustering to select probable text blocks among 16 equally sized blocks of a video frame. The same feature combination is used with a new Max-Min clustering at the pixel level to choose probable dominant text pixels in the selected probable text blocks. For the probable text pixels, a so-called mutual nearest neighbor based symmetry is explored with a four-quadrant formation centered at the centroid of the probable dominant text pixels to know whether a block is a true text block or not. If a frame produces at least one true text block then it is considered as a text frame otherwise it is a non-text frame. Experimental results on different text and non-text datasets including two public datasets and our own created data show that the proposed method gives promising results in terms of recall and precision at the block and frame levels. Further, we also show how existing text detection methods tend to misclassify non-text frames as text frames in term of recall and precision at both the block and frame levels.