TextFinder: An Automatic System to Detect and Recognize Text In Images
IEEE Transactions on Pattern Analysis and Machine Intelligence
Automatic Caption Localization in Compressed Video
IEEE Transactions on Pattern Analysis and Machine Intelligence
Neural network-based text location in color images
Pattern Recognition Letters
Event detection in baseball video using superimposed caption recognition
Proceedings of the tenth ACM international conference on Multimedia
Progress in Camera-Based Document Image Analysis
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
ICDAR 2003 Robust Reading Competitions
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
IEEE Transactions on Pattern Analysis and Machine Intelligence
Text Detection in Images Based on Unsupervised Classification of Edge-based Features
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Live sports event detection based on broadcast video and web-casting text
MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Text detection, localization, and tracking in compressed video
Image Communication
Extraction of Text Objects in Video Documents: Recent Progress
DAS '08 Proceedings of the 2008 The Eighth IAPR International Workshop on Document Analysis Systems
A Robust System to Detect and Localize Texts in Natural Scene Images
DAS '08 Proceedings of the 2008 The Eighth IAPR International Workshop on Document Analysis Systems
An Efficient Edge Based Technique for Text Detection in Video Frames
DAS '08 Proceedings of the 2008 The Eighth IAPR International Workshop on Document Analysis Systems
A Laplacian Method for Video Text Detection
ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
A Gradient Difference Based Technique for Video Text Detection
ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
A Robust Wavelet Transform Based Technique for Video Text Detection
ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
Video text detection based on filters and edge features
ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Fast and robust text detection in images and video frames
Image and Vision Computing
Text detection in images using sparse representation with discriminative dictionaries
Image and Vision Computing
Detecting and reading text in natural scenes
CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Automatic text detection and tracking in digital video
IEEE Transactions on Image Processing
Automatic detection and recognition of signs from natural scenes
IEEE Transactions on Image Processing
Localizing and segmenting text in images and videos
IEEE Transactions on Circuits and Systems for Video Technology
An automatic performance evaluation protocol for video text detection algorithms
IEEE Transactions on Circuits and Systems for Video Technology
A comprehensive method for multilingual video text detection, localization, and extraction
IEEE Transactions on Circuits and Systems for Video Technology
Hi-index | 0.01 |
In the field of multimedia retrieval in video, text frame classification is essential for text detection, event detection, event boundary detection, etc. We propose a new text frame classification method that introduces a combination of wavelet and median moment with k-means clustering to select probable text blocks among 16 equally sized blocks of a video frame. The same feature combination is used with a new Max-Min clustering at the pixel level to choose probable dominant text pixels in the selected probable text blocks. For the probable text pixels, a so-called mutual nearest neighbor based symmetry is explored with a four-quadrant formation centered at the centroid of the probable dominant text pixels to know whether a block is a true text block or not. If a frame produces at least one true text block then it is considered as a text frame otherwise it is a non-text frame. Experimental results on different text and non-text datasets including two public datasets and our own created data show that the proposed method gives promising results in terms of recall and precision at the block and frame levels. Further, we also show how existing text detection methods tend to misclassify non-text frames as text frames in term of recall and precision at both the block and frame levels.